3. A man, a plan …
Akka recap, distsys background and Akka Cluster basics
We’ll get an overview of how Akka does clustering
Cluster Sharding
Shard actors across cluster
Distributed Data
Eventual consistency
Distributed PubSub
A message bus across the cluster
Cluster Singleton
How to introduce a single point of failure
Akka Streams
Async, backpressured streams
Alpakka
Enterprise integration on top of streams
Akka Persistence
Event sourcing with actors
Akka HTTP
Async streaming HTTP server and client
Cluster Tools
8. Distributed Systems
"A distributed system is one in which the failure of a computer you didn't even know existed can render your own computer unusable"
– Leslie Lamport
9. Why is it so hard?
The Joys of Computer Networks:
Reliability: power failure, old network equipment, network congestion, coffee in the router, rodents, that guy in the IT dept., DDoS attacks…
Latency: loopback vs local net vs shared congested local net vs internet
Bandwidth: again, loopback vs local vs shared local vs internet
10. Why do it, if it is so hard?
Data or processing doesn't fit a single machine
Many objects that should be kept in memory. Many not so powerful servers can be cheaper than one supercomputer.
Elasticity
Being able to scale in (fewer servers) and out (more servers) depending on load. Not paying for servers unless you need them.
Resilience
Building systems that keep working in the face of failures, or degrade gracefully.
11. Actor Model vs Network
Interaction already modelled as immutable messages
Data travels over the network in packets; changes have to be explicitly sent back.
At most once
Data reaches a node on the other side at most once, but can be lost: already part of the model!
A recipient of a message can reply directly to the sender
Regardless of whether there were intermediate recipients of the message
Messages not limited to request-response
Messages can flow in either direction when two systems are connected.
27. Seed nodes
!First seed node
if none of the other nodes in the list are in
the cluster - joins itself to form cluster
Rest of seed nodes
just pings all other nodes and joins as soon
as one is in the cluster responds
Join
28. What would happen if we mess it up?
"I'm the leader, this is the cluster!"
"No! I'm the leader, this is the cluster!"
Two separate clusters form: a split brain.
29. User API of Cluster
Node details
What roles am I in, what is my address
Join, Leave, Down
Programmatic control over cluster membership
Register listeners for cluster events
Every time the cluster state changes the listening
actor will get a message
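A minimal sketch of registering such a listener with the classic Cluster extension, following the pattern in the Akka docs; the class name ClusterListener and the log lines are illustrative:

```java
import akka.actor.AbstractActor;
import akka.cluster.Cluster;
import akka.cluster.ClusterEvent;

// Illustrative actor that subscribes to cluster membership events.
public class ClusterListener extends AbstractActor {
  private final Cluster cluster = Cluster.get(getContext().getSystem());

  @Override
  public void preStart() {
    // Replay current state as events, then receive live events
    cluster.subscribe(getSelf(), ClusterEvent.initialStateAsEvents(),
        ClusterEvent.MemberEvent.class, ClusterEvent.UnreachableMember.class);
  }

  @Override
  public void postStop() {
    cluster.unsubscribe(getSelf());
  }

  @Override
  public Receive createReceive() {
    return receiveBuilder()
        .match(ClusterEvent.MemberUp.class,
            up -> System.out.println("Member up: " + up.member()))
        .match(ClusterEvent.MemberRemoved.class,
            removed -> System.out.println("Member removed: " + removed.member()))
        .match(ClusterEvent.UnreachableMember.class,
            un -> System.out.println("Unreachable: " + un.member()))
        .build();
  }
}
```

Every time membership changes, the subscribed actor receives the matching event message.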
30. Cluster through config
application.conf:
akka {
  actor.provider = cluster
  remote.artery {
    enabled = true
    transport = tcp
    canonical.hostname = 192.168.0.1
    canonical.port = 25520
  }
  cluster.seed-nodes = [
    "akka://cluster@192.168.0.1:25520",
    "akka://cluster@192.168.0.2:25520"
  ]
}
final ActorSystem system =
    ActorSystem.create("cluster");
31. Programmatic join
final ActorSystem node1 = ActorSystem.create("cluster");
final ActorSystem node2 = ActorSystem.create("cluster");
final ActorSystem node3 = ActorSystem.create("cluster");
// joins itself to form cluster
final Cluster node1Cluster = Cluster.get(node1);
node1Cluster.join(node1Cluster.selfAddress());
// joins the cluster through the one node in the cluster
final Cluster node2Cluster = Cluster.get(node2);
node2Cluster.join(node1Cluster.selfAddress());
// subsequent nodes can join through any node that is already in the cluster
final Cluster node3Cluster = Cluster.get(node3);
node3Cluster.join(node2Cluster.selfAddress());
39. Distributed Data
CRDTs: Conflict-free Replicated Data Types
Allow updates on any node, then spread that update to the other cluster nodes through gossip for eventual consistency
Note: does not fit every problem!
Online docs for Distributed Data
40. Special requirements
Commutative
Order of operation does not matter
like 3 + 4 = 4 + 3
Associative
Grouping operations does not matter
like 3 + (4 + 5) = (3 + 4) + 5
Monotonic
Absence of rollbacks, "only growing" (but we can do sneaky tricks)
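These three properties are what make merging replicas safe in any order. A plain-Java sketch of the idea behind a grow-only counter (not Akka's GCounter, just the merge principle): each node increments its own slot, and merge takes the element-wise max, which is commutative, associative and monotonic:

```java
import java.util.HashMap;
import java.util.Map;

// Plain-Java sketch of a grow-only counter keyed by node id.
// Not Akka's GCounter implementation, just the merge idea behind it.
class GCounterSketch {
  final Map<String, Long> counts = new HashMap<>();

  // Each node only ever increments its own slot.
  void increment(String nodeId) {
    counts.merge(nodeId, 1L, Long::sum);
  }

  // Merge = element-wise max: commutative, associative, monotonic.
  GCounterSketch merge(GCounterSketch other) {
    GCounterSketch result = new GCounterSketch();
    result.counts.putAll(this.counts);
    other.counts.forEach((node, value) ->
        result.counts.merge(node, value, Long::max));
    return result;
  }

  // The counter's value is the sum over all node slots.
  long value() {
    return counts.values().stream().mapToLong(Long::longValue).sum();
  }
}
```

Because merge is element-wise max, `a.merge(b)` and `b.merge(a)` give the same value, and re-merging the same state changes nothing.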
41. Built in data structures
Counters
GCounter - grow only, PNCounter - increment and decrement counter
Sets
GSet - grow only, ORSet - observed remove set
Maps
ORMap - observed remove map, ORMultiMap - observed remove multi map, PNCounterMap - positive negative counter map, LWWMap - last writer wins map
Flags and Register
Flag - toggle once boolean, LWWRegister - last writer wins register
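As one example of the register semantics, a plain-Java sketch of last-writer-wins (not Akka's LWWRegister, which also breaks timestamp ties deterministically; this just shows the idea): every write carries a timestamp, and merge keeps the newer write:

```java
// Plain-Java sketch of last-writer-wins semantics.
// Akka's LWWRegister additionally breaks timestamp ties deterministically.
class LwwSketch<T> {
  final T value;
  final long timestamp;

  LwwSketch(T value, long timestamp) {
    this.value = value;
    this.timestamp = timestamp;
  }

  // Merge keeps the write with the larger timestamp.
  LwwSketch<T> merge(LwwSketch<T> other) {
    return other.timestamp > this.timestamp ? other : this;
  }
}
```

Note the trade-off: a concurrent older write is silently discarded, which is why LWW types are only a fit when "newest wins" is acceptable.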
60. Multi-DC support in cluster tools
Sharding
Per DC, proxy defaults to the local DC, can be started with a data-center setting
Distributed Data and PubSub
No special support (optimisation possibilities)
Cluster Singleton
Is a singleton per DC
Multi-DC Persistence
Commercial feature allowing for active-active or active-passive eventually consistent persistence between datacenters (docs)
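For reference, a hedged HOCON sketch of how a node declares which data center it belongs to (the setting name is from the Akka multi-DC cluster docs; the value dc-east is illustrative):

```hocon
akka {
  cluster {
    # Which data center this node belongs to; nodes with the same
    # value are treated as one data center by sharding and singleton.
    multi-data-center.self-data-center = "dc-east"
  }
}
```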