42. You are in a room full of overlapping cron jobs.
You can hear the screams of a dying MySQL server.
An Oracle vendor is here.
To the West, a door is marked “Map/Reduce”
To the East, a door is marked “Stream Processing”
62. #CloudstackCephDay
Publish & Subscribe
● Records are produced on topics
● Topics have a predefined number of partitions
● Records have a key which determines their
partition
63. #CloudstackCephDay
● Consumers get assigned a set of partitions
● Consumers store their last consumed offset
● Brokers own partitions, handle replication
78. #CloudstackCephDay
What about batch?
● Streaming doesn’t work for everything
● Sometimes throughput matters more than latency
● Building models in batch, applying with stream
processing