6. Apache Kafka
A high-throughput distributed messaging system.
“Distributed Publish-Subiscribe
messaging system, Hight
Troughtput, Persistent,
Partitioning Messages, Parallel
data lod into Hadoop.”
7. Apache Kafka
A high-throughput distributed messaging system.
Pure offline log
processing are:
Real-time,
High Performance,
Hight Troughtput
Lightweight business logic(and
not lots) to deliver that.
8. Apache Kafka
A high-throughput distributed messaging system.
Pure Messasing
issues(ActiveMQ/RabbitMQ):
NO API for Batching,
Transactional,
No persistence means, multiple
consumers are limited by arch.
12. Apache Kafka
A high-throughput distributed messaging system.
Supports *Activity Stream Processing*, like: Facebook/Scribe and Apache Flume.
*Activity Stream Processing* => Collecting, Aggregating, larges
ammout on data, very present on social business. Later you often do
offline analysys with hadoop. A.K.A Offline log Aggregation.