Dissolving the Problem (Making an ACID-Compliant Database Out of Apache Kafka®)

Dissolving
Problem
the
@tlberglund
(making an ACID-compliant database out of Apache Kafka®)

https://www.amazon.com/dp/1449373321

A program that
remembers things.

A program that remembers
things and has a data
model.

A program that remembers things and
has a data model and ACID
transactional properties.

Atomicity
Consistency
Isolation
Durability

picture of tape
https://www.ﬂickr.com/photos/phrenologist/3252001011/

picture of disks
https://www.ﬂickr.com/photos/philipus/29711988683

Broker 4Broker 3Broker 2Broker 1
Topic 1
Partition 1
Topic 1
Partition 2
Topic 1
Partition 3
Topic 1
Partition 4
Topic 1
Partition 1
Topic 1
Partition 1
Topic 1
Partition 2
Topic 1
Partition 2
Topic 1
Partition 3
Topic 1
Partition 3
Topic 1
Partition 4
Topic 1
Partition 4
Topic 1
Partition 4

Database Transaction
BEGIN;
UPDATE account 
SET balance += 100 
WHERE username = tlberglund;
UPDATE account  
SET balance -= 100  
WHERE username = gwenshap;
COMMIT;

process 1
process 2
Is there a
Tim?
nope
Is there a
Tim?
Awesome, make
a Tim
nope No prob!
Okay, hoss
Cool, make a
Tim

Consistency
• Invariants/constraints
• Unique usernames
• Account balances greater than zero

K
V
• Log of events
• Strict ordering guarantee
• Constant-time reads and
writes
• Persistent on disk

…
…
…
Partition 0
Partition 1
Partition 2
K
V
Run the key through a
hash function

…
…
…
Partition 0
Partition 1
Partition 2
K
V

• Provides scalable writes,
storage, and consumption
• Ordering is within partition
only
• Key selection becomes a data
modeling concern
…
…
…

Topic 1
Partition 1
Topic 1
Partition 2
Topic 1
Partition 3
Topic 1
Partition 4

Topic 1
Partition 1
Topic 1
Partition 2
Topic 1
Partition 3
Topic 1
Partition 4
Topic 1
Partition 1
Topic 1
Partition 1

Topic 1
Partition 1
Topic 1
Partition 2
Topic 1
Partition 3
Topic 1
Partition 4
Topic 1
Partition 1
Topic 1
Partition 1
Topic 1
Partition 2
Topic 1
Partition 2

Topic 1
Partition 1
Topic 1
Partition 2
Topic 1
Partition 3
Topic 1
Partition 4
Topic 1
Partition 1
Topic 1
Partition 1
Topic 1
Partition 2
Topic 1
Partition 2
Topic 1
Partition 3
Topic 1
Partition 3

…
…
…
partition 0
partition 1
partition 2
Partitioned Topic
producer

…
…
…
partition 0
partition 1
partition 2
Partitioned Topic
producer
• A client application
• Puts messages into topics
• Handles partitioning, network
protocol
• Java, Go, .NET, C/C++, Python
• Also every other language

consumer A
…
…
…
partition 0
partition 1
partition 2
Partitioned Topic

consumer A
consumer B
…
…
…
partition 0
partition 1
partition 2
Partitioned Topic

consumer A
consumer B
…
…
…
partition 0
partition 1
partition 2
Partitioned Topic
consumer A

consumer A
consumer B
…
…
…
partition 0
partition 1
partition 2
Partitioned Topic
consumer A
consumer A

consumer A
consumer B
…
…
…
partition 0
partition 1
partition 2
Partitioned Topic
consumer A
consumer A
• A client application
• Reads messages from topics
• Horizontally, elastically scalable
(if stateless)
• Java, Go, .NET, C/C++, Python,
everything else

Stream
Processingapps
rdbms
nosql
dwh/
hadoop

Kafka
Connect
broker
broker
broker
broker
data
source
data
sink
Kafka
Connect

Connect Worker Connect Worker Connect Worker
task task task task task

{
“name": "jdbc_source_postgres_movies",
"config": {
"connector.class": "io.confluent.connect.jdbc.JdbcSourceConnector",
"connection.url": “jdbc:postgresql://database:5432/WORKSHOP…”,
"table.whitelist": "movies",
"mode": "incrementing",
"incrementing.column.name": "id",
"validate.non.null": "false",
"topic.prefix": "postgres-"
}
}

• Declarative data integration
framework
• Extensive community library of
connectors
• Horizontally scalable and fault-
tolerant
• Pretty easy to extend
Kafka
Connect
broker
broker
broker
broker
data
source
data
sink
Kafka
Connect

consumer A
consumer A
consumer A

Turn streams into tables
Enrich a stream with a table
Aggregate streams
Join one stream with another
Scale stateful applications

Functional Java API
Abstractions for streams and tables
Scalable, fault-tolerant state

Streams
Application
Streams
Application
Streams
Application

• Java API
• Filter, join, aggregate, etc.
• Locates stream processing
with your application
• Scales like a Consumer Group
(but better!)
KTable<Long, Movie> movies =
builder.table(“movies”,
Materialized.
<Long, Movie,KeyValueStore<
Bytes, byte[]>>
as(“movies-store")
.withValueSerde(movieSerde)
.withKeySerde(Serdes.Long())
);

CREATE TABLE movie_ratings AS
SELECT title,
SUM(rating)/COUNT(rating) AS avg_rating,
COUNT(rating) AS num_ratings
FROM ratings
LEFT OUTER JOIN movies
ON ratings.movie_id = movies.movie_id
GROUP BY title;

producer
consumer
KSQL Cluster
KSQL
Server
KSQL
Server

• Declarative stream processing
language
• Provides stream and table
abstractions
• Filter, join, aggregate
• Run on horizontally scalable
KSQL cluster
CREATE TABLE movie_ratings AS
SELECT title,
SUM(rating)/COUNT(rating) AS avg_rating,
COUNT(rating) AS num_ratings
FROM ratings
LEFT OUTER JOIN movies
ON ratings.movie_id = movies.movie_id
GROUP BY title;

process 1
process 2
Is there a
Tim?
nope
Is there a
Tim?
Awesome, make
a Tim
nope No prob!
Okay, hoss
Cool,
make a Tim

process 1
process 2
I would like a
Tim
I would like a
Tim
Tim-1
Tim-2

process 1
process 2
Tim-1
Tim-2
Users
Ale
Yeva
Vik

process 1
process 2
Tim-1
Tim-2
Users
Ale
Yeva
Vik
Tim

process 1
process 2
Users
Ale
Yeva
Vik
Tim
yesIs there a
Tim?
Is there a
Tim? yes

You are not just
writing microservices.

You are building an
inside-out database

Thank
You!
@tlberglund
http://slackpass.io/confluentcommunity
http://confluent.io/ksql

25%OFF
Standard Priced
Conference Pass
Confluent Community
Discount Code
KS19Meetup

Dissolving the Problem (Making an ACID-Compliant Database Out of Apache Kafka®)

Recommandé

Recommandé

Contenu connexe

Tendances

Tendances (20)

Similaire à Dissolving the Problem (Making an ACID-Compliant Database Out of Apache Kafka®)

Similaire à Dissolving the Problem (Making an ACID-Compliant Database Out of Apache Kafka®) (20)

Plus de confluent

Plus de confluent (20)

Dernier

Dernier (20)

Dissolving the Problem (Making an ACID-Compliant Database Out of Apache Kafka®)