Kiwi.com provides a powerful flight, train, and bus search engine driven by volatile data: entries expire within just a couple of days. The compute engine loads data from the cluster every couple of hours, running in a blue-green deployment and conducting several simultaneous A/B tests. To keep full table scans predictable, Kiwi.com implemented a dedicated cache to store post-processed results from the database. Where Cassandra's limitations forced the team to implement a custom scanning service that reads newly created SSTables and streams updates to the cache, Scylla made performant full table scans easy and safe. This talk covers our Cassandra-to-Scylla migration, benchmarking on GCP and bare-metal OVH, and the resulting performance numbers, with a primary focus on full table scans.
2. Presenter bio
Mathematician who turned to the Dark Side.
Working in the travel industry for 5 years now.
Currently principal engineer at Kiwi.com - big data,
distributed systems, fancy algorithmics, C++ devel...
4. What is Kiwi.com
▪ “Provides a fare aggregator, metasearch engine and
booking for airline tickets.”
▪ Basically helps you figure out where you can fly within
your budget.
▪ Virtual interlining
6. What is Kiwi.com
▪ So we store some flights data…
▪ ±100 000 flights/day -> ±36M flights/year
▪ That’s a lot of data right?
Even your phone can store that...
7. So we store some flights data
▪ Combinations...
▪ ±7G (billion) flight entries
▪ 350 000 writes/sec, 600 000 reads/sec
▪ 20TB in multiple replicas
Your phone can’t store that...
8. How we store the data
Rocky road to perfection...
11. Stage three
▪ End of dark ages
▪ Distributed, scalable
▪ Data replication
▪ Much more performance
12. Stage Scylla
▪ Currently migrating
▪ Allows us to scale even further
▪ Allows us to ditch many workarounds we had to
implement because of Cassandra
▪ More in Martin’s talk
13. Scylla migration - fun fact
▪ Our use case is very read-intensive (600 000 reads/sec)
▪ Many of these reads can be cached
▪ Cassandra uses system cache - very slow
▪ Our current solution: a wall of Redis caches in front of Cassandra
14. Scylla migration - fun fact
▪ Scylla vs Cassandra benchmarking
▪ Same data, same cluster, same read structure
▪ Scylla - 900k reads/s vs Cassandra - 40k reads/s
16. Motivation
▪ Precomputation engine needs flights data
▪ Downloading all the data every hour
▪ + Secondary production, testing…
▪ = A lot of stress on production database
17. Motivation
▪ Stages 1 and 2 - direct downloading - Worked well
▪ Stage 3 - Cassandra + much more data
• Token ranges
• CPU overload
• Massive latency spikes over the whole system
18. Why it failed
▪ Not very efficient implementation... Java...
▪ Re-reading all the data - very inefficient
▪ Idea - add “last_update_timestamp” column
• Select only recently updated entries
• Didn’t work - Cassandra still has to go through all the data
If only we could efficiently read only the
recently updated data...
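The failure above can be illustrated with a short sketch (hypothetical data, not Kiwi.com code): filtering on a plain column like `last_update_timestamp` does not narrow the scan, because Cassandra has no index suited to the predicate. The server still reads every row and discards the stale ones, so the cost is proportional to all rows, not to the recently updated ones.

```python
# Simulated table: 100k rows, timestamps cycling 0..999.
rows = [{"id": i, "last_update_timestamp": i % 1000} for i in range(100_000)]

visited = 0
recent = []
for row in rows:                       # the server still walks ALL rows...
    visited += 1
    if row["last_update_timestamp"] > 995:
        recent.append(row)             # ...to return only a tiny slice

assert visited == len(rows)            # full scan regardless of selectivity
```

Only 400 of 100,000 rows match, yet all 100,000 were read - which is why the `last_update_timestamp` idea alone could not help.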
19. Opening Pandora's box
▪ Cassandra flushes new data from memory to disk,
MemTable -> SSTable
▪ Every node holds multiple SSTables for each column family
▪ SSTables are immutable
And so we got an idea...
20. Opening Pandora's box
▪ Create a service that can detect and parse all newly created
SSTables - Splitters
▪ Stream the data to our distributed custom cache storage -
Mergers
▪ Feed our preprocessing engine with data from Mergers
▪ If Splitters are efficient, we can read the flights data with zero
impact on Cassandra’s performance
22. Splitters
▪ Step 1 - Reverse-engineer SSTable format from Cassandra src
▪ Step 2 - Implement fast SSTable parser in C++
▪ Step 3 - Implement mechanism for new SSTable detection
▪ Step 4 - Stream all the data to Mergers - including the
“last_update_timestamp”
▪ Step 5 - deploy the Splitter on every Cassandra node
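Step 3 above can be sketched as a simple polling loop. The real Splitter is a C++ service on every Cassandra node; this Python illustration only assumes that Cassandra names the data component of each flushed SSTable `<...>-Data.db` and that SSTables are immutable, so a file name seen once never needs re-checking.

```python
import os

def new_sstables(data_dir, seen):
    """Return paths of SSTable data files not parsed yet; update `seen`.

    Because SSTables are immutable, remembering file names is enough -
    a file never changes after it appears.
    """
    fresh = []
    for name in sorted(os.listdir(data_dir)):
        if name.endswith("-Data.db") and name not in seen:
            seen.add(name)
            fresh.append(os.path.join(data_dir, name))
    return fresh
```

In the Splitter, each fresh file would be handed to the C++ parser and the rows (including `last_update_timestamp`) streamed on to the Mergers.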
23. Mergers
▪ Distributed storage, accepting data from Splitters
▪ Sharding based on logical key in our data - useful for
precomputation and streaming to our Engine
▪ Replication factor of 1 - If any node fails, the remaining nodes have
to take its shards - restream everything!
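The sharding scheme above can be sketched as follows (illustrative names, not the real Merger code): each entry is placed by a hash of its logical key, so all data for one key lands on one Merger, and with replication factor 1 every shard that changes owner after a node failure must be restreamed from scratch.

```python
import hashlib

NUM_SHARDS = 64

def shard_of(logical_key: str) -> int:
    """Map a logical key to a shard deterministically."""
    digest = hashlib.md5(logical_key.encode()).hexdigest()
    return int(digest, 16) % NUM_SHARDS

def assign_shards(nodes):
    """Round-robin the shards over the currently live Merger nodes."""
    return {shard: nodes[shard % len(nodes)] for shard in range(NUM_SHARDS)}

before = assign_shards(["m1", "m2", "m3"])
after = assign_shards(["m1", "m3"])            # m2 died
moved = [s for s in range(NUM_SHARDS) if before[s] != after[s]]
# every shard in `moved` must be restreamed - RF=1 keeps no second copy
```

A plain modulo assignment like this moves many shards on membership change, which matches the "restream everything" pain point on the slide.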
24. Problems
▪ MemTable -> SSTable latency (±undefined)…
▪ … and eventual consistency - Splitters on all replicas ...
▪ … some data could be missing
▪ Cassandra’s vs our sharding - Merger failure -> complete reload
▪ Depending on internal format - zero support, no guarantees,
problematic documentation, insane
▪ Additional development, it took some time to get right
25. The good things
▪ Allows us to do frequent full-data dumps
▪ Performance
• Our C++ parser is very fast
• During normal operation - near-zero load on DB servers
▪ Zero impact on production DB - complete isolation
▪ Mergers - custom built for our use case - very efficient
27. Scylla is better
▪ Currently migrating, some problems (Scylla is too good)
▪ Testing -> continuous full table scans - filter for
“last_update_timestamp”
▪ Using token ranges - Scylla can handle, no overloading
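The token-range scan above works by covering the Murmur3 ring with many small sub-ranges, each issued as its own query (conceptually `SELECT ... WHERE token(pk) > start AND token(pk) <= end`), a load Scylla absorbs without overloading. A minimal sketch of the range arithmetic, assuming the default Murmur3 partitioner's token space (illustrative, not our production code):

```python
# Murmur3 partitioner token space: [-2**63, 2**63 - 1]
MIN_TOKEN = -(2**63)
MAX_TOKEN = 2**63 - 1

def token_subranges(n):
    """Split the full ring into n contiguous (start, end] sub-ranges."""
    span = MAX_TOKEN - MIN_TOKEN
    step = span // n
    start = MIN_TOKEN
    for i in range(n):
        end = MAX_TOKEN if i == n - 1 else start + step
        yield (start, end)
        start = end

ranges = list(token_subranges(512))
```

Each sub-range then becomes one small query, optionally combined with the `last_update_timestamp` filter, so the scan is spread evenly across the cluster.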
28. What’s next?
▪ SSTable parser removal - Amazing!!!
▪ Two possible scenarios
a. Keep splitters and read preferably local token ranges (Complex)
b. Keep only Mergers and read the data directly (Much easier)
29. Problems
▪ MemTable -> SSTable latency (±undefined)…
▪ … and eventual consistency - Splitters on all replicas ...
▪ … some data could be missing
▪ Cassandra’s vs our sharding - Merger failure -> complete reload
▪ Depending on internal format - zero support, no guarantees,
problematic documentation, insane
▪ Additional development, it took some time to get right
Someone had a very good idea - we will do custom sharding!
But then Postgres started to fail under the high read count, so guess what, people had another great idea!
We will use Redis! A great thing to maintain.
Who thinks Redis
Scylla is on the left.
We will be able to remove the wall of redises.
One of the workarounds will be the main topic of this presentation
Our engine is always hungry
So have you heard of SSTables?
Have you heard of Java?
We will get to last_update_timestamp later
Why is it ok to have replication factor 1?
Mention last_update_timestamp
The main good thing is that it actually WORKS! It has been in production for more than a year.
Even if things get out of hand, we only overload Mergers