To Serverless and Beyond
Dor Laor, Co-founder & CEO
Why ScyllaDB?
We asked the smartest entity on earth
ScyllaDB
Why ScyllaDB?
Best High Availability in the industry
Best Disaster Recovery in the industry
Best scalability in the industry
Best Price/Performance in the industry
Auto-tune - out of the box performance
Fully compatible with Cassandra & DynamoDB
The power of Cassandra at the speed of Redis
No Lock-in
Open Source Software
Agenda
■ Quick recap for newbies
■ Journey of 2022
■ The Future of Fast Data
ScyllaDB Journey Gen #1: Brute Power
Database Scores
Database Efficiency Score =
ƒ(query/vcpu, workload/ram, cost/storage);
Database Over Provision Score =
ƒ(idle, tables/clusters, readonly/DCs);
Database Scale Score =
ƒ(10x scale cost/ability, P99@Scale, Time2scale, Ease of scale);
Database Maintenance Score =
ƒ(Backup, Repair, Replace, Scale, Upgrade, Observe, Trace);
Database Freedom Score =
ƒ(License, Environment, Ecosystem tools);
ScyllaDB Journey Gen #2: Full API, LWT
[Figure: message flow among replicas R1, R2, R3: "Can I propose a value?", "Check condition", "Accept new value", "Learn decision", "Decision made", with "ok!", "row", "checksum" and "ack" replies]
ScyllaDB Journey Gen #3: CDC, K8S, Indexes, ICS, Alternator
ScyllaDB Journey - ScyllaDB 5.0
ScyllaDB Cloud 2022 Roadmap
Today (2022) - Perfect Single Tenant land
■ Extreme efficiency
■ Perfect security & isolation
■ Clear and simple units - servers
■ > 5x Cost effectiveness vs all others
■ Easy to consume
Coming - Multi tenancy
■ Separation of compute & storage
■ Serverless, pay per usage
■ Elastic w/ APIs control
■ More efficient than today's implementation
■ Preserves ScyllaDB goodware™
2022: All Weather, All Terrain, High Throughput, Low Latency Database
2022 Highlights
■ Performance, Performance Isolation, Performance
■ Maintenance, Maintenance, Maintenance
■ Consistent metadata by Raft
■ ScyllaDB Cloud serverless - abstract the servers
Performance, Performance Isolation
■ IO Scheduler v5
■ Per partition rate limit
■ 30% CPU gain coming to enterprise/cloud
■ Distributed aggregates
■ Eliminate exceptions from the IO path
■ Reverse queries
■ I4i
■ Range tombstone v2
I/O Scheduler V5
■ V1 max-io-requests
■ V2 bandwidth+iops capacity, statically partitioned
■ V3 cross-shards capacity sharing
■ V4 rate-limiter-based capacity
■ V5 Mixed BW
Also added per I/O class throttling
■ Compaction
■ Repair
■ Streaming
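The rate-limiter-based capacity and per I/O class throttling named above can be illustrated with a generic token bucket. This is only a minimal sketch and not the actual I/O scheduler code: the class name `TokenBucket`, the rates, the burst sizes and the explicit `now` clock argument are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class TokenBucket:
    """Generic token bucket; a hypothetical stand-in for a per-class I/O throttle."""
    rate: float        # tokens replenished per second (e.g. bytes/s)
    capacity: float    # maximum burst size
    tokens: float = 0.0
    last: float = 0.0  # timestamp of the last refill, in seconds

    def try_consume(self, amount: float, now: float) -> bool:
        # Refill according to elapsed time, capped at the burst capacity.
        elapsed = max(0.0, now - self.last)
        self.tokens = min(self.capacity, self.tokens + elapsed * self.rate)
        self.last = now
        if amount <= self.tokens:
            self.tokens -= amount
            return True
        return False  # caller should queue or delay the request


# One bucket per I/O class named on the slide; the numbers are made up.
throttles = {
    "compaction": TokenBucket(rate=100.0, capacity=200.0),
    "repair": TokenBucket(rate=50.0, capacity=100.0),
    "streaming": TokenBucket(rate=50.0, capacity=100.0),
}
```

Passing the clock in explicitly keeps the sketch deterministic; a real throttle would read a monotonic clock instead.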
SELECT COUNT(*) FROM keyspace.table
[Figure: the query is split into COUNT sub-queries over partition ranges p0..p1, p2..p3, p4..p5 and p6..p7; partial counts c1, c2, c3 are returned and combined]
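The scatter-gather idea behind a distributed COUNT can be sketched as follows. This is an illustration under stated assumptions, not ScyllaDB's implementation: the in-memory `ROWS_PER_PARTITION` data set and the helper names are hypothetical, and threads stand in for sub-queries sent to different nodes.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical data set: partition key -> number of rows in that partition.
ROWS_PER_PARTITION = {f"p{i}": i + 1 for i in range(8)}

# Partition ranges as drawn on the slide: p0..p1, p2..p3, p4..p5, p6..p7.
RANGES = [("p0", "p1"), ("p2", "p3"), ("p4", "p5"), ("p6", "p7")]


def count_range(first: str, last: str) -> int:
    """Partial COUNT over one partition range (stands in for one sub-query)."""
    lo, hi = int(first[1:]), int(last[1:])
    return sum(ROWS_PER_PARTITION[f"p{i}"] for i in range(lo, hi + 1))


def distributed_count() -> int:
    """Run the partial counts in parallel and add them up."""
    with ThreadPoolExecutor(max_workers=len(RANGES)) as pool:
        partials = list(pool.map(lambda r: count_range(*r), RANGES))
    return sum(partials)
```

COUNT works this way because partial results can simply be summed; other aggregates need their own merge step.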
Maintenance, Maintenance, Maintenance
■ Auto Tombstone GC
■ Load & stream
■ Auto restore
■ RBNO
■ Consistent Schema changes
[Figure: key ranges 1-333, 334-666, 667-1000 shown alongside key ranges 1-200, 201-400, 401-600, 601-800, 801-1000]
Repair Based Node Operations (RBNO) is in for all node operations, by default
■ Resumable
■ Consistent
■ Simplified
■ Fast
■ Unified
Top-K Queries
User defined aggregates = UDF + WASM + Distributed aggregates + Workload prioritization
How many people follow Elon Musk?
What are the top-k products in shopping carts?
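A top-k query built from a user defined aggregate can be pictured as a per-shard state function, a merge step and a final function. The sketch below is a plain Python illustration of that shape, not a ScyllaDB UDF or WASM module; the function names and the shopping-cart data are hypothetical.

```python
from collections import Counter
from typing import Iterable


def partial_counts(cart_items: Iterable[str]) -> Counter:
    """Per-shard state: count how often each product appears in local carts."""
    return Counter(cart_items)


def merge(partials: Iterable[Counter]) -> Counter:
    """Reduce step: add up the per-shard counters."""
    total = Counter()
    for p in partials:
        total.update(p)
    return total


def top_k(partials: Iterable[Counter], k: int) -> list[tuple[str, int]]:
    """Final step: the k most frequent products across all shards."""
    return merge(partials).most_common(k)


# Hypothetical shopping-cart contents held by three shards.
shards = [
    ["milk", "bread", "milk"],
    ["bread", "eggs", "milk"],
    ["eggs", "milk", "eggs"],
]
result = top_k([partial_counts(s) for s in shards], k=2)
# result == [("milk", 4), ("eggs", 3)]
```

Merging full counters keeps the answer exact; shipping only each shard's local top-k would be cheaper but can miss items that are popular overall yet never lead on a single shard.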
Double Click on Database Drivers
■ Maintain Python, Java, Go
■ Huge boost w/ Rust
■ C++ driver, wrapped over Rust
■ Improved load balancing (shard, cache, zone aware)
■ Serverless support
■ Performance
Consistent Metadata by Raft
■ Raft
■ Consistent Schema changes
■ Consistent topology changes
Protocol for state machine replication
Total order broadcast of state change commands
Single leader
[Figure: command log X = 0, X += 1, CAS(X, 0, 1) replicated to state machines]
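State machine replication means that every replica applies the same commands in the same order and therefore reaches the same state. The sketch below replays the three commands from the slide on a single register; it deliberately leaves out leader election and log replication, and the `Set`, `Add` and `Cas` classes are hypothetical names introduced only for this example.

```python
from dataclasses import dataclass
from typing import Union


@dataclass(frozen=True)
class Set:
    value: int


@dataclass(frozen=True)
class Add:
    delta: int


@dataclass(frozen=True)
class Cas:
    expected: int
    new: int


Command = Union[Set, Add, Cas]


def apply(x: int, cmd: Command) -> int:
    """Deterministic state transition for a single register X."""
    if isinstance(cmd, Set):
        return cmd.value
    if isinstance(cmd, Add):
        return x + cmd.delta
    if isinstance(cmd, Cas):
        return cmd.new if x == cmd.expected else x
    raise TypeError(f"unknown command: {cmd!r}")


def replay(log: list[Command], initial: int = 0) -> int:
    """Apply a totally ordered log of commands, front to back."""
    state = initial
    for cmd in log:
        state = apply(state, cmd)
    return state


# The commands from the slide, in one agreed order.
log = [Set(0), Add(1), Cas(0, 1)]

# Three replicas replay the same log and end in the same state (X == 1,
# because the CAS sees X == 1 and does nothing).
replica_states = [replay(log) for _ in range(3)]
```

Swapping the last two commands gives X == 2 instead, which is why a total order over the commands matters.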
[Figure: Raft roadmap across releases 5.0, 5.2, 5.3: safe schema changes, safe topology changes, dynamic partitioning, consistent tables, tablets]
■ Serverless
■ Core ScyllaDB
Scylla Cloud Goes Serverless
s/server/vcpu/
■ Abstract the servers, instances. Building block is VCPU
■ Decouple storage from compute. Still uses NVMe
■ Flexibility of annual commit
■ Encryption@rest + BYOK
■ Elasticity
■ API & Terraform
■ Advanced Network gateways
■ Self serve user management & SSO
■ Self serve billing & metering
Vision
Old: The power of Cassandra at the speed of Redis
New: The power of ScyllaDB, with the usability of DynamoDB
Gamechanger Revolution
Today
■ Eventual consistency
■ 3x replication, may not be identical
■ Storage is bound to compute
■ Full scan repair process is required
■ Cheap for high throughput, expensive for high volume
Future
■ Raft based consistency
■ No need to repair
■ Single main replica, on S3
■ Can be cached for performance
■ Save arriving, fresh data in 3 replicas, until sync w/ S3
■ Point in time recovery, extreme elasticity
■ Cluster parking
■ ~Free disaster recovery backup
Future of Key Ranges
■ token = hash(partition key)
■ token range: set of partition keys
[Figure: tokens and the token range between them]
Each node has a set of tokens assigned during bootstrap
Token Metadata - vNodes
[Figure: tokens owned by node A, node B and node C; token metadata combined with the replication strategy yields replication metadata, with replica sets {A, C}, {C, B}, {B, A}, {C, A}, {A, B}, {B, C}]
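How token metadata and a replication strategy combine into replica sets can be sketched with a small ring. This is an illustration under assumptions: MD5 stands in for the partitioner's hash (Cassandra-compatible partitioners typically use Murmur3), the vnode naming scheme is invented, and the "walk clockwise" strategy is the simplest possible one, so the output is not meant to reproduce the replica sets on the slide.

```python
import bisect
import hashlib


def token_of(key: str) -> int:
    """token = hash(partition key); MD5 is used here only for illustration."""
    return int.from_bytes(hashlib.md5(key.encode()).digest()[:8], "big")


def build_ring(nodes: list[str], vnodes_per_node: int) -> list[tuple[int, str]]:
    """Token metadata: a sorted list of (token, owner) pairs."""
    ring = [
        (token_of(f"{node}-vnode-{i}"), node)
        for node in nodes
        for i in range(vnodes_per_node)
    ]
    return sorted(ring)


def replicas_for(key: str, ring: list[tuple[int, str]], rf: int) -> list[str]:
    """Simple replication strategy: walk clockwise, collecting rf distinct nodes."""
    tokens = [t for t, _ in ring]
    start = bisect.bisect_left(tokens, token_of(key)) % len(ring)
    found: list[str] = []
    for step in range(len(ring)):
        owner = ring[(start + step) % len(ring)][1]
        if owner not in found:
            found.append(owner)
        if len(found) == rf:
            break
    return found


# Three nodes with two vnodes each, as in the six-token ring on the slide.
ring = build_ring(["A", "B", "C"], vnodes_per_node=2)
```

Calling `replicas_for("user:42", ring, rf=2)` returns two distinct node names; which two depends on where the key's token lands on the ring.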
Future of Key Ranges
Today
■ Key ranges =~ vNodes
■ Statically allocated @node creation
■ Size of TBs
Coming - Tablets
■ Small range of keys
■ Size GBs-10GB
■ Dynamically controlled
■ Dynamically mapped/moved between nodes
Tablets
[Figure: a tablet and its tablet replicas; replication metadata is kept per table]
Tablets - Balancing
Aim at manageable size:
■ Not too small -> few -> low metadata overhead
■ Not too large -> many -> enough to balance load
The load balancer can decide to move tablets
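A load balancer that moves tablets can be sketched as a greedy loop. This is a toy model rather than ScyllaDB's balancer: it assumes all tablets are the same size, balances only by tablet count, and the `rebalance` function and node names are hypothetical.

```python
def rebalance(
    placement: dict[str, list[str]], max_moves: int = 100
) -> list[tuple[str, str, str]]:
    """Greedy balancer: move one tablet at a time from the fullest to the emptiest node.

    `placement` maps node name -> list of tablet ids and is modified in place.
    Returns the list of (tablet, source, destination) moves.
    """
    moves = []
    for _ in range(max_moves):
        busiest = max(placement, key=lambda n: len(placement[n]))
        idlest = min(placement, key=lambda n: len(placement[n]))
        # Stop once the spread is at most one tablet; further moves cannot help.
        if len(placement[busiest]) - len(placement[idlest]) <= 1:
            break
        tablet = placement[busiest].pop()
        placement[idlest].append(tablet)
        moves.append((tablet, busiest, idlest))
    return moves


# Hypothetical cluster: node "n3" has just been added and owns nothing yet.
placement = {
    "n1": ["t0", "t1", "t2", "t3"],
    "n2": ["t4", "t5", "t6", "t7"],
    "n3": [],
}
moves = rebalance(placement)
# moves == [("t3", "n1", "n3"), ("t7", "n2", "n3")]
```

With only eight tablets the best achievable split is 3/3/2, which illustrates the point above that there must be enough tablets to balance load.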
Tablets
Resharding is cheap.
SSTables split at tablet boundary.
Reassign tablets to shards (logical operation).
Tablets
Cleanup after topology change is cheap.
Just delete SSTables.
Future of Storage
Today
■ Disaggregated, local NVMe
■ Gravity - time to stream
■ Ultra fast
■ Cheaper than EBS/SAN
■ Expensive if not queried
[Figure: tablet replicas placed across nodes]
Nodes w/ NVMe are awesome!
Object Store / S3 based
■ $ almost free
■ Completely decoupled
■ Fully consistent
■ Accessed by all nodes
■ Tiered Storage
■ Slow -> Local NVMe cache
[Figure: write path for a tablet on a node: local write, send commitlog, upload to S3, repair & compact]
Future of Storage
■ Compaction can happen off-query engine
■ SSTables NVMe-cached by replicas (Cache-Replication-Factor)
■ Tiered Storage helps to control size of NVMe cache
■ Available for all users (not just DBaaS)
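The idea of keeping SSTables in object storage and caching them on local NVMe can be sketched as a read-through LRU cache. Everything here is a stand-in chosen for illustration: `ObjectStore` is an in-memory dictionary rather than S3, `NvmeCache` counts entries rather than bytes, and LRU is an assumed eviction policy that the slides do not specify.

```python
from collections import OrderedDict


class ObjectStore:
    """Hypothetical stand-in for S3: slow, shared, holds every SSTable."""

    def __init__(self) -> None:
        self.objects = {}  # SSTable name -> bytes
        self.reads = 0     # number of (slow) fetches served

    def get(self, name: str) -> bytes:
        self.reads += 1
        return self.objects[name]


class NvmeCache:
    """Read-through LRU cache with a fixed number of SSTable slots."""

    def __init__(self, store: ObjectStore, max_entries: int) -> None:
        self.store = store
        self.max_entries = max_entries
        self.entries = OrderedDict()  # oldest entry first

    def read(self, name: str) -> bytes:
        if name in self.entries:
            self.entries.move_to_end(name)  # mark as most recently used
            return self.entries[name]
        data = self.store.get(name)  # cache miss: fetch from the object store
        self.entries[name] = data
        if len(self.entries) > self.max_entries:
            self.entries.popitem(last=False)  # evict the least recently used
        return data
```

Shrinking `max_entries` trades local NVMe capacity for more trips to the object store, which is the knob the tiered storage bullet refers to.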
Tablet + S3 + Raft Metadata = G.O.A.T
G.O.A.T Future
■ Data in S3 - Low TCO
■ NVMe Configurable Caches == Fast, tiered storage
■ Elasticity
■ Remove nodes - Sync pending to S3 & change table ownership
■ Add nodes
■ many at once
■ New hardware added async from tablet ownership
■ Node can serve data from the first tablet assigned!
■ Get the local NVMe cache populated
■ Point in time Backup/restore
■ Add/remove data-centers easily - access local S3
■ Cluster parking
Future of Consistency
[Figure: Eventual Consistency and Immediate Consistency]
Strongly-Consistent Tables
Bring RAFT to user tables!
CREATE TABLE foo WITH raft = TRUE;
Do not have any databases before ScyllaDB.
Always run nodetool cleanup after bootstrapping a new node.
Run repair within gc_grace_seconds.
Do not bootstrap nodes concurrently, or make any other topology change.
Do not use SimpleStrategy in a multi DC setup.
Strongly-Consistent Tables
LWT (Paxos based)
■ Slow
■ 3 rounds to replicas per user request
■ Concurrent conflicting requests -> retries -> negative scaling
RAFT
■ Fast
■ 1 round to replicas (on leader)
■ No retries
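The round-trip counts above translate into latency with simple arithmetic. The model below is a back-of-the-envelope sketch: it ignores processing and disk time, and the 1 ms round-trip time and the retry count are assumed values, not measurements.

```python
def request_latency_ms(rounds: int, rtt_ms: float, retries: int = 0) -> float:
    """Latency of one request: every attempt pays `rounds` round trips."""
    return (1 + retries) * rounds * rtt_ms


RTT_MS = 1.0  # assumed replica round-trip time

lwt_ms = request_latency_ms(rounds=3, rtt_ms=RTT_MS)                       # 3.0
lwt_contended_ms = request_latency_ms(rounds=3, rtt_ms=RTT_MS, retries=2)  # 9.0
raft_ms = request_latency_ms(rounds=1, rtt_ms=RTT_MS)                      # 1.0
```

Under these assumptions, a conflicting LWT request that needs two retries costs nine round trips against one for the Raft path.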
2022 ScyllaDB is great
Welcome to Serverless
Welcome to the Future
Thank You
@DorLaor

Contenu connexe

Similaire à To Serverless and Beyond

(DAT202) Managed Database Options on AWS
(DAT202) Managed Database Options on AWS(DAT202) Managed Database Options on AWS
(DAT202) Managed Database Options on AWSAmazon Web Services
 
Scylla Summit 2022: How ScyllaDB Powers This Next Tech Cycle
Scylla Summit 2022: How ScyllaDB Powers This Next Tech CycleScylla Summit 2022: How ScyllaDB Powers This Next Tech Cycle
Scylla Summit 2022: How ScyllaDB Powers This Next Tech CycleScyllaDB
 
Building Your Data Warehouse with Amazon Redshift
Building Your Data Warehouse with Amazon RedshiftBuilding Your Data Warehouse with Amazon Redshift
Building Your Data Warehouse with Amazon RedshiftAmazon Web Services
 
Aurora Serverless, 서버리스 RDB의 서막 - 트랙2, Community Day 2018 re:Invent 특집
Aurora Serverless, 서버리스 RDB의 서막 - 트랙2, Community Day 2018 re:Invent 특집Aurora Serverless, 서버리스 RDB의 서막 - 트랙2, Community Day 2018 re:Invent 특집
Aurora Serverless, 서버리스 RDB의 서막 - 트랙2, Community Day 2018 re:Invent 특집AWSKRUG - AWS한국사용자모임
 
OpenStack Cinder, Implementation Today and New Trends for Tomorrow
OpenStack Cinder, Implementation Today and New Trends for TomorrowOpenStack Cinder, Implementation Today and New Trends for Tomorrow
OpenStack Cinder, Implementation Today and New Trends for TomorrowEd Balduf
 
Data Scotland 2019: You can run SQL Server on AWS
Data Scotland 2019: You can run SQL Server on AWSData Scotland 2019: You can run SQL Server on AWS
Data Scotland 2019: You can run SQL Server on AWSJohn McCormack
 
AWS June Webinar Series - Getting Started: Amazon Redshift
AWS June Webinar Series - Getting Started: Amazon RedshiftAWS June Webinar Series - Getting Started: Amazon Redshift
AWS June Webinar Series - Getting Started: Amazon RedshiftAmazon Web Services
 
Fully-managed Cloud-native Databases: The path to indefinite scale @ CNN Mainz
Fully-managed Cloud-native Databases: The path to indefinite scale @ CNN MainzFully-managed Cloud-native Databases: The path to indefinite scale @ CNN Mainz
Fully-managed Cloud-native Databases: The path to indefinite scale @ CNN MainzQAware GmbH
 
Uses and Best Practices for Amazon Redshift
Uses and Best Practices for Amazon RedshiftUses and Best Practices for Amazon Redshift
Uses and Best Practices for Amazon RedshiftAmazon Web Services
 
Crimson: Ceph for the Age of NVMe and Persistent Memory
Crimson: Ceph for the Age of NVMe and Persistent MemoryCrimson: Ceph for the Age of NVMe and Persistent Memory
Crimson: Ceph for the Age of NVMe and Persistent MemoryScyllaDB
 
Leveraging Amazon Redshift for your Data Warehouse
Leveraging Amazon Redshift for your Data WarehouseLeveraging Amazon Redshift for your Data Warehouse
Leveraging Amazon Redshift for your Data WarehouseAmazon Web Services
 
Getting Started with Managed Database Services on AWS - September 2016 Webina...
Getting Started with Managed Database Services on AWS - September 2016 Webina...Getting Started with Managed Database Services on AWS - September 2016 Webina...
Getting Started with Managed Database Services on AWS - September 2016 Webina...Amazon Web Services
 
Migrating from InnoDB and HBase to MyRocks at Facebook
Migrating from InnoDB and HBase to MyRocks at FacebookMigrating from InnoDB and HBase to MyRocks at Facebook
Migrating from InnoDB and HBase to MyRocks at FacebookMariaDB plc
 
Apache Performance Tuning: Scaling Out
Apache Performance Tuning: Scaling OutApache Performance Tuning: Scaling Out
Apache Performance Tuning: Scaling OutSander Temme
 
Getting Started with Amazon Redshift
Getting Started with Amazon RedshiftGetting Started with Amazon Redshift
Getting Started with Amazon RedshiftAmazon Web Services
 
Selecting the Right AWS Database Solution - AWS 2017 Online Tech Talks
Selecting the Right AWS Database Solution - AWS 2017 Online Tech TalksSelecting the Right AWS Database Solution - AWS 2017 Online Tech Talks
Selecting the Right AWS Database Solution - AWS 2017 Online Tech TalksAmazon Web Services
 
Getting started with amazon redshift - Toronto
Getting started with amazon redshift - TorontoGetting started with amazon redshift - Toronto
Getting started with amazon redshift - TorontoAmazon Web Services
 
How Development Teams Cut Costs with ScyllaDB.pdf
How Development Teams Cut Costs with ScyllaDB.pdfHow Development Teams Cut Costs with ScyllaDB.pdf
How Development Teams Cut Costs with ScyllaDB.pdfScyllaDB
 
AWS March 2016 Webinar Series - Managed Database Services on Amazon Web Services
AWS March 2016 Webinar Series - Managed Database Services on Amazon Web ServicesAWS March 2016 Webinar Series - Managed Database Services on Amazon Web Services
AWS March 2016 Webinar Series - Managed Database Services on Amazon Web ServicesAmazon Web Services
 
Free & Open DynamoDB API for Everyone
Free & Open DynamoDB API for EveryoneFree & Open DynamoDB API for Everyone
Free & Open DynamoDB API for EveryoneScyllaDB
 

Similaire à To Serverless and Beyond (20)

(DAT202) Managed Database Options on AWS
(DAT202) Managed Database Options on AWS(DAT202) Managed Database Options on AWS
(DAT202) Managed Database Options on AWS
 
Scylla Summit 2022: How ScyllaDB Powers This Next Tech Cycle
Scylla Summit 2022: How ScyllaDB Powers This Next Tech CycleScylla Summit 2022: How ScyllaDB Powers This Next Tech Cycle
Scylla Summit 2022: How ScyllaDB Powers This Next Tech Cycle
 
Building Your Data Warehouse with Amazon Redshift
Building Your Data Warehouse with Amazon RedshiftBuilding Your Data Warehouse with Amazon Redshift
Building Your Data Warehouse with Amazon Redshift
 
Aurora Serverless, 서버리스 RDB의 서막 - 트랙2, Community Day 2018 re:Invent 특집
Aurora Serverless, 서버리스 RDB의 서막 - 트랙2, Community Day 2018 re:Invent 특집Aurora Serverless, 서버리스 RDB의 서막 - 트랙2, Community Day 2018 re:Invent 특집
Aurora Serverless, 서버리스 RDB의 서막 - 트랙2, Community Day 2018 re:Invent 특집
 
OpenStack Cinder, Implementation Today and New Trends for Tomorrow
OpenStack Cinder, Implementation Today and New Trends for TomorrowOpenStack Cinder, Implementation Today and New Trends for Tomorrow
OpenStack Cinder, Implementation Today and New Trends for Tomorrow
 
Data Scotland 2019: You can run SQL Server on AWS
Data Scotland 2019: You can run SQL Server on AWSData Scotland 2019: You can run SQL Server on AWS
Data Scotland 2019: You can run SQL Server on AWS
 
AWS June Webinar Series - Getting Started: Amazon Redshift
AWS June Webinar Series - Getting Started: Amazon RedshiftAWS June Webinar Series - Getting Started: Amazon Redshift
AWS June Webinar Series - Getting Started: Amazon Redshift
 
Fully-managed Cloud-native Databases: The path to indefinite scale @ CNN Mainz
Fully-managed Cloud-native Databases: The path to indefinite scale @ CNN MainzFully-managed Cloud-native Databases: The path to indefinite scale @ CNN Mainz
Fully-managed Cloud-native Databases: The path to indefinite scale @ CNN Mainz
 
Uses and Best Practices for Amazon Redshift
Uses and Best Practices for Amazon RedshiftUses and Best Practices for Amazon Redshift
Uses and Best Practices for Amazon Redshift
 
Crimson: Ceph for the Age of NVMe and Persistent Memory
Crimson: Ceph for the Age of NVMe and Persistent MemoryCrimson: Ceph for the Age of NVMe and Persistent Memory
Crimson: Ceph for the Age of NVMe and Persistent Memory
 
Leveraging Amazon Redshift for your Data Warehouse
Leveraging Amazon Redshift for your Data WarehouseLeveraging Amazon Redshift for your Data Warehouse
Leveraging Amazon Redshift for your Data Warehouse
 
Getting Started with Managed Database Services on AWS - September 2016 Webina...
Getting Started with Managed Database Services on AWS - September 2016 Webina...Getting Started with Managed Database Services on AWS - September 2016 Webina...
Getting Started with Managed Database Services on AWS - September 2016 Webina...
 
Migrating from InnoDB and HBase to MyRocks at Facebook
Migrating from InnoDB and HBase to MyRocks at FacebookMigrating from InnoDB and HBase to MyRocks at Facebook
Migrating from InnoDB and HBase to MyRocks at Facebook
 
Apache Performance Tuning: Scaling Out
Apache Performance Tuning: Scaling OutApache Performance Tuning: Scaling Out
Apache Performance Tuning: Scaling Out
 
Getting Started with Amazon Redshift
Getting Started with Amazon RedshiftGetting Started with Amazon Redshift
Getting Started with Amazon Redshift
 
Selecting the Right AWS Database Solution - AWS 2017 Online Tech Talks
Selecting the Right AWS Database Solution - AWS 2017 Online Tech TalksSelecting the Right AWS Database Solution - AWS 2017 Online Tech Talks
Selecting the Right AWS Database Solution - AWS 2017 Online Tech Talks
 
Getting started with amazon redshift - Toronto
Getting started with amazon redshift - TorontoGetting started with amazon redshift - Toronto
Getting started with amazon redshift - Toronto
 
How Development Teams Cut Costs with ScyllaDB.pdf
How Development Teams Cut Costs with ScyllaDB.pdfHow Development Teams Cut Costs with ScyllaDB.pdf
How Development Teams Cut Costs with ScyllaDB.pdf
 
AWS March 2016 Webinar Series - Managed Database Services on Amazon Web Services
AWS March 2016 Webinar Series - Managed Database Services on Amazon Web ServicesAWS March 2016 Webinar Series - Managed Database Services on Amazon Web Services
AWS March 2016 Webinar Series - Managed Database Services on Amazon Web Services
 
Free & Open DynamoDB API for Everyone
Free & Open DynamoDB API for EveryoneFree & Open DynamoDB API for Everyone
Free & Open DynamoDB API for Everyone
 

Plus de ScyllaDB

Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
What Developers Need to Unlearn for High Performance NoSQL
What Developers Need to Unlearn for High Performance NoSQLWhat Developers Need to Unlearn for High Performance NoSQL
What Developers Need to Unlearn for High Performance NoSQLScyllaDB
 
Low Latency at Extreme Scale: Proven Practices & Pitfalls
Low Latency at Extreme Scale: Proven Practices & PitfallsLow Latency at Extreme Scale: Proven Practices & Pitfalls
Low Latency at Extreme Scale: Proven Practices & PitfallsScyllaDB
 
Dissecting Real-World Database Performance Dilemmas
Dissecting Real-World Database Performance DilemmasDissecting Real-World Database Performance Dilemmas
Dissecting Real-World Database Performance DilemmasScyllaDB
 
Beyond Linear Scaling: A New Path for Performance with ScyllaDB
Beyond Linear Scaling: A New Path for Performance with ScyllaDBBeyond Linear Scaling: A New Path for Performance with ScyllaDB
Beyond Linear Scaling: A New Path for Performance with ScyllaDBScyllaDB
 
Dissecting Real-World Database Performance Dilemmas
Dissecting Real-World Database Performance DilemmasDissecting Real-World Database Performance Dilemmas
Dissecting Real-World Database Performance DilemmasScyllaDB
 
Database Performance at Scale Masterclass: Workload Characteristics by Felipe...
Database Performance at Scale Masterclass: Workload Characteristics by Felipe...Database Performance at Scale Masterclass: Workload Characteristics by Felipe...
Database Performance at Scale Masterclass: Workload Characteristics by Felipe...ScyllaDB
 
Database Performance at Scale Masterclass: Database Internals by Pavel Emelya...
Database Performance at Scale Masterclass: Database Internals by Pavel Emelya...Database Performance at Scale Masterclass: Database Internals by Pavel Emelya...
Database Performance at Scale Masterclass: Database Internals by Pavel Emelya...ScyllaDB
 
Database Performance at Scale Masterclass: Driver Strategies by Piotr Sarna
Database Performance at Scale Masterclass: Driver Strategies by Piotr SarnaDatabase Performance at Scale Masterclass: Driver Strategies by Piotr Sarna
Database Performance at Scale Masterclass: Driver Strategies by Piotr SarnaScyllaDB
 
Replacing Your Cache with ScyllaDB
Replacing Your Cache with ScyllaDBReplacing Your Cache with ScyllaDB
Replacing Your Cache with ScyllaDBScyllaDB
 
Powering Real-Time Apps with ScyllaDB_ Low Latency & Linear Scalability
Powering Real-Time Apps with ScyllaDB_ Low Latency & Linear ScalabilityPowering Real-Time Apps with ScyllaDB_ Low Latency & Linear Scalability
Powering Real-Time Apps with ScyllaDB_ Low Latency & Linear ScalabilityScyllaDB
 
7 Reasons Not to Put an External Cache in Front of Your Database.pptx
7 Reasons Not to Put an External Cache in Front of Your Database.pptx7 Reasons Not to Put an External Cache in Front of Your Database.pptx
7 Reasons Not to Put an External Cache in Front of Your Database.pptxScyllaDB
 
Getting the most out of ScyllaDB
Getting the most out of ScyllaDBGetting the most out of ScyllaDB
Getting the most out of ScyllaDBScyllaDB
 
NoSQL Database Migration Masterclass - Session 2: The Anatomy of a Migration
NoSQL Database Migration Masterclass - Session 2: The Anatomy of a MigrationNoSQL Database Migration Masterclass - Session 2: The Anatomy of a Migration
NoSQL Database Migration Masterclass - Session 2: The Anatomy of a MigrationScyllaDB
 
NoSQL Database Migration Masterclass - Session 3: Migration Logistics
NoSQL Database Migration Masterclass - Session 3: Migration LogisticsNoSQL Database Migration Masterclass - Session 3: Migration Logistics
NoSQL Database Migration Masterclass - Session 3: Migration LogisticsScyllaDB
 
NoSQL Data Migration Masterclass - Session 1 Migration Strategies and Challenges
NoSQL Data Migration Masterclass - Session 1 Migration Strategies and ChallengesNoSQL Data Migration Masterclass - Session 1 Migration Strategies and Challenges
NoSQL Data Migration Masterclass - Session 1 Migration Strategies and ChallengesScyllaDB
 
ScyllaDB Virtual Workshop
ScyllaDB Virtual WorkshopScyllaDB Virtual Workshop
ScyllaDB Virtual WorkshopScyllaDB
 
DBaaS in the Real World: Risks, Rewards & Tradeoffs
DBaaS in the Real World: Risks, Rewards & TradeoffsDBaaS in the Real World: Risks, Rewards & Tradeoffs
DBaaS in the Real World: Risks, Rewards & TradeoffsScyllaDB
 
Build Low-Latency Applications in Rust on ScyllaDB
Build Low-Latency Applications in Rust on ScyllaDBBuild Low-Latency Applications in Rust on ScyllaDB
Build Low-Latency Applications in Rust on ScyllaDBScyllaDB
 
NoSQL Data Modeling 101
NoSQL Data Modeling 101NoSQL Data Modeling 101
NoSQL Data Modeling 101ScyllaDB
 

Plus de ScyllaDB (20)

Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
What Developers Need to Unlearn for High Performance NoSQL
What Developers Need to Unlearn for High Performance NoSQLWhat Developers Need to Unlearn for High Performance NoSQL
What Developers Need to Unlearn for High Performance NoSQL
 
Low Latency at Extreme Scale: Proven Practices & Pitfalls
Low Latency at Extreme Scale: Proven Practices & PitfallsLow Latency at Extreme Scale: Proven Practices & Pitfalls
Low Latency at Extreme Scale: Proven Practices & Pitfalls
 
Dissecting Real-World Database Performance Dilemmas
Dissecting Real-World Database Performance DilemmasDissecting Real-World Database Performance Dilemmas
Dissecting Real-World Database Performance Dilemmas
 
Beyond Linear Scaling: A New Path for Performance with ScyllaDB
Beyond Linear Scaling: A New Path for Performance with ScyllaDBBeyond Linear Scaling: A New Path for Performance with ScyllaDB
Beyond Linear Scaling: A New Path for Performance with ScyllaDB
 
Dissecting Real-World Database Performance Dilemmas
Dissecting Real-World Database Performance DilemmasDissecting Real-World Database Performance Dilemmas
Dissecting Real-World Database Performance Dilemmas
 
Database Performance at Scale Masterclass: Workload Characteristics by Felipe...
Database Performance at Scale Masterclass: Workload Characteristics by Felipe...Database Performance at Scale Masterclass: Workload Characteristics by Felipe...
Database Performance at Scale Masterclass: Workload Characteristics by Felipe...
 
Database Performance at Scale Masterclass: Database Internals by Pavel Emelya...
Database Performance at Scale Masterclass: Database Internals by Pavel Emelya...Database Performance at Scale Masterclass: Database Internals by Pavel Emelya...
Database Performance at Scale Masterclass: Database Internals by Pavel Emelya...
 
Database Performance at Scale Masterclass: Driver Strategies by Piotr Sarna
Database Performance at Scale Masterclass: Driver Strategies by Piotr SarnaDatabase Performance at Scale Masterclass: Driver Strategies by Piotr Sarna
Database Performance at Scale Masterclass: Driver Strategies by Piotr Sarna
 
Replacing Your Cache with ScyllaDB
Replacing Your Cache with ScyllaDBReplacing Your Cache with ScyllaDB
Replacing Your Cache with ScyllaDB
 
Powering Real-Time Apps with ScyllaDB_ Low Latency & Linear Scalability
Powering Real-Time Apps with ScyllaDB_ Low Latency & Linear ScalabilityPowering Real-Time Apps with ScyllaDB_ Low Latency & Linear Scalability
Powering Real-Time Apps with ScyllaDB_ Low Latency & Linear Scalability
 
7 Reasons Not to Put an External Cache in Front of Your Database.pptx
7 Reasons Not to Put an External Cache in Front of Your Database.pptx7 Reasons Not to Put an External Cache in Front of Your Database.pptx
7 Reasons Not to Put an External Cache in Front of Your Database.pptx
 
Getting the most out of ScyllaDB
Getting the most out of ScyllaDBGetting the most out of ScyllaDB
Getting the most out of ScyllaDB
 
NoSQL Database Migration Masterclass - Session 2: The Anatomy of a Migration
NoSQL Database Migration Masterclass - Session 2: The Anatomy of a MigrationNoSQL Database Migration Masterclass - Session 2: The Anatomy of a Migration
NoSQL Database Migration Masterclass - Session 2: The Anatomy of a Migration
 
NoSQL Database Migration Masterclass - Session 3: Migration Logistics
NoSQL Database Migration Masterclass - Session 3: Migration LogisticsNoSQL Database Migration Masterclass - Session 3: Migration Logistics
NoSQL Database Migration Masterclass - Session 3: Migration Logistics
 
NoSQL Data Migration Masterclass - Session 1 Migration Strategies and Challenges
NoSQL Data Migration Masterclass - Session 1 Migration Strategies and ChallengesNoSQL Data Migration Masterclass - Session 1 Migration Strategies and Challenges
NoSQL Data Migration Masterclass - Session 1 Migration Strategies and Challenges
 
ScyllaDB Virtual Workshop
ScyllaDB Virtual WorkshopScyllaDB Virtual Workshop
ScyllaDB Virtual Workshop
 
DBaaS in the Real World: Risks, Rewards & Tradeoffs
DBaaS in the Real World: Risks, Rewards & TradeoffsDBaaS in the Real World: Risks, Rewards & Tradeoffs
DBaaS in the Real World: Risks, Rewards & Tradeoffs
 
Build Low-Latency Applications in Rust on ScyllaDB
Build Low-Latency Applications in Rust on ScyllaDBBuild Low-Latency Applications in Rust on ScyllaDB
Build Low-Latency Applications in Rust on ScyllaDB
 
NoSQL Data Modeling 101
NoSQL Data Modeling 101NoSQL Data Modeling 101
NoSQL Data Modeling 101
 

To Serverless and Beyond

  • 1. To Serverless and Beyond. Dor Laor, Co-founder & CEO
  • 2. Why ScyllaDB? We asked the smartest entity on earth
  • 7. Why ScyllaDB?
      ■ Best High Availability in the industry
      ■ Best Disaster Recovery in the industry
      ■ Best scalability in the industry
      ■ Best Price/Performance in the industry
      ■ Auto-tune - out of the box performance
      ■ Fully compatible with Cassandra & DynamoDB
      ■ The power of Cassandra at the speed of Redis
      ■ No Lock-in
      ■ Open Source Software
  • 8. Agenda
      ■ Quick recap for newbies
      ■ Journey of 2022
      ■ The Future of Fast Data
  • 9. ScyllaDB Journey Gen #1: Brute Power
  • 10. Database Scores
      Database Efficiency Score = ƒ(query/vcpu, workload/ram, cost/storage)
      Database Over Provision Score = ƒ(idle, tables/clusters, readonly/DCs)
      Database Scale Score = ƒ(10x scale cost/ability, P99@Scale, Time2scale, Ease of scale)
      Database Maintenance Score = ƒ(Backup, Repair, Replace, Scale, Upgrade, Observe, Trace)
      Database Freedom Score = ƒ(License, Environment, Ecosystem tools)
  • 11. ScyllaDB Journey Gen #2: Full API, LWT
      [Diagram: LWT exchange between replicas R1, R2 and R3. Steps: "Can I propose a value?", "Check condition", "Accept new value", "Learn decision", "Decision made". Replies: "ok!", "row checksum", "ack".]
  • 14. ScyllaDB Journey - ScyllaDB 5.0, ScyllaDB Cloud 2022 Roadmap
      Today (2022) - Perfect Single Tenant land
      ■ Extreme efficiency
      ■ Perfect security & isolation
      ■ Clear and simple units - servers
      ■ > 5x Cost effectiveness vs all others
      ■ Easy to consume
      Coming - Multi tenancy
      ■ Separation of compute & storage
      ■ Serverless, pay per usage
      ■ Elastic w/ APIs control
      ■ More efficient than today’s implementation
      ■ Preserves ScyllaDB goodware™
  • 16. 2022: All Weather, All Terrain, High Throughput, Low Latency Database
  • 17. 2022 Highlights
      ■ Performance, performance isolation, performance
      ■ Maintenance, Maintenance, Maintenance
      ■ Consistent metadata by Raft
      ■ ScyllaDB Cloud serverless - abstract the servers
  • 18. Performance, Performance Isolation
      ■ IO Scheduler v5
      ■ Per partition rate limit
      ■ 30% CPU gain coming to enterprise/cloud
      ■ Distributed aggregates
      ■ Eliminate exceptions from the IO path
      ■ Reverse queries
      ■ I4i
      ■ Range tombstone v2
  • 22. I/O Scheduler V5
      ■ V1 max-io-requests
      ■ V2 bandwidth+iops capacity, statically partitioned
      ■ V3 cross-shards capacity sharing
      ■ V4 rate-limiter-based capacity
      ■ V5 Mixed BW
      Also added per I/O class throttling
      ■ Compaction
      ■ Repair
      ■ Streaming
  • 23. Performance, Performance Isolation - Distributed aggregates
      SELECT COUNT(*) FROM keyspace.table
      [Diagram: the query is split into COUNT(*) over the sub-ranges p0..p1, p2..p3, p4..p5 and p6..p7; the partial results come back as "Return c1", "Return c2", "Return c3".]
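
The distributed aggregate on this slide can be illustrated with a small sketch. It is not ScyllaDB code: the in-memory table `ROWS_PER_TOKEN`, the `SUB_RANGES` list and both function names are assumptions made for illustration. It only shows the idea of counting each sub-range independently and summing the partial counts.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical in-memory "table": partition token -> number of rows in that partition.
ROWS_PER_TOKEN = {token: (token % 3) + 1 for token in range(8)}  # tokens p0..p7

# Sub-ranges as on the slide: p0..p1, p2..p3, p4..p5, p6..p7 (inclusive bounds).
SUB_RANGES = [(0, 1), (2, 3), (4, 5), (6, 7)]


def count_range(bounds):
    """Partial COUNT(*) over one token sub-range (stands in for one node's work)."""
    low, high = bounds
    return sum(rows for token, rows in ROWS_PER_TOKEN.items() if low <= token <= high)


def distributed_count(sub_ranges):
    """Fan the sub-range counts out in parallel and sum the partial results."""
    with ThreadPoolExecutor(max_workers=len(sub_ranges)) as pool:
        partial_counts = list(pool.map(count_range, sub_ranges))
    return sum(partial_counts), partial_counts


total, partials = distributed_count(SUB_RANGES)
print(partials, total)  # [3, 4, 5, 3] 15
```

COUNT is easy to distribute because the partial results combine with a plain sum; other aggregates need their own merge step.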
  • 27. Maintenance, Maintenance, Maintenance
      ■ Auto Tombstone GC
      ■ Load & stream
      ■ Auto restore
      ■ RBNO
      ■ Consistent Schema changes
  • 28. Maintenance, Maintenance, Maintenance
      [Diagram: key ranges 1-333, 334-666, 667-1000 shown next to key ranges 1-200, 201-400, 401-600, 601-800, 801-1000.]
  • 29. Maintenance, Maintenance, Maintenance
      Repair Based Node Operations (RBNO) is in for /all /default
      ■ Resumable
      ■ Consistent
      ■ Simplified
      ■ Fast
      ■ Unified
  • 30. Top-K Queries
      User defined aggregates = UDF + WASM + Distributed aggregates + Workload prioritization
      ■ How many people follow Elon Musk?
      ■ What are the top-k products in shopping carts?
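
A top-k query of this kind can be sketched as a two-step aggregate: each node counts locally, then the partial counts are merged and the k largest are kept. The function names and the sample carts below are hypothetical, and the sketch runs in plain Python rather than as a UDF inside the database.

```python
import heapq
from collections import Counter


def partial_counts(carts):
    """Per-node aggregate: count how many carts contain each product."""
    counts = Counter()
    for cart in carts:
        counts.update(set(cart))  # count a product once per cart
    return counts


def top_k(per_node_carts, k):
    """Merge the per-node partial counts, then keep the k most frequent products."""
    merged = Counter()
    for carts in per_node_carts:
        merged.update(partial_counts(carts))
    # Order by count descending, then by name so ties are deterministic.
    return heapq.nsmallest(k, merged.items(), key=lambda item: (-item[1], item[0]))


# Hypothetical shopping carts held by two nodes.
node_a = [["milk", "bread"], ["milk", "eggs"]]
node_b = [["milk"], ["bread", "eggs"], ["bread"]]
print(top_k([node_a, node_b], k=2))  # [('bread', 3), ('milk', 3)]
```

Merging the full per-node counts keeps the result exact; merging only each node's local top-k would be cheaper but can miss products that are moderately popular everywhere.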
  • 31. Double Click on Database Drivers
      ■ Maintain Python, Java, Go
      ■ Huge boost w/ Rust
      ■ C++ driver, wrapped over Rust
      ■ Improved load balancing (shard, cache, zone aware)
      ■ Serverless support
      ■ Performance
  • 33. Consistent Metadata by Raft
      ■ Raft
      ■ Consistent Schema changes
      ■ Consistent topology changes
      Raft:
      ■ Protocol for state machine replication
      ■ Total order broadcast of state change commands
      ■ Single leader
      [Diagram: commands X = 0, X += 1, CAS(X, 0, 1), X = 0]
  • 34. Consistent Metadata by Raft
      [Roadmap diagram: RAFT, Safe schema changes, Safe topology changes, Dynamic partitioning, Consistent tables, Tablets; releases 5.0, 5.2, 5.3]
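
The "state machine replication" idea from the Raft slides can be sketched without the protocol itself: if every replica applies the same commands in the same order, every replica ends in the same state. The sketch below skips leader election and log replication entirely. The `Replica` class and the command tuples are assumptions for illustration, and the log reuses the commands from the slide in the order they appear there.

```python
from dataclasses import dataclass, field


@dataclass
class Replica:
    """A deterministic state machine holding a single integer X."""
    x: int = 0
    results: list = field(default_factory=list)

    def apply(self, command):
        op, *args = command
        if op == "set":
            self.x = args[0]
            self.results.append(True)
        elif op == "add":
            self.x += args[0]
            self.results.append(True)
        elif op == "cas":  # compare-and-set: write only if X has the expected value
            expected, new = args
            ok = self.x == expected
            if ok:
                self.x = new
            self.results.append(ok)
        else:
            raise ValueError(f"unknown command: {op}")


# The single leader decides the order; every replica applies the same log.
log = [("set", 0), ("add", 1), ("cas", 0, 1), ("set", 0)]
replicas = [Replica() for _ in range(3)]
for command in log:
    for replica in replicas:
        replica.apply(command)

print([r.x for r in replicas], replicas[0].results)  # [0, 0, 0] [True, True, False, True]
```

The CAS fails on every replica because the increment was ordered before it; with a different order the outcome would change, which is why a total order matters.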
  • 36. Agenda
      ■ Quick recap for newbies
      ■ Journey of 2022
      ■ The Future of Fast Data
      ■ Serverless
      ■ Core ScyllaDB
  • 38. Scylla Cloud Goes Serverless
      ■ Abstract the servers, instances. Building block is VCPU
      ■ Decouple storage from compute. Still uses NVMe
      ■ Flexibility of annual commit
      ■ Encryption@rest + BYOK
      ■ Elasticity
      ■ API & Terraform
      ■ Advanced Network gateways
      ■ Self serve user management & SSO
      ■ Self serve billing & metering
  • 39. Vision
      Old: The power of Cassandra at the speed of Redis
      New: The power of ScyllaDB, with the usability of DynamoDB
  • 40. Gamechanger Revolution
      Today
      ■ Eventual consistency
      ■ 3x replication, may not be identical
      ■ Storage is bound to compute
      ■ Full scan repair process is required
      ■ Cheap for high throughput, expensive for high volume
      Future
      ■ Raft based consistency
      ■ No need to repair
      ■ Single main replica, on S3
      ■ Can be cached for performance
      ■ Save arriving, fresh data in 3 replicas, until sync w/ S3
      ■ Point in time recovery, extreme elasticity
      ■ Cluster parking
      ■ ~Free disaster recovery backup
  • 41. Future of Key Ranges
      ■ token = hash(partition key)
      ■ token range: set of partition keys
      [Diagram: a ring with tokens and the token range between them]
  • 42. Token Metadata - vNodes
      Each node has a set of tokens assigned during bootstrap
      [Diagram: nodes A, B and C; ring tokens owned by A, C, B, C, A, B]
  • 43. Token Metadata - vNodes
      token metadata + replication strategy = replication metadata
      [Diagram: ring tokens owned by A, C, B, C, A, B; replica sets {A, C}, {C, B}, {B, A}, {C, A}, {A, B}, {B, C}]
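
The token and vNode slides can be illustrated with a toy ring. Everything here is an assumption made for the sketch: the token values, the ring size of 100, MD5 as a stand-in for the real partitioner, and a "walk clockwise and collect distinct nodes" rule as a stand-in for the replication strategy, so the replica sets need not match the ones drawn on the slide.

```python
import bisect
import hashlib

# Hypothetical ring: each node owns several tokens (vNodes), in the order A C B C A B.
RING = sorted([(10, "A"), (25, "C"), (40, "B"), (55, "C"), (70, "A"), (85, "B")])
TOKENS = [token for token, _ in RING]
RING_SIZE = 100


def token_for(partition_key):
    """token = hash(partition key). MD5 is a stand-in for the real partitioner."""
    digest = hashlib.md5(partition_key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % RING_SIZE


def replicas_for(token, replication_factor=2):
    """Walk the ring clockwise from the token and collect distinct nodes."""
    start = bisect.bisect_left(TOKENS, token) % len(RING)
    owners = []
    for step in range(len(RING)):
        node = RING[(start + step) % len(RING)][1]
        if node not in owners:
            owners.append(node)
        if len(owners) == replication_factor:
            break
    return owners


print(replicas_for(5), replicas_for(30), replicas_for(90))
# ['A', 'C'] ['B', 'C'] ['A', 'C']
print(replicas_for(token_for("user:42")))
```

Token 90 lies past the last ring token, so the lookup wraps around to the first one; that wrap-around is what makes the token space a ring.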
  • 44. Future of Key Ranges
      Today
      ■ Key ranges =~ vNodes
      ■ Statically allocated @node creation
      ■ Size of TBs
      Coming - Tablets
      ■ Small range of keys
      ■ Size GBs-10GB
      ■ Dynamically controlled
      ■ Dynamically mapped/moved between nodes
  • 46. Tablets - Balancing
      Aim at manageable size:
      ■ Not too small -> few -> low metadata overhead
      ■ Not too large -> many -> enough to balance load
  • 47. Tablets - Balancing The load balancer can decide to move tablets
  • 48. Tablets: Resharding is cheap. SSTables split at tablet boundary. Reassign tablets to shards (logical operation).
  • 49. Tablets: Cleanup after topology change is cheap. Just delete SSTables.
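
A tablet load balancer can be sketched as a greedy loop that moves one tablet at a time from the fullest node to the emptiest one. This is only an illustration under simplifying assumptions: all tablets are treated as equally sized, load means tablet count, and the `rebalance` function and the sample cluster are hypothetical rather than ScyllaDB's actual balancer.

```python
def rebalance(tablets_by_node, max_moves=100):
    """Greedily move one tablet at a time from the fullest to the emptiest node."""
    placement = {node: list(tablets) for node, tablets in tablets_by_node.items()}
    moves = []
    for _ in range(max_moves):
        busiest = max(placement, key=lambda n: len(placement[n]))
        idlest = min(placement, key=lambda n: len(placement[n]))
        # Stop once the node counts differ by at most one tablet.
        if len(placement[busiest]) - len(placement[idlest]) <= 1:
            break
        tablet = placement[busiest].pop()
        placement[idlest].append(tablet)
        moves.append((tablet, busiest, idlest))
    return placement, moves


# Hypothetical cluster: node C was just added and owns no tablets yet.
cluster = {"A": ["t1", "t2", "t3", "t4"], "B": ["t5", "t6", "t7", "t8"], "C": []}
placement, moves = rebalance(cluster)
print({node: len(tablets) for node, tablets in placement.items()}, len(moves))
# {'A': 3, 'B': 3, 'C': 2} 2
```

Because a move only changes which node owns a tablet, the new node can start taking load after the first move instead of waiting for a full rebalance.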
  • 53. Future of Storage
      Today
      ■ Disaggregated, local NVMe
      ■ Gravity - time to stream
      ■ Ultra fast
      ■ Cheaper than EBS/SAN
      ■ Expensive if not queried
      Object Store / S3 based
      ■ $ almost free
      ■ Completely decoupled
      ■ Fully consistent
      ■ Accessed by all nodes
      ■ Tiered Storage
      ■ Slow -> Local NVMe cache
  • 55. Future of Storage
      ■ Compaction can happen off-query engine
      ■ SSTables NVMe-cached by replicas (Cache-Replication-Factor)
      ■ Tiered Storage helps to control size of NVMe cache
      ■ Available for all users (not just DBaaS)
      [Diagram labels: tablet, Node]
  • 56. Tablet + S3 + Raft Metadata = G.O.A.T
  • 57. G.O.A.T Future
      ■ Data in S3 - Low TCO
      ■ NVMe Configurable Caches == Fast, tiered storage
      ■ Elasticity
      ■ Remove nodes - Sync pending to S3 & change table ownership
      ■ Add nodes
      ■ many at once
      ■ New hardware added async from tablet ownership
      ■ Node can serve data from the first tablet assigned!
      ■ Get the local NVMe cache populated
      ■ Point in time Backup/restore
      ■ Add/remove data-centers easily - access local S3
      ■ Cluster parking
      [Diagram labels: tablet, Node]
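
The "data in S3, local NVMe as a configurable cache" idea can be sketched as a read-through LRU cache in front of an object store. The `CachedTabletStore` class, the dict standing in for S3 and the SSTable names are all hypothetical; a real cache would be sized in bytes and filled asynchronously.

```python
from collections import OrderedDict


class CachedTabletStore:
    """Read-through LRU cache (local NVMe stand-in) over an object store (S3 stand-in)."""

    def __init__(self, object_store, capacity):
        self.object_store = object_store  # dict: sstable name -> bytes
        self.capacity = capacity          # max number of cached SSTables
        self.cache = OrderedDict()
        self.hits = 0
        self.misses = 0

    def read(self, name):
        if name in self.cache:
            self.hits += 1
            self.cache.move_to_end(name)  # mark as most recently used
            return self.cache[name]
        self.misses += 1
        data = self.object_store[name]    # slow path: fetch from the object store
        self.cache[name] = data
        if len(self.cache) > self.capacity:
            self.cache.popitem(last=False)  # evict the least recently used entry
        return data


# Hypothetical SSTables that live in the object store.
s3 = {"sst-1": b"a", "sst-2": b"b", "sst-3": b"c"}
store = CachedTabletStore(s3, capacity=2)
for name in ["sst-1", "sst-2", "sst-1", "sst-3", "sst-2"]:
    store.read(name)
print(store.hits, store.misses, list(store.cache))  # 1 4 ['sst-3', 'sst-2']
```

A freshly added node starts with an empty cache and still returns correct data, only more slowly, which matches the slide's point that a node can serve from the first tablet assigned while its cache is populated.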
  • 59. Strongly-Consistent Tables
      Bring RAFT to user tables!
      CREATE TABLE foo WITH raft = TRUE;
      ■ Do not have any databases before ScyllaDB.
      ■ Always run nodetool cleanup after bootstrapping a new node.
      ■ Run repair within gc-grace seconds.
      ■ Do not bootstrap nodes concurrently, or make any other topology change
      ■ Do not use SimpleStrategy in a multi DC setup
  • 60. Strongly-Consistent Tables
      LWT (Paxos based)
      ■ Slow
      ■ 3 rounds to replicas per user request
      ■ Concurrent conflicting requests -> retries -> negative scaling
      RAFT
      ■ Fast
      ■ 1 round to replicas (on leader)
      ■ No retries
  • 61. 2022 ScyllaDB is great. Welcome to Serverless. Welcome to the Future. Thank You. @DorLaor