Storage on EC2 (& Cassandra), Cassandra Workshop, Berlin Buzzwords
1. Storage on EC2
(& Cassandra)
Tom Wilkie
Cassandra Workshop 8/06/11
Wednesday, 8 June 2011
2. ACHTUNG!
Data only collected over
past 5 days
Didn’t repeat experiments
(that much)
EC2 is a moving target
3. Consider:
• Ephemeral vs EBS
• ... vs Instance Type
• ... vs RAID level
• ... vs # threads
• (...vs storage engine)
Not considering:
• Cluster Performance
• Internode latency, throughput
• Tuning...
CORRELATED FAILURES...
6. ephemeral [ih-fem-er-uhl]
–adjective
1. lasting a very short time; short-lived; transitory: the ephemeral joys of childhood.
2. lasting but one day: an ephemeral flower.
–noun
3. anything short-lived, as certain insects.
7. Ephemeral Storage
Seek Performance
[Chart: Seek / s (0 to 8000) vs # Devices (1 to 4) for m1.large, m1.xlarge and c1.xlarge, all ephemeral]
7000 IOPs from a disk??
http://www.slideshare.net/davegardnerisme/running-cassandra-on-amazon-ec2
11. • Max 4 devices per instance
• Data goes away when instance is
terminated (or crashes!)
• Suspect there is some sort of indirection layer
underneath - thin provisioning / dedupe /
CoW or something
• Linux software RAID sucks
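Why "7000 IOPs from a disk??" looks suspicious, and why an indirection layer is a plausible guess, can be checked with rough arithmetic. The sketch below estimates random IOPS for a single spinning disk as 1 / (average seek time + average rotational latency). The 8.5 ms seek time and 7200 rpm are assumed, typical-looking figures for illustration, not measurements from these experiments, and the function names are hypothetical.

```python
def rotational_latency_s(rpm):
    """Average rotational latency: half a revolution, in seconds."""
    return 0.5 * 60.0 / rpm


def disk_random_iops(avg_seek_s, rpm):
    """Rough random IOPS for one spindle with one IO in flight."""
    return 1.0 / (avg_seek_s + rotational_latency_s(rpm))


# Assumed drive characteristics (illustrative only, not from the slides).
estimate = disk_random_iops(avg_seek_s=0.0085, rpm=7200)

# The chart annotation asks about roughly 7000 seeks/s.
observed = 7000.0

print(f"estimated IOPS for one spindle: {estimate:.0f}")
print(f"observed / estimated: {observed / estimate:.0f}x")
```

Under these assumptions a single spindle manages well under 100 random IOPS, so a figure in the thousands suggests the seeks are not all reaching a physical platter.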
12. CORRELATED FAILURES...
What happens if a bug in your software
causes all your nodes to crash?
e.g. a memory leak causes an
OOM... on all nodes
19. • Limited to ~100 IOPS per device?
• Or just 10ms latency?
• Seems to scale pretty linearly for random IO
• Sequential IO limited by network bandwidth,
independent of # devices
• shared with other network traffic?
• Linux software RAID sucks
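The first two bullets may be the same observation seen from two sides. With one IO outstanding per device, a fixed 10 ms latency caps throughput at 100 IOPS (Little's law: throughput = concurrency / latency). A minimal sketch of that arithmetic, plus the linear scaling seen for random IO across devices, follows. The function names and the one-IO-per-device assumption are illustrative, not taken from the benchmark setup.

```python
def iops_from_latency(latency_s, outstanding_ios=1):
    """Little's law: throughput = concurrency / latency."""
    return outstanding_ios / latency_s


def striped_random_iops(per_device_iops, n_devices):
    """Ideal linear scaling when random IO spreads evenly over devices."""
    return per_device_iops * n_devices


# Assumption: one IO in flight per device, 10 ms per IO.
per_device = iops_from_latency(0.010)

for n in range(1, 5):
    total = striped_random_iops(per_device, n)
    print(f"{n} device(s): about {total:.0f} random IOPS")
```

If deeper queues per device raised the measured IOPS, that would point to a latency bound rather than a hard per-device IOPS limit.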
20. CORRELATED FAILURES...
What happens when EBS breaks?
http://storagemojo.com/2011/04/29/amazons-ebs-outage/
http://status.heroku.com/incident/151
22. “Use Elastic Block Storage”
http://stackoverflow.com/questions/4714879/deploy-cassandra-on-ec2
“Raid 0 EBS drives are the way to go”
http://coreyhulen.org/2010/10/03/%EF%BB%BFcassandra-performance-tests-on-ec2/
“we recommend using raid0 ephemeral disks”
http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Cold-boot-performance-problems-td5615829.html#a5615889
25. Insert Rates by Instance Type
[Chart: Inserts / s (0 to 35000) for m1.large, m1.xlarge and c1.xlarge, each on ephemeral and on EBS storage]
100 threads, batch mutate size 100, values length 10, 1 column per row, 300 million values