Seagate Kinetic Open Storage Platform
James Hughes
…and many others
Storage is a Price Elastic Market
http://en.wikipedia.org/wiki/Alfred_Marshall

Price elasticity of demand
• Alfred Marshall (1890)

As the price of Storage approaches $0
• Demand for storage will approach infinity

If the price of a Cisco router approaches $0
• Demand for routers will not approach infinity: Storage is different

Areal Density Growth
[Figure: areal density (gigabit/in2, log scale from 0.1 to 100000) by year, 1989 to 2019. Technology eras: Inductive Writing & Reading; Inductive Writing / MR reading; Inductive Writing / GMR reading; Perpendicular Writing & GMR; HAMR; HAMR + BPM. Growth-rate annotations: 29%, 100%, 40%. Marked limits: Charap’s limit (broken); single particle superparamagnetic limit (estimated).]
Seagate Confidential: Subject to NDA No. 77103, effective Jan. 18, 2009, and all applicable supplements

• Late 1990s: superparamagnetic limit demonstrated through modeling
• Perpendicular expected to extend to 0.5-1 Tb/in2
• Additional innovations required at that point
–heat-assisted recording
–bit patterned media recording
• Areal Density CAGR 40%
• Transfer Rate CAGR 20%
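
The two growth rates on this slide compound differently, and a few lines of arithmetic make the gap concrete. The sketch below assumes that drive capacity tracks areal density and that a full-drive read is limited by the transfer rate; both are simplifying assumptions, and the function names are illustrative rather than taken from the deck.

```python
import math

AREAL_DENSITY_CAGR = 0.40  # from the slide
TRANSFER_RATE_CAGR = 0.20  # from the slide


def doubling_time_years(cagr: float) -> float:
    """Years needed for a quantity growing at `cagr` per year to double."""
    return math.log(2) / math.log(1 + cagr)


def growth_factor(cagr: float, years: int) -> float:
    """Total multiplier after `years` of compound growth at `cagr`."""
    return (1 + cagr) ** years


def relative_full_read_time(years: int) -> float:
    """Ratio of full-drive read time after `years` to today's read time.

    Assumption (not from the slide): capacity grows with areal density
    and read time equals capacity divided by transfer rate.
    """
    return growth_factor(AREAL_DENSITY_CAGR, years) / growth_factor(
        TRANSFER_RATE_CAGR, years
    )


if __name__ == "__main__":
    print(f"density doubles every {doubling_time_years(AREAL_DENSITY_CAGR):.2f} years")
    print(f"transfer rate doubles every {doubling_time_years(TRANSFER_RATE_CAGR):.2f} years")
    print(f"full-drive read takes {relative_full_read_time(10):.2f}x longer after 10 years")
```

Under these assumptions, areal density doubles roughly every 2.1 years, transfer rate roughly every 3.8 years, and reading a whole drive takes about 4.7 times longer after ten years.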

Cloud Computing will increase this trend
• Jevons Paradox
–Cloud Computing increases the efficiency of computing....

Improved technology doubles the amount of Information produced with a given amount of Storage
Demand for Storage rises

http://en.wikipedia.org/wiki/Jevons_paradox

Technology Trends

Shingled Disks
• Write head larger than read head
–Turns Disk into a sequentially written medium
• All updates to data and metadata are written sequentially to a continuous stream, called a log
• Disk API of sectors is no longer “natural”

http://www.ssrc.ucsc.edu/Papers/amer-ieeetm11.pdf

Log Structured Storage
How much is erased on a reposition?
• Tape - the remainder of the tape
• Shingled disk - the remainder of the track group
• Flash - the entire erase block

All persistent Storage systems do/will implement log structure
• e.g. “NoSQL Database of sectors”
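
To make the log-structure idea concrete, here is a minimal sketch of an append-only key/value log with an in-memory index. It is an illustration only, not how any drive firmware is implemented: the `LogStore` class, its record header layout, and the use of an in-memory `BytesIO` as the "media" are all assumptions made for the example.

```python
import io
import struct


class LogStore:
    """Append-only log plus an in-memory index of key -> latest record offset."""

    _HEADER = struct.Struct(">II")  # key length, value length (hypothetical layout)

    def __init__(self, log=None):
        # Any seekable binary file object works; BytesIO stands in for the media.
        self._log = log if log is not None else io.BytesIO()
        self._index = {}

    def put(self, key: bytes, value: bytes) -> int:
        """Append a record at the end of the log; never overwrite in place."""
        offset = self._log.seek(0, io.SEEK_END)
        self._log.write(self._HEADER.pack(len(key), len(value)) + key + value)
        self._index[key] = offset  # older records for this key become garbage
        return offset

    def get(self, key: bytes) -> bytes:
        """Read the most recent value for `key` via the index."""
        self._log.seek(self._index[key])
        key_len, value_len = self._HEADER.unpack(self._log.read(self._HEADER.size))
        self._log.seek(key_len, io.SEEK_CUR)  # skip over the stored key
        return self._log.read(value_len)

    def log_size(self) -> int:
        """Total bytes written, including superseded records."""
        return self._log.seek(0, io.SEEK_END)


if __name__ == "__main__":
    store = LogStore()
    store.put(b"sector-42", b"old contents")
    store.put(b"sector-42", b"new contents")
    print(store.get(b"sector-42"), store.log_size())
```

An update is just another append, so the write pattern stays sequential; the cost is that superseded records remain in the log until some form of cleaning reclaims them, which this sketch does not attempt.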

Does it make sense to layer a database on top of a database?
• Could we use the log structure of the media to provide a more natural storage system, rather than mimicking an antique paradigm?

Single System Performance Trend
Leading to disaggregation of servers

http://web.eecs.umich.edu/~twenisch/papers/isca09-disaggregate.pdf

Scaling Storage
Distributed Hash Table
• Key/Value Store

–RAM: Memcached
–Flash: FAWN
–Disk: Riak

http://en.wikipedia.org/wiki/Distributed_hash_table
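
One common way to build a distributed hash table is consistent hashing, sketched below. The deck does not say which placement scheme any of the named systems use, so treat this as a generic illustration; the `HashRing` class, the node names, and the choice of 64 virtual nodes per drive are all assumptions.

```python
import bisect
import hashlib


class HashRing:
    """Consistent-hash ring mapping string keys to node names."""

    def __init__(self, nodes, vnodes=64):
        # Each node is placed at `vnodes` pseudo-random points on the ring.
        self._ring = sorted(
            (self._hash(f"{node}#{i}"), node)
            for node in nodes
            for i in range(vnodes)
        )
        self._points = [point for point, _ in self._ring]

    @staticmethod
    def _hash(text: str) -> int:
        # First 8 bytes of SHA-1 give a 64-bit position on the ring.
        return int.from_bytes(hashlib.sha1(text.encode()).digest()[:8], "big")

    def node_for(self, key: str) -> str:
        """Return the node owning the first ring point after the key's hash."""
        idx = bisect.bisect_right(self._points, self._hash(key)) % len(self._points)
        return self._ring[idx][1]


if __name__ == "__main__":
    ring = HashRing(["drive-0", "drive-1", "drive-2"])
    for name in ["/videos/cat.mp4", "/logs/2013-10-22.txt"]:
        print(name, "->", ring.node_for(name))
```

Because the key is hashed directly to a location, any client that knows the node list can find the data without consulting a lookup service, and adding a node only moves the keys that land on the new node's ring points.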
Metadata and Metadata Servers are Evil
Required by traditional file systems (POSIX) to translate
names to sectors
• Hard to scale, heavy HA requirements, expensive

Can we use a name as a key?
• Place the data into a scaled key value store?
• Eliminate costly metadata servers?

Cumulative operations ordered by length
[Figure: cumulative percentage (0% to 100%) of operations and of data versus operation length (KB, log scale from 1 to 100000). Annotation at 32KB: 92% of the operations, 0.5% of the data.]

Map of Operations
[Figure: operations plotted by Time (minutes), Location (TB), and Length (up to 512KB).]

Seagate Kinetic Open Storage Platform

Seagate Kinetic Open Storage Platform
Disintermediates the path from application to drive
–Goes around file systems, volume managers, drivers

Enables an ecosystem of value-added software
–Partners (like Basho) can create their own system value

Lower TCO
–Eliminates complexity

[Figure: layered architecture diagram, shown in several builds with multiple Apps and multiple drives]

Proprietary to System Vendor
• Application
• Clustering
• Management

GPL
• LibKinetic
–C++, Java, Python, Erlang, DIY

Standard
• Interconnect
–ProtoBuf
–TCP/IP/GbE

Proprietary to Seagate
• Storage

System Hardware
Typical JBOD architecture
• Does not require a server, just JBODs to the ToR Switch
• 10 JBODs × 60 drives × 4TB = 2.4PB/Rack

Features
Provides RPC to Key/Value database
• Data is pre-indexed
• Compression and other value-added features are easy and transparent

P2P (Drive to Drive) copy of key ranges
Communicate using existing Data Center Plumbing (TCP/IP)
Multiple masters - Data sharing between machines
Configurable caching per command
• Async, Sync, Flush

Local space management

Kinetic Systems
Clustering (performance, reliability, management)
Compatibility with large scale applications (S3, etc.)
Centralized Management
• Reliability, availability, durability

Lower TCO
Elimination of server layers
Fewer human requirements
Reduced mistakes
Disaggregate storage from servers
Power management

Goals of API
Data movement
• Get/put/delete/getnext/getprevious
• Versioned (== for success), options
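
The data-movement operations listed here can be mocked in memory to show how versioned updates and ordered traversal fit together. This is a sketch, not the Kinetic API: the `VersionedStore` class and its method signatures are hypothetical, and "== for success" is read here as "the operation succeeds only if the caller's expected version equals the stored one".

```python
import bisect


class VersionMismatch(Exception):
    """Raised when the caller's expected version differs from the stored one."""


class VersionedStore:
    """In-memory mock of get/put/delete/getnext/getprevious with version checks."""

    def __init__(self):
        self._keys = []   # sorted list of keys, for getnext/getprevious
        self._data = {}   # key -> (version, value)

    def get(self, key):
        return self._data[key]  # returns (version, value)

    def put(self, key, value, new_version, expected_version=None):
        current = self._data.get(key)
        current_version = current[0] if current else None
        if current_version != expected_version:
            raise VersionMismatch(f"stored {current_version!r}, expected {expected_version!r}")
        if current is None:
            bisect.insort(self._keys, key)  # keep keys ordered
        self._data[key] = (new_version, value)

    def delete(self, key, expected_version):
        current = self._data.get(key)
        if current is None:
            raise KeyError(key)
        if current[0] != expected_version:
            raise VersionMismatch(f"stored {current[0]!r}, expected {expected_version!r}")
        del self._data[key]
        self._keys.pop(bisect.bisect_left(self._keys, key))

    def get_next(self, key):
        """Smallest stored key strictly greater than `key`, or None."""
        i = bisect.bisect_right(self._keys, key)
        return self._keys[i] if i < len(self._keys) else None

    def get_previous(self, key):
        """Largest stored key strictly less than `key`, or None."""
        i = bisect.bisect_left(self._keys, key)
        return self._keys[i - 1] if i > 0 else None
```

With this shape, two writers racing on the same key cannot silently overwrite each other: the second `put` carries a stale `expected_version` and fails, which is one way to support multiple masters without a lock server.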

Range operations
Multiple masters
• Authentication/Integrity/Authorization

Cluster-able
• Simple cluster configuration version enforcement

3rd party copy
Management

Management (System Vendor)
Configures the drive
• Network
• Authorized clients

Monitors
• Health
• Statistics
• Logs

Initiates recovery
• Change cluster version
• 3rdPartyCopy

Data formats
Key Structure
• Variable number of octets (0-4KB)

Data Structure (Serialized to a byte stream)
• KeyOf
• Version
• E2E Data Integrity
–Algorithm name

• Data: variable length (0-xMB)
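
As a rough illustration of such a record, the sketch below bundles a key, a version, a named integrity algorithm with its computed tag, and the data, then serializes the fields to a byte stream. The real wire format is not given on the slide: the `Record` type, the length-prefixed layout, and the default of SHA-1 are assumptions, and the KeyOf field is left out because the slide does not explain it.

```python
import hashlib
import struct
from dataclasses import dataclass

MAX_KEY_BYTES = 4 * 1024  # from the slide: keys are 0-4KB


@dataclass(frozen=True)
class Record:
    key: bytes
    version: bytes
    algorithm: str  # name of the end-to-end integrity algorithm
    tag: bytes      # integrity value computed over the data
    data: bytes


def make_record(key: bytes, version: bytes, data: bytes, algorithm: str = "sha1") -> Record:
    if len(key) > MAX_KEY_BYTES:
        raise ValueError("key exceeds 4KB")
    tag = hashlib.new(algorithm, data).digest()
    return Record(key, version, algorithm, tag, data)


def verify(record: Record) -> bool:
    """Recompute the tag with the named algorithm and compare."""
    return hashlib.new(record.algorithm, record.data).digest() == record.tag


def to_bytes(record: Record) -> bytes:
    """Serialize as five fields, each prefixed by a 4-byte big-endian length."""
    fields = (record.key, record.version, record.algorithm.encode(), record.tag, record.data)
    return b"".join(struct.pack(">I", len(f)) + f for f in fields)


def from_bytes(blob: bytes) -> Record:
    fields, pos = [], 0
    while pos < len(blob):
        (length,) = struct.unpack_from(">I", blob, pos)
        pos += 4
        fields.append(blob[pos:pos + length])
        pos += length
    key, version, algorithm, tag, data = fields
    return Record(key, version, algorithm.decode(), tag, data)
```

Carrying the algorithm name alongside the tag lets the reader check integrity end to end without agreeing on a single fixed checksum in advance.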
Performance Metrics
Same normal performance expectations
• Sequential Write
• Random Write
• Sequential Read
• Random Read

Iometer for key/value

Demo Time!

Performance Results
[Figure: two charts. Left: 1MB values put rate (MB/s), vertical axis 0 to 120. Right: 1KB values put rate (Puts/s), vertical axis 0 to 1000. Both horizontal axes run from 0 to 8.]

Conclusion
Deliver more value to Seagate, Partners and Customers
• Disintermediates the path from cloud applications to drive
• Enables innovation in hardware and software ecosystem
• Lower TCO

Open Source Software
–Basho Riak, Swift, HDFS

More information
• http://seagate.com/www/kinetic
• https://developers.seagate.com
• http://github.com/Seagate
