SlideShare une entreprise Scribd logo
STORAGE GEEKS
101 - 2019
Andrew McGee
NOV 2019
@Andrew_McGee
STORAGE GEEKS
A Short History of
Electromagnetic Disk Drives
The Hard Disk Drive
Diagram: I, Surachit, CC BY-SA 3.0, https://commons.wikimedia.org/w/index.php?curid=4275514
Two main avenues of technology
improvement:
● Capacity/Areal Density
● Performance
HDD Advancement
Engineers VS The Laws of Nature
Engineering Corner
● Nano tech
● Miniaturization
● Materials Science
Engineering
Thermal instability at nanoscale
causes magnetic particles to flip.
Nature
Superparamagnetic
Effect
Perpendicular - PMR
Increased storage density over
longitudinal recording.
Increased platter thickness &
complexity.
● 2005 - Toshiba
● 2006 - Seagate
● 2006 - Hitachi
● 2006 - Western Digital
● 2006 - Fujitsu
HDD Capacity Advancement
Diagram: Public Domain image by Luca Cassioli 2005, https://commons.wikimedia.org/wiki/File:Perpendicular_Recording_Diagram.svg
Shingled - SMR
Increased storage density by 25%.
New tracks overlap old ones and must
overwrite them. Write penalty!
Host or Device managed.
libzbc - Zoned Block Device library
manages write performance.
● 2013 - Seagate
● 2014 - HGST
HDD Capacity Advancement
Helium Filled
30% less weight to data ratio
25% higher MTBF. 23% less power.
Using SMR or PMR hermetically
sealed and filled with helium.
● 2013 - HGST 7 platter 6TB
● 2016 - Seagate 7 platter 10TB
● 2017 - Toshiba 9 platter 14TB
● 2019 - Seagate 9 platter 16TB
● 2019 - HGST 9 platter 18TB
HDD Capacity Advancement
Heat Assisted - HAMR
Increased storage density by 10x.
Laser heats the magnetic material
before writing.
Heat, write and cool in 1 nano second.
Seagate
● 2019 - 16TB
● 2020 - 20TB
● 2024 - 48TB
HDD Capacity Advancement
"The technology is very, very difficult, and
there has been a lot of skepticism” - IDC 2013
Microwave Assisted - MAMR
Alternative to HAMR backed by HGST
and Toshiba.
Spin Torque Oscillators (STO) to
generate 20-40 GHz microwaves
HGST
● 2019 - 16TB
● 2019 - 18TB
● 2020 - 20TB
HDD Capacity Advancement
Bit Patterned Media - BPM
A future technology to increase areal
density demonstrated by Toshiba.
Media is pre-patterned into magnetic
islands using nano-lithography.
1 grain per bit instead of 20-30 today
to avoid superparamagnetism.
Possible areal densities up to 20-300
Tb/in2 as opposed to the 1 Tb/in2.
● 2010 - Toshiba PoC
HDD Capacity Advancement
Common Connection Protocols
SAS 12Gbps Serial Attached SCSI - Enterprise drives based on SCSI command set
NL-SAS 12Gbps SAS Controller with SATA based media, disk and performance
SATA 6Gbps SATA - Mostly consumer based on Parallel ATA command set
HDD Summary
Researchers fight for areal density against superparamagnetism
Techniques such as PMR, SMR, HAMR and MAMR increase areal density
Helium filled HDD’s allow for more platters in same form factor
STORAGE GEEKS
A Short History of Solid
State Drives
The Solid State Drive
Diagram: Toniperis, CC BY-SA 4.0, https://commons.wikimedia.org/wiki/File:NAND-ssd.svg
NAND Flash Memory storage
DWPD instead of MTBF
Two main avenues of technology
improvement:
● Capacity/Density
● Reliability
SSD Advancement
Flash NAND Types
SLC Single Level Cell 1 bit per cell - Enterprise class, expensive - 100,000 e/w cycles
MLC Multi Level Cell - enterprise & consumer, higher capacity, 10,000 e/w cycles
TLC Triple Level Cell - cheapest, highest capacity, consumer, 5,000 e/w cycles
3DXP 3D XPoint (Intel Optane) not based on NAND does not store a current
Flash Controllers
Embedded processor with firmware
Critical to drive performance
FTL provides logical to physical
mapping and LBA
● Bad block mapping
● Wear leveling
● Queuing/Caching
● Garbage collection
Flash Storage Brain
Diagram: Dmitry Nosachev, CC BY-SA 4.0, https://en.wikipedia.org/wiki/File:NAND_Flash_Pages_and_Blocks.svg
Common Connection Protocols
SAS 12Gbps Serial Attached SCSI - Enterprise drives based on SCSI command set
M.2 Form factor that can use SATA or PCIe (NVMe)
SATA 6Gbps SATA - Mostly consumer based on Parallel ATA command set
PCIe 31.5 Gbps PCIe 3.0 x4 - typically uses NVMe command set
SSD Summary
Flash resiliency based on SLC, MLC or TLC NAND
Flash controllers use sophisticated techniques for endurance and performance
Traditional or new connection types and form factors
STORAGE GEEKS
A Short Evolution of
Enterprise Storage
Block Storage Network - SAN
Modular or monolithic, Intelligent
controllers
Redundant multi pathing, dedicated
network
FC, iSCSI, FCoE, NVMe, IB
Block Characteristics:
● Very fast
● Data written in blocks
● Not human friendly
● Complex addressing
Storage Array
Ethernet Switch
FC Switch
Server
Clients
Storage Array
File Storage Network - NAS
Usually Modular, Intelligent controllers
Redundancy via dedicated or existing
ethernet network
NFS, CIFS
File Characteristics:
● Pretty fast
● Data written in files
● Human friendly paths
● Standard networking
Storage Array (Filer)
Ethernet Switch
Servers
Clients
NAS Storage
Object Storage - Cloud
Usually massively scale-out
Redundancy via multiple data copies
REST API (http), Swift, S3
Object Characteristics:
● Not usually fast
● Data written in objects
● Eventually consistent
● Human friendly metadata
● Cloud networking
Content Addressable Storage
Ethernet Switch
Servers
Clients
Object Storage
WAN
Data Protection Techniques
RAID Redundant Array of Independent Disks
RAID 0 - striping only, no protection
RAID 1 - exact mirroring
RAID 5 - 5D+1P, parity blks striped
RAID 6 - 4D+2P, parity blks striped
Performance only
Least capacity
1 disk fail
2 disk fail
Data Protection Techniques
RAID Parity and striping across block devices to create sets of redundancy
Multipath Redundant network paths from host to storage (dual HBA/NIC at host)
EC Erasure coding saves data in fragments with parity across different locations
Mirror Storage array level synchronous and asynchronous mirroring of data (DR/BC)
Cache Battery or super-capacitor backed up cache
Data Optimisation/Reduction
Compression Algo to reduce redundant blocks, whitespace etc (unstructured data)
Deduplication Avoids storing duplicate blocks (CPU intensive, rehydration, mapping)
● RAW Storage
● Usable storage
● Effective storage
● Decimal GB vs GiB
Storage Vendor Terms
Storage Array Summary
Block, File and Object storage characteristics
Data protection techniques employed by storage - RAID, EC, Mirroring
Data reduction techniques employed by storage - Compression and Dedupe
STORAGE GEEKS
A Quick Look at CEPH
Ceph Storage
Object Built on scale out object storage but also provides interfaces to block and file
REST Uses http to provide S3 or Swift compatibility
Metadata Metadata servers to control data mapping and redundancy
CRUSH Hashing algo to calculate storage and retrieval of data in distributed cluster
SDS Software defined storage abstracts the intelligence from the hardware
STORAGE GEEKS
Andrew McGee
NOV 2019
@Andrew_McGee
THANKS!
Reference
03 - The HDD
https://en.wikipedia.org/wiki/Hard_disk_drive
04 - Engineers VS Nature
https://en.wikipedia.org/wiki/Superparamagnetism
05 - PMR
https://en.wikipedia.org/wiki/Perpendicular_recording
06 - SMR
https://en.wikipedia.org/wiki/Shingled_magnetic_recording
https://www.anandtech.com/show/7290/seagate-to-ship-5tb-hdd-in-2014-using-shingled-magnetic-recording
07 - Helium
https://www.backblaze.com/blog/helium-filled-hard-drive-failure-rates/
https://www.anandtech.com/show/10106/western-digital-introduces-its-consumer-helium-drives
08 - HAMR
https://blog.seagate.com/intelligent/hamr-next-leap-forward-now/
https://en.wikipedia.org/wiki/Heat-assisted_magnetic_recording
https://fstoppers.com/originals/hamr-and-mamr-technologies-will-unlock-hard-drive-capacity-year-326328
https://www.backblaze.com/blog/hamr-hard-drives/
09 - MAMR
https://www.theregister.co.uk/2019/03/07/hamr_and_mamr_hdd_direction_debate/
https://blog.westerndigital.com/mamr-technology-breakthrough/
https://www.anandtech.com/show/13764/western-digital-2019-16tb-hdd-mamr-hamr
https://www.anandtech.com/print/11925/western-digital-stuns-storage-industry-with-mamr-breakthrough-for-nextgen-hdds
10 - BPM
https://www.toshiba.co.jp/rdc/rd/fields/11_e09_e.htm
11 - Connection Protocols
https://en.wikipedia.org/wiki/Serial_Attached_SCSI
https://en.wikipedia.org/wiki/Serial_ATA
Reference
14 - The SSD
https://en.wikipedia.org/wiki/Solid-state_drive
15 - Flash NAND
https://www.mydigitaldiscount.com/everything-you-need-to-know-about-slc-mlc-and-tlc-nand-flash.html
16 - Flash Controllers
https://en.wikipedia.org/wiki/Wear_leveling
https://en.wikipedia.org/wiki/Write_amplification
17 - Connection protocols
https://en.wikipedia.org/wiki/M.2

Contenu connexe

Tendances

USENIX NSDI 2016 (Session: Resource Sharing)
USENIX NSDI 2016 (Session: Resource Sharing)USENIX NSDI 2016 (Session: Resource Sharing)
USENIX NSDI 2016 (Session: Resource Sharing)Ryousei Takano
 
JetStor X Storage Products 2017! New HOT products!
JetStor X Storage Products 2017! New HOT products!JetStor X Storage Products 2017! New HOT products!
JetStor X Storage Products 2017! New HOT products!Gene Leyzarovich
 
Using Ceph for Large Hadron Collider Data
Using Ceph for Large Hadron Collider DataUsing Ceph for Large Hadron Collider Data
Using Ceph for Large Hadron Collider DataRob Gardner
 
クラウド時代の半導体メモリー技術
クラウド時代の半導体メモリー技術クラウド時代の半導体メモリー技術
クラウド時代の半導体メモリー技術Ryousei Takano
 
Ceph Day Berlin: Scaling an Academic Cloud
Ceph Day Berlin: Scaling an Academic CloudCeph Day Berlin: Scaling an Academic Cloud
Ceph Day Berlin: Scaling an Academic CloudCeph Community
 
SanDisk: Persistent Memory and Cassandra
SanDisk: Persistent Memory and CassandraSanDisk: Persistent Memory and Cassandra
SanDisk: Persistent Memory and CassandraDataStax Academy
 
Adaptec’s maxCache™ 3.0 Read and Write SSD Caching Solution
Adaptec’s maxCache™ 3.0 Read and Write SSD Caching SolutionAdaptec’s maxCache™ 3.0 Read and Write SSD Caching Solution
Adaptec’s maxCache™ 3.0 Read and Write SSD Caching SolutionAdaptec by PMC
 
JetStor 8 series 16G FC 12G SAS units
JetStor 8 series 16G FC 12G SAS unitsJetStor 8 series 16G FC 12G SAS units
JetStor 8 series 16G FC 12G SAS unitsGene Leyzarovich
 
Towards Application Driven Storage
Towards Application Driven StorageTowards Application Driven Storage
Towards Application Driven StorageJavier González
 
Ceph Day Berlin: Measuring and predicting performance of Ceph clusters
Ceph Day Berlin: Measuring and predicting performance of Ceph clustersCeph Day Berlin: Measuring and predicting performance of Ceph clusters
Ceph Day Berlin: Measuring and predicting performance of Ceph clustersCeph Community
 
Scalable Storage for Massive Volume Data Systems
Scalable Storage for Massive Volume Data SystemsScalable Storage for Massive Volume Data Systems
Scalable Storage for Massive Volume Data SystemsLars Nielsen
 
From Rack scale computers to Warehouse scale computers
From Rack scale computers to Warehouse scale computersFrom Rack scale computers to Warehouse scale computers
From Rack scale computers to Warehouse scale computersRyousei Takano
 
Flexible and Scalable Domain-Specific Architectures
Flexible and Scalable Domain-Specific ArchitecturesFlexible and Scalable Domain-Specific Architectures
Flexible and Scalable Domain-Specific ArchitecturesNetronome
 

Tendances (19)

Introduction
IntroductionIntroduction
Introduction
 
HDF-EOS Development: Current Status and Tools
HDF-EOS Development: Current Status and ToolsHDF-EOS Development: Current Status and Tools
HDF-EOS Development: Current Status and Tools
 
USENIX NSDI 2016 (Session: Resource Sharing)
USENIX NSDI 2016 (Session: Resource Sharing)USENIX NSDI 2016 (Session: Resource Sharing)
USENIX NSDI 2016 (Session: Resource Sharing)
 
JetStor X Storage Products 2017! New HOT products!
JetStor X Storage Products 2017! New HOT products!JetStor X Storage Products 2017! New HOT products!
JetStor X Storage Products 2017! New HOT products!
 
Qnap NAS TS Serie x53u-catalogo
Qnap NAS TS Serie x53u-catalogoQnap NAS TS Serie x53u-catalogo
Qnap NAS TS Serie x53u-catalogo
 
JetStor NAS series 2016
JetStor NAS series 2016JetStor NAS series 2016
JetStor NAS series 2016
 
Using Ceph for Large Hadron Collider Data
Using Ceph for Large Hadron Collider DataUsing Ceph for Large Hadron Collider Data
Using Ceph for Large Hadron Collider Data
 
クラウド時代の半導体メモリー技術
クラウド時代の半導体メモリー技術クラウド時代の半導体メモリー技術
クラウド時代の半導体メモリー技術
 
Qnap nas tvs serie x63-catalogo
Qnap nas tvs serie x63-catalogoQnap nas tvs serie x63-catalogo
Qnap nas tvs serie x63-catalogo
 
Ceph Day Berlin: Scaling an Academic Cloud
Ceph Day Berlin: Scaling an Academic CloudCeph Day Berlin: Scaling an Academic Cloud
Ceph Day Berlin: Scaling an Academic Cloud
 
SanDisk: Persistent Memory and Cassandra
SanDisk: Persistent Memory and CassandraSanDisk: Persistent Memory and Cassandra
SanDisk: Persistent Memory and Cassandra
 
Adaptec’s maxCache™ 3.0 Read and Write SSD Caching Solution
Adaptec’s maxCache™ 3.0 Read and Write SSD Caching SolutionAdaptec’s maxCache™ 3.0 Read and Write SSD Caching Solution
Adaptec’s maxCache™ 3.0 Read and Write SSD Caching Solution
 
JetStor 8 series 16G FC 12G SAS units
JetStor 8 series 16G FC 12G SAS unitsJetStor 8 series 16G FC 12G SAS units
JetStor 8 series 16G FC 12G SAS units
 
Towards Application Driven Storage
Towards Application Driven StorageTowards Application Driven Storage
Towards Application Driven Storage
 
Ceph Day Berlin: Measuring and predicting performance of Ceph clusters
Ceph Day Berlin: Measuring and predicting performance of Ceph clustersCeph Day Berlin: Measuring and predicting performance of Ceph clusters
Ceph Day Berlin: Measuring and predicting performance of Ceph clusters
 
Scalable Storage for Massive Volume Data Systems
Scalable Storage for Massive Volume Data SystemsScalable Storage for Massive Volume Data Systems
Scalable Storage for Massive Volume Data Systems
 
GIST AI-X Computing Cluster
GIST AI-X Computing ClusterGIST AI-X Computing Cluster
GIST AI-X Computing Cluster
 
From Rack scale computers to Warehouse scale computers
From Rack scale computers to Warehouse scale computersFrom Rack scale computers to Warehouse scale computers
From Rack scale computers to Warehouse scale computers
 
Flexible and Scalable Domain-Specific Architectures
Flexible and Scalable Domain-Specific ArchitecturesFlexible and Scalable Domain-Specific Architectures
Flexible and Scalable Domain-Specific Architectures
 

Similaire à Storage Geeks 101 - 2019

SDC20 ScaleFlux.pptx
SDC20 ScaleFlux.pptxSDC20 ScaleFlux.pptx
SDC20 ScaleFlux.pptxssuserabc741
 
Heterogeneous Computing : The Future of Systems
Heterogeneous Computing : The Future of SystemsHeterogeneous Computing : The Future of Systems
Heterogeneous Computing : The Future of SystemsAnand Haridass
 
Seagate – Next Level Storage (Webinar mit Boston Server & Storage, 2018 09-28)
Seagate – Next Level Storage (Webinar mit Boston Server & Storage,  2018 09-28)Seagate – Next Level Storage (Webinar mit Boston Server & Storage,  2018 09-28)
Seagate – Next Level Storage (Webinar mit Boston Server & Storage, 2018 09-28)BOSTON Server & Storage Solutions GmbH
 
Building Data Pipelines with SMACK: Designing Storage Strategies for Scale an...
Building Data Pipelines with SMACK: Designing Storage Strategies for Scale an...Building Data Pipelines with SMACK: Designing Storage Strategies for Scale an...
Building Data Pipelines with SMACK: Designing Storage Strategies for Scale an...DataStax
 
2018 Infortrend All Flash Arrays Introduction (GS3025A)
2018 Infortrend All Flash Arrays Introduction (GS3025A)2018 Infortrend All Flash Arrays Introduction (GS3025A)
2018 Infortrend All Flash Arrays Introduction (GS3025A)infortrendgroup
 
Accelerating hbase with nvme and bucket cache
Accelerating hbase with nvme and bucket cacheAccelerating hbase with nvme and bucket cache
Accelerating hbase with nvme and bucket cacheDavid Grier
 
Mass storage structurefinal
Mass storage structurefinalMass storage structurefinal
Mass storage structurefinalmarangburu42
 
What is the future of disk drives?
What is the future of disk drives?What is the future of disk drives?
What is the future of disk drives?Iftikhar Alam
 
The future of tape april 16
The future of tape april 16The future of tape april 16
The future of tape april 16Josef Weingand
 
QCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference ArchitectureQCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference ArchitectureCeph Community
 
QCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference ArchitectureQCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference ArchitecturePatrick McGarry
 
Accelerating SSD Performance with HLNAND
Accelerating SSD Performance with HLNANDAccelerating SSD Performance with HLNAND
Accelerating SSD Performance with HLNANDrrschuetz
 
Sigman North Ltd Bright Talk June13th Nigel Wakefield Storage Predictions Vs ...
Sigman North Ltd Bright Talk June13th Nigel Wakefield Storage Predictions Vs ...Sigman North Ltd Bright Talk June13th Nigel Wakefield Storage Predictions Vs ...
Sigman North Ltd Bright Talk June13th Nigel Wakefield Storage Predictions Vs ...nigelwakefield
 
NVMe over Fibre Channel Introduction
NVMe over Fibre Channel IntroductionNVMe over Fibre Channel Introduction
NVMe over Fibre Channel IntroductionCalvin Zito
 
Red hat Storage Day LA - Designing Ceph Clusters Using Intel-Based Hardware
Red hat Storage Day LA - Designing Ceph Clusters Using Intel-Based HardwareRed hat Storage Day LA - Designing Ceph Clusters Using Intel-Based Hardware
Red hat Storage Day LA - Designing Ceph Clusters Using Intel-Based HardwareRed_Hat_Storage
 
The Power of One: Supermicro’s High-Performance Single-Processor Blade Systems
The Power of One: Supermicro’s High-Performance Single-Processor Blade SystemsThe Power of One: Supermicro’s High-Performance Single-Processor Blade Systems
The Power of One: Supermicro’s High-Performance Single-Processor Blade SystemsRebekah Rodriguez
 
Oracle real application_cluster
Oracle real application_clusterOracle real application_cluster
Oracle real application_clusterPrabhat gangwar
 

Similaire à Storage Geeks 101 - 2019 (20)

SDC20 ScaleFlux.pptx
SDC20 ScaleFlux.pptxSDC20 ScaleFlux.pptx
SDC20 ScaleFlux.pptx
 
Heterogeneous Computing : The Future of Systems
Heterogeneous Computing : The Future of SystemsHeterogeneous Computing : The Future of Systems
Heterogeneous Computing : The Future of Systems
 
Welcome to the Datasphere – the next level of storage
Welcome to the Datasphere – the next level of storageWelcome to the Datasphere – the next level of storage
Welcome to the Datasphere – the next level of storage
 
Seagate – Next Level Storage (Webinar mit Boston Server & Storage, 2018 09-28)
Seagate – Next Level Storage (Webinar mit Boston Server & Storage,  2018 09-28)Seagate – Next Level Storage (Webinar mit Boston Server & Storage,  2018 09-28)
Seagate – Next Level Storage (Webinar mit Boston Server & Storage, 2018 09-28)
 
San Presentation
San PresentationSan Presentation
San Presentation
 
Building Data Pipelines with SMACK: Designing Storage Strategies for Scale an...
Building Data Pipelines with SMACK: Designing Storage Strategies for Scale an...Building Data Pipelines with SMACK: Designing Storage Strategies for Scale an...
Building Data Pipelines with SMACK: Designing Storage Strategies for Scale an...
 
2018 Infortrend All Flash Arrays Introduction (GS3025A)
2018 Infortrend All Flash Arrays Introduction (GS3025A)2018 Infortrend All Flash Arrays Introduction (GS3025A)
2018 Infortrend All Flash Arrays Introduction (GS3025A)
 
Accelerating hbase with nvme and bucket cache
Accelerating hbase with nvme and bucket cacheAccelerating hbase with nvme and bucket cache
Accelerating hbase with nvme and bucket cache
 
Mass storage structurefinal
Mass storage structurefinalMass storage structurefinal
Mass storage structurefinal
 
What is the future of disk drives?
What is the future of disk drives?What is the future of disk drives?
What is the future of disk drives?
 
The future of tape april 16
The future of tape april 16The future of tape april 16
The future of tape april 16
 
QCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference ArchitectureQCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference Architecture
 
QCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference ArchitectureQCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference Architecture
 
Accelerating SSD Performance with HLNAND
Accelerating SSD Performance with HLNANDAccelerating SSD Performance with HLNAND
Accelerating SSD Performance with HLNAND
 
Sigman North Ltd Bright Talk June13th Nigel Wakefield Storage Predictions Vs ...
Sigman North Ltd Bright Talk June13th Nigel Wakefield Storage Predictions Vs ...Sigman North Ltd Bright Talk June13th Nigel Wakefield Storage Predictions Vs ...
Sigman North Ltd Bright Talk June13th Nigel Wakefield Storage Predictions Vs ...
 
QNAP TS-832PX-4G.pdf
QNAP TS-832PX-4G.pdfQNAP TS-832PX-4G.pdf
QNAP TS-832PX-4G.pdf
 
NVMe over Fibre Channel Introduction
NVMe over Fibre Channel IntroductionNVMe over Fibre Channel Introduction
NVMe over Fibre Channel Introduction
 
Red hat Storage Day LA - Designing Ceph Clusters Using Intel-Based Hardware
Red hat Storage Day LA - Designing Ceph Clusters Using Intel-Based HardwareRed hat Storage Day LA - Designing Ceph Clusters Using Intel-Based Hardware
Red hat Storage Day LA - Designing Ceph Clusters Using Intel-Based Hardware
 
The Power of One: Supermicro’s High-Performance Single-Processor Blade Systems
The Power of One: Supermicro’s High-Performance Single-Processor Blade SystemsThe Power of One: Supermicro’s High-Performance Single-Processor Blade Systems
The Power of One: Supermicro’s High-Performance Single-Processor Blade Systems
 
Oracle real application_cluster
Oracle real application_clusterOracle real application_cluster
Oracle real application_cluster
 

Dernier

Optimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through ObservabilityOptimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through ObservabilityScyllaDB
 
IESVE for Early Stage Design and Planning
IESVE for Early Stage Design and PlanningIESVE for Early Stage Design and Planning
IESVE for Early Stage Design and PlanningIES VE
 
10 Differences between Sales Cloud and CPQ, Blanka Doktorová
10 Differences between Sales Cloud and CPQ, Blanka Doktorová10 Differences between Sales Cloud and CPQ, Blanka Doktorová
10 Differences between Sales Cloud and CPQ, Blanka DoktorováCzechDreamin
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Product School
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupCatarinaPereira64715
 
UiPath Test Automation using UiPath Test Suite series, part 1
UiPath Test Automation using UiPath Test Suite series, part 1UiPath Test Automation using UiPath Test Suite series, part 1
UiPath Test Automation using UiPath Test Suite series, part 1DianaGray10
 
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya HalderCustom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya HalderCzechDreamin
 
Introduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG EvaluationIntroduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG EvaluationZilliz
 
Salesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone KomSalesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone KomCzechDreamin
 
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi IbrahimzadeFree and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi IbrahimzadeCzechDreamin
 
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)Julian Hyde
 
UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2DianaGray10
 
Demystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John StaveleyDemystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John StaveleyJohn Staveley
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3DianaGray10
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor TurskyiFwdays
 
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Jeffrey Haguewood
 
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀DianaGray10
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...Product School
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...Product School
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...Elena Simperl
 

Dernier (20)

Optimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through ObservabilityOptimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through Observability
 
IESVE for Early Stage Design and Planning
IESVE for Early Stage Design and PlanningIESVE for Early Stage Design and Planning
IESVE for Early Stage Design and Planning
 
10 Differences between Sales Cloud and CPQ, Blanka Doktorová
10 Differences between Sales Cloud and CPQ, Blanka Doktorová10 Differences between Sales Cloud and CPQ, Blanka Doktorová
10 Differences between Sales Cloud and CPQ, Blanka Doktorová
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User Group
 
UiPath Test Automation using UiPath Test Suite series, part 1
UiPath Test Automation using UiPath Test Suite series, part 1UiPath Test Automation using UiPath Test Suite series, part 1
UiPath Test Automation using UiPath Test Suite series, part 1
 
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya HalderCustom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
 
Introduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG EvaluationIntroduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG Evaluation
 
Salesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone KomSalesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
 
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi IbrahimzadeFree and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
 
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
 
UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2
 
Demystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John StaveleyDemystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John Staveley
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi
 
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
 
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 

Storage Geeks 101 - 2019

  • 1. STORAGE GEEKS 101 - 2019 Andrew McGee NOV 2019 @Andrew_McGee
  • 2. STORAGE GEEKS A Short History of Electromagnetic Disk Drives
  • 3. The Hard Disk Drive Diagram: I, Surachit, CC BY-SA 3.0, https://commons.wikimedia.org/w/index.php?curid=4275514 Two main avenues of technology improvement: ● Capacity/Areal Density ● Performance HDD Advancement
  • 4. Engineers VS The Laws of Nature Engineering Corner ● Nano tech ● Miniaturization ● Materials Science Engineering Thermal instability at nanoscale causes magnetic particles to flip. Nature Superparamagnetic Effect
  • 5. Perpendicular - PMR Increased storage density over longitudinal recording. Increased platter thickness & complexity. ● 2005 - Toshiba ● 2006 - Seagate ● 2006 - Hitachi ● 2006 - Western Digital ● 2006 - Fujitsu HDD Capacity Advancement Diagram: Public Domain image by Luca Cassioli 2005, https://commons.wikimedia.org/wiki/File:Perpendicular_Recording_Diagram.svg
  • 6. Shingled - SMR Increased storage density by 25%. New tracks overlap old ones and must overwrite them. Write penalty! Host or Device managed. libzbc - Zoned Block Device library manages write performance. ● 2013 - Seagate ● 2014 - HGST HDD Capacity Advancement
  • 7. Helium Filled 30% less weight to data ratio 25% higher MTBF. 23% less power. Using SMR or PMR hermetically sealed and filled with helium. ● 2013 - HGST 7 platter 6TB ● 2016 - Seagate 7 platter 10TB ● 2017 - Toshiba 9 platter 14TB ● 2019 - Seagate 9 platter 16TB ● 2019 - HGST 9 platter 18TB HDD Capacity Advancement
  • 8. Heat Assisted - HAMR Increased storage density by 10x. Laser heats the magnetic material before writing. Heat, write and cool in 1 nano second. Seagate ● 2019 - 16TB ● 2020 - 20TB ● 2024 - 48TB HDD Capacity Advancement "The technology is very, very difficult, and there has been a lot of skepticism” - IDC 2013
  • 9. Microwave Assisted - MAMR Alternative to HAMR backed by HGST and Toshiba. Spin Torque Oscillators (STO) to generate 20-40 GHz microwaves HGST ● 2019 - 16TB ● 2019 - 18TB ● 2020 - 20TB HDD Capacity Advancement
  • 10. Bit Patterned Media - BPM A future technology to increase areal density demonstrated by Toshiba. Media is pre-patterned into magnetic islands using nano-lithography. 1 grain per bit instead of 20-30 today to avoid superparamagnetism. Possible areal densities up to 20-300 Tb/in2 as opposed to the 1 Tb/in2. ● 2010 - Toshiba PoC HDD Capacity Advancement
  • 11. Common Connection Protocols SAS 12Gbps Serial Attached SCSI - Enterprise drives based on SCSI command set NL-SAS 12Gbps SAS Controller with SATA based media, disk and performance SATA 6Gbps SATA - Mostly consumer based on Parallel ATA command set
  • 12. HDD Summary Researchers fight for areal density against superparamagnetism Techniques such as PMR, SMR, HAMR and MAMR increase areal density Helium filled HDD’s allow for more platters in same form factor
  • 13. STORAGE GEEKS A Short History of Solid State Drives
  • 14. The Solid State Drive Diagram: Toniperis, CC BY-SA 4.0, https://commons.wikimedia.org/wiki/File:NAND-ssd.svg NAND Flash Memory storage DWPD instead of MTBF Two main avenues of technology improvement: ● Capacity/Density ● Reliability SSD Advancement
  • 15. Flash NAND Types SLC Single Level Cell 1 bit per cell - Enterprise class, expensive - 100,000 e/w cycles MLC Multi Level Cell - enterprise & consumer, higher capacity, 10,000 e/w cycles TLC Triple Level Cell - cheapest, highest capacity, consumer, 5,000 e/w cycles 3DXP 3D XPoint (Intel Optane) not based on NAND does not store a current
  • 16. Flash Controllers Embedded processor with firmware Critical to drive performance FTL provides logical to physical mapping and LBA ● Bad block mapping ● Wear leveling ● Queuing/Caching ● Garbage collection Flash Storage Brain Diagram: Dmitry Nosachev, CC BY-SA 4.0, https://en.wikipedia.org/wiki/File:NAND_Flash_Pages_and_Blocks.svg
  • 17. Common Connection Protocols SAS 12Gbps Serial Attached SCSI - Enterprise drives based on SCSI command set M.2 Form factor that can use SATA or PCIe (NVMe) SATA 6Gbps SATA - Mostly consumer based on Parallel ATA command set PCIe 31.5 Gbps PCIe 3.0 x4 - typically uses NVMe command set
  • 18. SSD Summary Flash resiliency based on SLC, MLC or TLC NAND Flash controllers use sophisticated techniques for endurance and performance Traditional or new connection types and form factors
  • 19. STORAGE GEEKS A Short Evolution of Enterprise Storage
  • 20. Block Storage Network - SAN Modular or monolithic, Intelligent controllers Redundant multi pathing, dedicated network FC, iSCSI, FCoE, NVMe, IB Block Characteristics: ● Very fast ● Data written in blocks ● Not human friendly ● Complex addressing Storage Array Ethernet Switch FC Switch Server Clients Storage Array
  • 21. File Storage Network - NAS Usually Modular, Intelligent controllers Redundancy via dedicated or existing ethernet network NFS, CIFS File Characteristics: ● Pretty fast ● Data written in files ● Human friendly paths ● Standard networking Storage Array (Filer) Ethernet Switch Servers Clients NAS Storage
  • 22. Object Storage - Cloud Usually massively scale-out Redundancy via multiple data copies REST API (http), Swift, S3 Object Characteristics: ● Not usually fast ● Data written in objects ● Eventually consistent ● Human friendly metadata ● Cloud networking Content Addressable Storage Ethernet Switch Servers Clients Object Storage WAN
  • 23. Data Protection Techniques RAID Redundant Array of Independent Disks RAID 0 - striping only, no protection RAID 1 - exact mirroring RAID 5 - 5D+1P, parity blks striped RAID 6 - 4D+2P, parity blks striped Performance only Least capacity 1 disk fail 2 disk fail
  • 24. Data Protection Techniques RAID Parity and striping across block devices to create sets of redundancy Multipath Redundant network paths from host to storage (dual HBA/NIC at host) EC Erasure coding saves data in fragments with parity across different locations Mirror Storage array level synchronous and asynchronous mirroring of data (DR/BC) Cache Battery or super-capacitor backed up cache
  • 25. Data Optimisation/Reduction Compression Algo to reduce redundant blocks, whitespace etc (unstructured data) Deduplication Avoids storing duplicate blocks (CPU intensive, rehydration, mapping) ● RAW Storage ● Usable storage ● Effective storage ● Decimal GB vs GiB Storage Vendor Terms
  • 26. Storage Array Summary Block, File and Object storage characteristics Data protection techniques employed by storage - RAID, EC, Mirroring Data reduction techniques employed by storage - Compression and Dedupe
  • 27. STORAGE GEEKS A Quick Look at CEPH
  • 28. Ceph Storage Object Built on scale out object storage but also provides interfaces to block and file REST Uses http to provide S3 or Swift compatibility Metadata Metadata servers to control data mapping and redundancy CRUSH Hashing algo to calculate storage and retrieval of data in distributed cluster SDS Software defined storage abstracts the intelligence from the hardware
  • 29. STORAGE GEEKS Andrew McGee NOV 2019 @Andrew_McGee THANKS!
  • 30. Reference 03 - The HDD https://en.wikipedia.org/wiki/Hard_disk_drive 04 - Engineers VS Nature https://en.wikipedia.org/wiki/Superparamagnetism 05 - PMR https://en.wikipedia.org/wiki/Perpendicular_recording 06 - SMR https://en.wikipedia.org/wiki/Shingled_magnetic_recording https://www.anandtech.com/show/7290/seagate-to-ship-5tb-hdd-in-2014-using-shingled-magnetic-recording 07 - Helium https://www.backblaze.com/blog/helium-filled-hard-drive-failure-rates/ https://www.anandtech.com/show/10106/western-digital-introduces-its-consumer-helium-drives 08 - HAMR https://blog.seagate.com/intelligent/hamr-next-leap-forward-now/ https://en.wikipedia.org/wiki/Heat-assisted_magnetic_recording https://fstoppers.com/originals/hamr-and-mamr-technologies-will-unlock-hard-drive-capacity-year-326328 https://www.backblaze.com/blog/hamr-hard-drives/ 09 - MAMR https://www.theregister.co.uk/2019/03/07/hamr_and_mamr_hdd_direction_debate/ https://blog.westerndigital.com/mamr-technology-breakthrough/ https://www.anandtech.com/show/13764/western-digital-2019-16tb-hdd-mamr-hamr https://www.anandtech.com/print/11925/western-digital-stuns-storage-industry-with-mamr-breakthrough-for-nextgen-hdds 10 - BPM https://www.toshiba.co.jp/rdc/rd/fields/11_e09_e.htm 11 - Connection Protocols https://en.wikipedia.org/wiki/Serial_Attached_SCSI https://en.wikipedia.org/wiki/Serial_ATA
  • 31. Reference 14 - The SSD https://en.wikipedia.org/wiki/Solid-state_drive 15 - Flash NAND https://www.mydigitaldiscount.com/everything-you-need-to-know-about-slc-mlc-and-tlc-nand-flash.html 16 - Flash Controllers https://en.wikipedia.org/wiki/Wear_leveling https://en.wikipedia.org/wiki/Write_amplification 17 - Connection protocols https://en.wikipedia.org/wiki/M.2

Notes de l'éditeur

  1. helium, which is less dense than air, creates less drag and turbulence Key challenge in using helium inside of a hard drive: helium escapes from most containers SMART 22 attribute is the status of the Helium in a drive seven platters in the same space required for five in conventional hard drives, reducing the weight-to-data ratio by 30% smaller motor required to drive the disk consumes 23 percent less power and runs 4-5 degrees cooler
  2. Ref: https://blog.seagate.com/intelligent/hamr-next-leap-forward-now/ https://en.wikipedia.org/wiki/Heat-assisted_magnetic_recording https://fstoppers.com/originals/hamr-and-mamr-technologies-will-unlock-hard-drive-capacity-year-326328 https://www.backblaze.com/blog/hamr-hard-drives/
  3. Spin Torque Oscillators (STO) to generate the 20-40 GHz microwaves that lower the coercivity of the media lower the resistance of the underlying recording media, which in turn makes it easier to write bits on the platter
  4. Records data in magnetic islands (one bit per island), as opposed to current hard disk drive technology where each bit is stored in 20-30 magnetic grains
  5. SSDs based on NAND Flash will slowly leak charge over time if left for long periods without power. This causes worn-out drives (that have exceeded their endurance rating) to start losing data typically after one year Endurance Drive Writes Per Day - 1 - 25 full drive writes for every day of warranty (5 years)
  6. An Object is a file with sets of metadata to describe it.