SlideShare une entreprise Scribd logo
1  sur  17
Télécharger pour lire hors ligne
Di Wang Extreme Storage Architecture & Development (ESAD), Intel
SPDK, PMDK & Vtune™ Summit 2
Agenda
• DAOS (Distributed Asynchronous Object Storage) Overview
• DAOS Architecture & features
• DAOS Storage Model
• DAOS with PMDK & SPDK
• Current Performance & Resource
SPDK, PMDK & Vtune™ Summit 3
Storage revolution
90
25
20
15
10
5
0
NAND SSD
(4kB Read)
Intel® Optane SSD
(4kB Read)
Legend
NVM Media Read
PCIe & NVMe protocol
Software (File System, OS, Driver)
LatencyfromApp(uS)
Intel® Optane NVDIMMs
(64B Read)
SPDK, PMDK & Vtune™ Summit 4
DAOS overview
DAOS Storage Engine
Open Source Apache 2.0 License
HDD
POSIX I/O
3rd Party Applications
Rich Data Models
Storage Platform
Storage Media
Workflow
HDF5 SQL …
Intel® QLC 3D Nand SSD
SPDK, PMDK & Vtune™ Summit 5
Lightweight I/O
Mercury userspace function shipping
§ MPI equivalent communications latency
§ Built over libfabric
Applications link directly with DAOS lib
§ Direct call, no context switch
§ Small memory footprint
§ No locking, caching or data copy
Userspace DAOS server
§ Mmap non-volatile memory via PMDK
§ NVMe access through SPDK/Blobstore
AI/Analytics/Simulation Workflow
DAOS library
Mercury/Libfabric
NVMe
SSDs
Bulk
transfers
SPDK
PMDK
RPC
HDF5
SCM
File (No)SQL…
DAOS
Service
SPDK, PMDK & Vtune™ Summit 6
Storage Model
DAOS provides a rich storage API
§ New scalable storage model suitable for both structured &
unstructured data
– key-value stores, multi-dimensional arrays, columnar
databases, …
– Accelerate data analytic/AI frameworks
§ Non-blocking data & metadata operations
§ Ad-hoc concurrency control mechanism
Pool
§ Reservation of distributed storage
§ Predictable/extendable performance/capacity
Container
§ Aggregate related datasets into manageable entity
§ Unit of snapshot/transaction
Object
§ Key-array store with own distribution/resilience schema
§ Multi-level key for fine-grain control over colocation of related data
Record
§ Arbitrary binary blob from single byte to several Mbytes
Storage Pool Container Object Record
SPDK, PMDK & Vtune™ Summit 7
Fine-grained I/O
Mix of storage technologies
§ Storage Class Memory
– DAOS metadata & application metadata
– Byte-granular application data
§ NVMe SSD (*NAND)
– Cheaper storage for bulk data (e.g. checkpoints)
– Multi-KB
I/Os are logged & inserted into persistent index
§ Non-destructive write & consistent read
§ No alignment constraints
§ No read-modify-write
v1
v2
v3
read@v3 Application
Buffer
Server-side
Index
Bulk descriptor segments
SPDK, PMDK & Vtune™ Summit 8
DATA Management
Data Security & Reduction
§ Online real-time data encryption &
compression
§ Hardware acceleration
Data Distribution
§ Algorithmic placement
Data Protection
§ Declustered replication & erasure code
§ Fault-domain aware placement
§ Self-healing
§ End-to-end data integrity
Hash (object.Dkey)
Hash (object.Dkey)
Fault
domain
separation
SPDK, PMDK & Vtune™ Summit 9
Pool Storage on DAOS Server
DAOS Service
Argobots Xstream
PMDK
pmemobj
SPDK Blob
SCM
NVMe SSD
PMDK
pmemobj
PMDK
pmemobj
PMDK
pmemobj
PMDK
pmemobj
SPDK Blob SPDK Blob SPDK Blob SPDK Blob
NVMe block
allocation Info
PMDK
pmemobj
SPDK Blob
SPDK, PMDK & Vtune™ Summit 10
DAOS I/O over PMDK/SPDK
SCM
NVMe
DAOS Xstream
§ Reserve new buffer
§ Either reserve by pmemobj_reserve
§ Or reserve in NVME SSD
SPDK, PMDK & Vtune™ Summit 11
DAOS I/O over PMDK/SPDK
11
SCM
NVMe
DAOS Xstream
§ Reserve new buffer
§ Either reserve by pmemobj_reserve
§ Or reserve in NVME SSD
§ Start RDMA transfer to newly allocated buffer
§ Either transfer to PMEM
§ Or transfer to DMA buffer then to NVME SSD
§ Start pmemobj transaction
SPDK, PMDK & Vtune™ Summit 12
DAOS I/O over PMDK/SPDK
SCM
NVMe
DAOS Xstream
§ Reserve new buffer
§ Either reserve by pmemobj_reserve
§ Or reserve in NVME SSD
§ Start RDMA transfer to newly allocated buffer
§ Either transfer to PMEM
§ Or transfer to DMA buffer then to NVME SSD
§ Start pmemobj transaction
§ Modify index to insert new extent
SPDK, PMDK & Vtune™ Summit 13
DAOS I/O over PMDK/SPDK
13
SCM
NVMe
DAOS Xstream
§ Reserve new buffer
§ Either reserve by pmemobj_reserve
§ Or reserve in NVME SSD
§ Start RDMA transfer to newly allocated buffer
§ Either transfer to PMEM
§ Or transfer to DMA buffer then to NVME SSD
§ Start pmemobj transaction
§ Modify index to insert new extent
§ Publish the reserve the space.
§ Either pmemobj_tx_publish() for SCM.
§ Or publish the space for NVMe SSD.
§ Commit pmemobj transaction and reply to client
SPDK, PMDK & Vtune™ Summit 14
DAOS Performance
34996
188782
282017
407431
469666 472509 502516
0
200000
400000
600000
800000
1000000
1200000
1 8 16 32 64 128 256
IOPS
Number of Clients
IOR Write - 1024 I/O size
62392
326432
434839
829526 875873
773290
1019720
0
200000
400000
600000
800000
1000000
1200000
1 8 16 32 64 128 256
IOPS
Number of Clients
IOR Read - 1024B I/O size
• IOR runs on remote clients sending the I/O requests to the single DAOS server over the fabric
• Intel Omni-Path Host Adapter 100HFA016LS
• Using the DAOS MPI-IO driver with the full DAOS stack (client, network, server)
• Cascade Lake CPUs, 6 Dimms 512G AEP NMA1XBD512GQSE
SPDK, PMDK & Vtune™ Summit 15
DAOS Community Roadmap
All information provided in this roadmap is subject to change without notice.
1Q19 2Q19 3Q19 4Q19 1Q20 2Q20 3Q20 4Q20 1Q21 2Q21 3Q21 4Q21 1Q22 2Q22 3Q22
Pre-1.0 releases & RCs 1.0 1.2 1.4 2.0 2.2 2.4
DAOS:
- Replication with self-healing
- Persistent Memory support
- NVMe SSD support
- Self monitoring & bootstrap
- Initial control plane
- python/golang API bindings
I/O Middleware:
- MPI-IO driver
- HDF5 DAOS Connector (proto)
- POSIX I/O (proto)
DAOS:
- Per-pool ACL
- Lustre integration
I/O Middleware:
- HDF5 DAOS Connector
- POSIX I/O support
- Spark
DAOS:
- End-to-end data integrity
- Per-container ACL
- SmartNICs & accelerators
- Improved control plane
DAOS:
- Online server addition
- Advanced control plane
I/O Middleware:
- POSIX data mover
- Async HDF5 operations over DAOS
DAOS:
- Erasure code
- Telemetry & per-job statistics
- Multi OFI provider support
I/O Middleware:
- Advanced POSIX I/O support
- Advanced data mover
Partner engagement & PoCs
DAOS:
- Progressive layout / GIGA+
- Placement optimizations
- Checksum scrubbing
I/O Middleware:
- Apache Arrow (not POR)
DAOS:
- Catastrophic recovery tools
SPDK, PMDK & Vtune™ Summit 16
Resource
Source code on GitHub
https://github.com/daos-stack/daos
Community mailing list on Groups.io
daos@daos.groups.io or https://daos.groups.io/g/daos
Wiki
http://daos.io or https://wiki.hpdd.intel.com
Bug tracker
https://jira.hpdd.intel.com
Big Data Uses with Distributed Asynchronous Object Storage

Contenu connexe

Tendances

Benefity Oracle Cloudu (4/4): Storage
Benefity Oracle Cloudu (4/4): StorageBenefity Oracle Cloudu (4/4): Storage
Benefity Oracle Cloudu (4/4): StorageMarketingArrowECS_CZ
 
Ceph Day Beijing - SPDK for Ceph
Ceph Day Beijing - SPDK for CephCeph Day Beijing - SPDK for Ceph
Ceph Day Beijing - SPDK for CephDanielle Womboldt
 
Ceph - High Performance Without High Costs
Ceph - High Performance Without High CostsCeph - High Performance Without High Costs
Ceph - High Performance Without High CostsJonathan Long
 
SOUG_SDM_OracleDB_V3
SOUG_SDM_OracleDB_V3SOUG_SDM_OracleDB_V3
SOUG_SDM_OracleDB_V3UniFabric
 
Software-Defined Storage (SDS)
Software-Defined Storage (SDS)Software-Defined Storage (SDS)
Software-Defined Storage (SDS)HTS Hosting
 
SSDs: A New Generation of Storage Devices
SSDs: A New Generation of Storage DevicesSSDs: A New Generation of Storage Devices
SSDs: A New Generation of Storage DevicesHTS Hosting
 
NGENSTOR_ODA_HPDA
NGENSTOR_ODA_HPDANGENSTOR_ODA_HPDA
NGENSTOR_ODA_HPDAUniFabric
 
Red Hat Storage Day Atlanta - Designing Ceph Clusters Using Intel-Based Hardw...
Red Hat Storage Day Atlanta - Designing Ceph Clusters Using Intel-Based Hardw...Red Hat Storage Day Atlanta - Designing Ceph Clusters Using Intel-Based Hardw...
Red Hat Storage Day Atlanta - Designing Ceph Clusters Using Intel-Based Hardw...Red_Hat_Storage
 
SOUG_Deployment__Automation_DB
SOUG_Deployment__Automation_DBSOUG_Deployment__Automation_DB
SOUG_Deployment__Automation_DBUniFabric
 
Ceph Day Shanghai - Recovery Erasure Coding and Cache Tiering
Ceph Day Shanghai - Recovery Erasure Coding and Cache TieringCeph Day Shanghai - Recovery Erasure Coding and Cache Tiering
Ceph Day Shanghai - Recovery Erasure Coding and Cache TieringCeph Community
 
Using Recently Published Ceph Reference Architectures to Select Your Ceph Con...
Using Recently Published Ceph Reference Architectures to Select Your Ceph Con...Using Recently Published Ceph Reference Architectures to Select Your Ceph Con...
Using Recently Published Ceph Reference Architectures to Select Your Ceph Con...Patrick McGarry
 
Ceph: Low Fail Go Scale
Ceph: Low Fail Go Scale Ceph: Low Fail Go Scale
Ceph: Low Fail Go Scale Ceph Community
 
IMCSummit 2015 - Day 1 Developer Session - The Science and Engineering Behind...
IMCSummit 2015 - Day 1 Developer Session - The Science and Engineering Behind...IMCSummit 2015 - Day 1 Developer Session - The Science and Engineering Behind...
IMCSummit 2015 - Day 1 Developer Session - The Science and Engineering Behind...In-Memory Computing Summit
 
Ceph Day Beijing - Storage Modernization with Intel and Ceph
Ceph Day Beijing - Storage Modernization with Intel and CephCeph Day Beijing - Storage Modernization with Intel and Ceph
Ceph Day Beijing - Storage Modernization with Intel and CephDanielle Womboldt
 
Ceph Day Beijing - Ceph All-Flash Array Design Based on NUMA Architecture
Ceph Day Beijing - Ceph All-Flash Array Design Based on NUMA ArchitectureCeph Day Beijing - Ceph All-Flash Array Design Based on NUMA Architecture
Ceph Day Beijing - Ceph All-Flash Array Design Based on NUMA ArchitectureDanielle Womboldt
 
Red Hat Ceph Storage Acceleration Utilizing Flash Technology
Red Hat Ceph Storage Acceleration Utilizing Flash Technology Red Hat Ceph Storage Acceleration Utilizing Flash Technology
Red Hat Ceph Storage Acceleration Utilizing Flash Technology Red_Hat_Storage
 
Accelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDS
Accelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDSAccelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDS
Accelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDSCeph Community
 

Tendances (17)

Benefity Oracle Cloudu (4/4): Storage
Benefity Oracle Cloudu (4/4): StorageBenefity Oracle Cloudu (4/4): Storage
Benefity Oracle Cloudu (4/4): Storage
 
Ceph Day Beijing - SPDK for Ceph
Ceph Day Beijing - SPDK for CephCeph Day Beijing - SPDK for Ceph
Ceph Day Beijing - SPDK for Ceph
 
Ceph - High Performance Without High Costs
Ceph - High Performance Without High CostsCeph - High Performance Without High Costs
Ceph - High Performance Without High Costs
 
SOUG_SDM_OracleDB_V3
SOUG_SDM_OracleDB_V3SOUG_SDM_OracleDB_V3
SOUG_SDM_OracleDB_V3
 
Software-Defined Storage (SDS)
Software-Defined Storage (SDS)Software-Defined Storage (SDS)
Software-Defined Storage (SDS)
 
SSDs: A New Generation of Storage Devices
SSDs: A New Generation of Storage DevicesSSDs: A New Generation of Storage Devices
SSDs: A New Generation of Storage Devices
 
NGENSTOR_ODA_HPDA
NGENSTOR_ODA_HPDANGENSTOR_ODA_HPDA
NGENSTOR_ODA_HPDA
 
Red Hat Storage Day Atlanta - Designing Ceph Clusters Using Intel-Based Hardw...
Red Hat Storage Day Atlanta - Designing Ceph Clusters Using Intel-Based Hardw...Red Hat Storage Day Atlanta - Designing Ceph Clusters Using Intel-Based Hardw...
Red Hat Storage Day Atlanta - Designing Ceph Clusters Using Intel-Based Hardw...
 
SOUG_Deployment__Automation_DB
SOUG_Deployment__Automation_DBSOUG_Deployment__Automation_DB
SOUG_Deployment__Automation_DB
 
Ceph Day Shanghai - Recovery Erasure Coding and Cache Tiering
Ceph Day Shanghai - Recovery Erasure Coding and Cache TieringCeph Day Shanghai - Recovery Erasure Coding and Cache Tiering
Ceph Day Shanghai - Recovery Erasure Coding and Cache Tiering
 
Using Recently Published Ceph Reference Architectures to Select Your Ceph Con...
Using Recently Published Ceph Reference Architectures to Select Your Ceph Con...Using Recently Published Ceph Reference Architectures to Select Your Ceph Con...
Using Recently Published Ceph Reference Architectures to Select Your Ceph Con...
 
Ceph: Low Fail Go Scale
Ceph: Low Fail Go Scale Ceph: Low Fail Go Scale
Ceph: Low Fail Go Scale
 
IMCSummit 2015 - Day 1 Developer Session - The Science and Engineering Behind...
IMCSummit 2015 - Day 1 Developer Session - The Science and Engineering Behind...IMCSummit 2015 - Day 1 Developer Session - The Science and Engineering Behind...
IMCSummit 2015 - Day 1 Developer Session - The Science and Engineering Behind...
 
Ceph Day Beijing - Storage Modernization with Intel and Ceph
Ceph Day Beijing - Storage Modernization with Intel and CephCeph Day Beijing - Storage Modernization with Intel and Ceph
Ceph Day Beijing - Storage Modernization with Intel and Ceph
 
Ceph Day Beijing - Ceph All-Flash Array Design Based on NUMA Architecture
Ceph Day Beijing - Ceph All-Flash Array Design Based on NUMA ArchitectureCeph Day Beijing - Ceph All-Flash Array Design Based on NUMA Architecture
Ceph Day Beijing - Ceph All-Flash Array Design Based on NUMA Architecture
 
Red Hat Ceph Storage Acceleration Utilizing Flash Technology
Red Hat Ceph Storage Acceleration Utilizing Flash Technology Red Hat Ceph Storage Acceleration Utilizing Flash Technology
Red Hat Ceph Storage Acceleration Utilizing Flash Technology
 
Accelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDS
Accelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDSAccelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDS
Accelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDS
 

Similaire à Big Data Uses with Distributed Asynchronous Object Storage

Storage user cases
Storage user casesStorage user cases
Storage user casesAndrea Mauro
 
Optimized HPC/AI cloud with OpenStack acceleration service and composable har...
Optimized HPC/AI cloud with OpenStack acceleration service and composable har...Optimized HPC/AI cloud with OpenStack acceleration service and composable har...
Optimized HPC/AI cloud with OpenStack acceleration service and composable har...Shuquan Huang
 
Speeding time to insight: The Dell PowerEdge C6620 with Dell PERC 12 RAID con...
Speeding time to insight: The Dell PowerEdge C6620 with Dell PERC 12 RAID con...Speeding time to insight: The Dell PowerEdge C6620 with Dell PERC 12 RAID con...
Speeding time to insight: The Dell PowerEdge C6620 with Dell PERC 12 RAID con...Principled Technologies
 
Storage Spaces Direct - the new Microsoft SDS star - Carsten Rachfahl
Storage Spaces Direct - the new Microsoft SDS star - Carsten RachfahlStorage Spaces Direct - the new Microsoft SDS star - Carsten Rachfahl
Storage Spaces Direct - the new Microsoft SDS star - Carsten RachfahlITCamp
 
Ceph Day Taipei - Accelerate Ceph via SPDK
Ceph Day Taipei - Accelerate Ceph via SPDK Ceph Day Taipei - Accelerate Ceph via SPDK
Ceph Day Taipei - Accelerate Ceph via SPDK Ceph Community
 
VMworld 2015: The Future of Software- Defined Storage- What Does it Look Like...
VMworld 2015: The Future of Software- Defined Storage- What Does it Look Like...VMworld 2015: The Future of Software- Defined Storage- What Does it Look Like...
VMworld 2015: The Future of Software- Defined Storage- What Does it Look Like...VMworld
 
ceph optimization on ssd ilsoo byun-short
ceph optimization on ssd ilsoo byun-shortceph optimization on ssd ilsoo byun-short
ceph optimization on ssd ilsoo byun-shortNAVER D2
 
HPC DAY 2017 | HPE Storage and Data Management for Big Data
HPC DAY 2017 | HPE Storage and Data Management for Big DataHPC DAY 2017 | HPE Storage and Data Management for Big Data
HPC DAY 2017 | HPE Storage and Data Management for Big DataHPC DAY
 
AMD EPYC™ Microprocessor Architecture
AMD EPYC™ Microprocessor ArchitectureAMD EPYC™ Microprocessor Architecture
AMD EPYC™ Microprocessor ArchitectureAMD
 
Dragonflow Austin Summit Talk
Dragonflow Austin Summit Talk Dragonflow Austin Summit Talk
Dragonflow Austin Summit Talk Eran Gampel
 
Ceph Day Seoul - AFCeph: SKT Scale Out Storage Ceph
Ceph Day Seoul - AFCeph: SKT Scale Out Storage Ceph Ceph Day Seoul - AFCeph: SKT Scale Out Storage Ceph
Ceph Day Seoul - AFCeph: SKT Scale Out Storage Ceph Ceph Community
 
Nytro-XV_NWD_VM_Performance_Acceleration
Nytro-XV_NWD_VM_Performance_AccelerationNytro-XV_NWD_VM_Performance_Acceleration
Nytro-XV_NWD_VM_Performance_AccelerationKhai Le
 
Ceph Day Shanghai - SSD/NVM Technology Boosting Ceph Performance
Ceph Day Shanghai - SSD/NVM Technology Boosting Ceph Performance Ceph Day Shanghai - SSD/NVM Technology Boosting Ceph Performance
Ceph Day Shanghai - SSD/NVM Technology Boosting Ceph Performance Ceph Community
 
Micron vPAK 2pp Final no Bleed_TD03
Micron vPAK 2pp Final no Bleed_TD03Micron vPAK 2pp Final no Bleed_TD03
Micron vPAK 2pp Final no Bleed_TD03Peyman Blumstengel
 
22by7 and DellEMC Tech Day July 20 2017 - Power Edge
22by7 and DellEMC Tech Day July 20 2017 - Power Edge22by7 and DellEMC Tech Day July 20 2017 - Power Edge
22by7 and DellEMC Tech Day July 20 2017 - Power EdgeSashikris
 
Virtualize with Confidence
Virtualize with ConfidenceVirtualize with Confidence
Virtualize with ConfidenceNetWize
 
3. ami big data hadoop on ucs seminar may 2013
3. ami big data hadoop on ucs seminar may 20133. ami big data hadoop on ucs seminar may 2013
3. ami big data hadoop on ucs seminar may 2013Taldor Group
 

Similaire à Big Data Uses with Distributed Asynchronous Object Storage (20)

Storage user cases
Storage user casesStorage user cases
Storage user cases
 
Optimized HPC/AI cloud with OpenStack acceleration service and composable har...
Optimized HPC/AI cloud with OpenStack acceleration service and composable har...Optimized HPC/AI cloud with OpenStack acceleration service and composable har...
Optimized HPC/AI cloud with OpenStack acceleration service and composable har...
 
Mellanox Storage Solutions
Mellanox Storage SolutionsMellanox Storage Solutions
Mellanox Storage Solutions
 
Speeding time to insight: The Dell PowerEdge C6620 with Dell PERC 12 RAID con...
Speeding time to insight: The Dell PowerEdge C6620 with Dell PERC 12 RAID con...Speeding time to insight: The Dell PowerEdge C6620 with Dell PERC 12 RAID con...
Speeding time to insight: The Dell PowerEdge C6620 with Dell PERC 12 RAID con...
 
Storage Spaces Direct - the new Microsoft SDS star - Carsten Rachfahl
Storage Spaces Direct - the new Microsoft SDS star - Carsten RachfahlStorage Spaces Direct - the new Microsoft SDS star - Carsten Rachfahl
Storage Spaces Direct - the new Microsoft SDS star - Carsten Rachfahl
 
Ceph Day Taipei - Accelerate Ceph via SPDK
Ceph Day Taipei - Accelerate Ceph via SPDK Ceph Day Taipei - Accelerate Ceph via SPDK
Ceph Day Taipei - Accelerate Ceph via SPDK
 
VMworld 2015: The Future of Software- Defined Storage- What Does it Look Like...
VMworld 2015: The Future of Software- Defined Storage- What Does it Look Like...VMworld 2015: The Future of Software- Defined Storage- What Does it Look Like...
VMworld 2015: The Future of Software- Defined Storage- What Does it Look Like...
 
NVMe over Fabric
NVMe over FabricNVMe over Fabric
NVMe over Fabric
 
ceph optimization on ssd ilsoo byun-short
ceph optimization on ssd ilsoo byun-shortceph optimization on ssd ilsoo byun-short
ceph optimization on ssd ilsoo byun-short
 
HPC DAY 2017 | HPE Storage and Data Management for Big Data
HPC DAY 2017 | HPE Storage and Data Management for Big DataHPC DAY 2017 | HPE Storage and Data Management for Big Data
HPC DAY 2017 | HPE Storage and Data Management for Big Data
 
AMD EPYC™ Microprocessor Architecture
AMD EPYC™ Microprocessor ArchitectureAMD EPYC™ Microprocessor Architecture
AMD EPYC™ Microprocessor Architecture
 
Dragonflow Austin Summit Talk
Dragonflow Austin Summit Talk Dragonflow Austin Summit Talk
Dragonflow Austin Summit Talk
 
Ceph Day Seoul - AFCeph: SKT Scale Out Storage Ceph
Ceph Day Seoul - AFCeph: SKT Scale Out Storage Ceph Ceph Day Seoul - AFCeph: SKT Scale Out Storage Ceph
Ceph Day Seoul - AFCeph: SKT Scale Out Storage Ceph
 
Nytro-XV_NWD_VM_Performance_Acceleration
Nytro-XV_NWD_VM_Performance_AccelerationNytro-XV_NWD_VM_Performance_Acceleration
Nytro-XV_NWD_VM_Performance_Acceleration
 
Ceph Day Shanghai - SSD/NVM Technology Boosting Ceph Performance
Ceph Day Shanghai - SSD/NVM Technology Boosting Ceph Performance Ceph Day Shanghai - SSD/NVM Technology Boosting Ceph Performance
Ceph Day Shanghai - SSD/NVM Technology Boosting Ceph Performance
 
Micron vPAK 2pp Final no Bleed_TD03
Micron vPAK 2pp Final no Bleed_TD03Micron vPAK 2pp Final no Bleed_TD03
Micron vPAK 2pp Final no Bleed_TD03
 
22by7 and DellEMC Tech Day July 20 2017 - Power Edge
22by7 and DellEMC Tech Day July 20 2017 - Power Edge22by7 and DellEMC Tech Day July 20 2017 - Power Edge
22by7 and DellEMC Tech Day July 20 2017 - Power Edge
 
Virtualize with Confidence
Virtualize with ConfidenceVirtualize with Confidence
Virtualize with Confidence
 
3. ami big data hadoop on ucs seminar may 2013
3. ami big data hadoop on ucs seminar may 20133. ami big data hadoop on ucs seminar may 2013
3. ami big data hadoop on ucs seminar may 2013
 
QNAP NAS Training 2016
QNAP NAS Training 2016QNAP NAS Training 2016
QNAP NAS Training 2016
 

Plus de Intel® Software

AI for All: Biology is eating the world & AI is eating Biology
AI for All: Biology is eating the world & AI is eating Biology AI for All: Biology is eating the world & AI is eating Biology
AI for All: Biology is eating the world & AI is eating Biology Intel® Software
 
Python Data Science and Machine Learning at Scale with Intel and Anaconda
Python Data Science and Machine Learning at Scale with Intel and AnacondaPython Data Science and Machine Learning at Scale with Intel and Anaconda
Python Data Science and Machine Learning at Scale with Intel and AnacondaIntel® Software
 
Streamline End-to-End AI Pipelines with Intel, Databricks, and OmniSci
Streamline End-to-End AI Pipelines with Intel, Databricks, and OmniSciStreamline End-to-End AI Pipelines with Intel, Databricks, and OmniSci
Streamline End-to-End AI Pipelines with Intel, Databricks, and OmniSciIntel® Software
 
AI for good: Scaling AI in science, healthcare, and more.
AI for good: Scaling AI in science, healthcare, and more.AI for good: Scaling AI in science, healthcare, and more.
AI for good: Scaling AI in science, healthcare, and more.Intel® Software
 
Software AI Accelerators: The Next Frontier | Software for AI Optimization Su...
Software AI Accelerators: The Next Frontier | Software for AI Optimization Su...Software AI Accelerators: The Next Frontier | Software for AI Optimization Su...
Software AI Accelerators: The Next Frontier | Software for AI Optimization Su...Intel® Software
 
Advanced Techniques to Accelerate Model Tuning | Software for AI Optimization...
Advanced Techniques to Accelerate Model Tuning | Software for AI Optimization...Advanced Techniques to Accelerate Model Tuning | Software for AI Optimization...
Advanced Techniques to Accelerate Model Tuning | Software for AI Optimization...Intel® Software
 
Reducing Deep Learning Integration Costs and Maximizing Compute Efficiency| S...
Reducing Deep Learning Integration Costs and Maximizing Compute Efficiency| S...Reducing Deep Learning Integration Costs and Maximizing Compute Efficiency| S...
Reducing Deep Learning Integration Costs and Maximizing Compute Efficiency| S...Intel® Software
 
AWS & Intel Webinar Series - Accelerating AI Research
AWS & Intel Webinar Series - Accelerating AI ResearchAWS & Intel Webinar Series - Accelerating AI Research
AWS & Intel Webinar Series - Accelerating AI ResearchIntel® Software
 
Intel AIDC Houston Summit - Overview Slides
Intel AIDC Houston Summit - Overview SlidesIntel AIDC Houston Summit - Overview Slides
Intel AIDC Houston Summit - Overview SlidesIntel® Software
 
AIDC NY: BODO AI Presentation - 09.19.2019
AIDC NY: BODO AI Presentation - 09.19.2019AIDC NY: BODO AI Presentation - 09.19.2019
AIDC NY: BODO AI Presentation - 09.19.2019Intel® Software
 
AIDC NY: Applications of Intel AI by QuEST Global - 09.19.2019
AIDC NY: Applications of Intel AI by QuEST Global - 09.19.2019AIDC NY: Applications of Intel AI by QuEST Global - 09.19.2019
AIDC NY: Applications of Intel AI by QuEST Global - 09.19.2019Intel® Software
 
Advanced Single Instruction Multiple Data (SIMD) Programming with Intel® Impl...
Advanced Single Instruction Multiple Data (SIMD) Programming with Intel® Impl...Advanced Single Instruction Multiple Data (SIMD) Programming with Intel® Impl...
Advanced Single Instruction Multiple Data (SIMD) Programming with Intel® Impl...Intel® Software
 
Build a Deep Learning Video Analytics Framework | SIGGRAPH 2019 Technical Ses...
Build a Deep Learning Video Analytics Framework | SIGGRAPH 2019 Technical Ses...Build a Deep Learning Video Analytics Framework | SIGGRAPH 2019 Technical Ses...
Build a Deep Learning Video Analytics Framework | SIGGRAPH 2019 Technical Ses...Intel® Software
 
Bring Intelligent Motion Using Reinforcement Learning Engines | SIGGRAPH 2019...
Bring Intelligent Motion Using Reinforcement Learning Engines | SIGGRAPH 2019...Bring Intelligent Motion Using Reinforcement Learning Engines | SIGGRAPH 2019...
Bring Intelligent Motion Using Reinforcement Learning Engines | SIGGRAPH 2019...Intel® Software
 
RenderMan*: The Role of Open Shading Language (OSL) with Intel® Advanced Vect...
RenderMan*: The Role of Open Shading Language (OSL) with Intel® Advanced Vect...RenderMan*: The Role of Open Shading Language (OSL) with Intel® Advanced Vect...
RenderMan*: The Role of Open Shading Language (OSL) with Intel® Advanced Vect...Intel® Software
 
AIDC India - Intel Movidius / Open Vino Slides
AIDC India - Intel Movidius / Open Vino SlidesAIDC India - Intel Movidius / Open Vino Slides
AIDC India - Intel Movidius / Open Vino SlidesIntel® Software
 
AIDC India - AI Vision Slides
AIDC India - AI Vision SlidesAIDC India - AI Vision Slides
AIDC India - AI Vision SlidesIntel® Software
 
Enhance and Accelerate Your AI and Machine Learning Solution | SIGGRAPH 2019 ...
Enhance and Accelerate Your AI and Machine Learning Solution | SIGGRAPH 2019 ...Enhance and Accelerate Your AI and Machine Learning Solution | SIGGRAPH 2019 ...
Enhance and Accelerate Your AI and Machine Learning Solution | SIGGRAPH 2019 ...Intel® Software
 

Plus de Intel® Software (20)

AI for All: Biology is eating the world & AI is eating Biology
AI for All: Biology is eating the world & AI is eating Biology AI for All: Biology is eating the world & AI is eating Biology
AI for All: Biology is eating the world & AI is eating Biology
 
Python Data Science and Machine Learning at Scale with Intel and Anaconda
Python Data Science and Machine Learning at Scale with Intel and AnacondaPython Data Science and Machine Learning at Scale with Intel and Anaconda
Python Data Science and Machine Learning at Scale with Intel and Anaconda
 
Streamline End-to-End AI Pipelines with Intel, Databricks, and OmniSci
Streamline End-to-End AI Pipelines with Intel, Databricks, and OmniSciStreamline End-to-End AI Pipelines with Intel, Databricks, and OmniSci
Streamline End-to-End AI Pipelines with Intel, Databricks, and OmniSci
 
AI for good: Scaling AI in science, healthcare, and more.
AI for good: Scaling AI in science, healthcare, and more.AI for good: Scaling AI in science, healthcare, and more.
AI for good: Scaling AI in science, healthcare, and more.
 
Software AI Accelerators: The Next Frontier | Software for AI Optimization Su...
Software AI Accelerators: The Next Frontier | Software for AI Optimization Su...Software AI Accelerators: The Next Frontier | Software for AI Optimization Su...
Software AI Accelerators: The Next Frontier | Software for AI Optimization Su...
 
Advanced Techniques to Accelerate Model Tuning | Software for AI Optimization...
Advanced Techniques to Accelerate Model Tuning | Software for AI Optimization...Advanced Techniques to Accelerate Model Tuning | Software for AI Optimization...
Advanced Techniques to Accelerate Model Tuning | Software for AI Optimization...
 
Reducing Deep Learning Integration Costs and Maximizing Compute Efficiency| S...
Reducing Deep Learning Integration Costs and Maximizing Compute Efficiency| S...Reducing Deep Learning Integration Costs and Maximizing Compute Efficiency| S...
Reducing Deep Learning Integration Costs and Maximizing Compute Efficiency| S...
 
AWS & Intel Webinar Series - Accelerating AI Research
AWS & Intel Webinar Series - Accelerating AI ResearchAWS & Intel Webinar Series - Accelerating AI Research
AWS & Intel Webinar Series - Accelerating AI Research
 
Intel Developer Program
Intel Developer ProgramIntel Developer Program
Intel Developer Program
 
Intel AIDC Houston Summit - Overview Slides
Intel AIDC Houston Summit - Overview SlidesIntel AIDC Houston Summit - Overview Slides
Intel AIDC Houston Summit - Overview Slides
 
AIDC NY: BODO AI Presentation - 09.19.2019
AIDC NY: BODO AI Presentation - 09.19.2019AIDC NY: BODO AI Presentation - 09.19.2019
AIDC NY: BODO AI Presentation - 09.19.2019
 
AIDC NY: Applications of Intel AI by QuEST Global - 09.19.2019
AIDC NY: Applications of Intel AI by QuEST Global - 09.19.2019AIDC NY: Applications of Intel AI by QuEST Global - 09.19.2019
AIDC NY: Applications of Intel AI by QuEST Global - 09.19.2019
 
Advanced Single Instruction Multiple Data (SIMD) Programming with Intel® Impl...
Advanced Single Instruction Multiple Data (SIMD) Programming with Intel® Impl...Advanced Single Instruction Multiple Data (SIMD) Programming with Intel® Impl...
Advanced Single Instruction Multiple Data (SIMD) Programming with Intel® Impl...
 
Build a Deep Learning Video Analytics Framework | SIGGRAPH 2019 Technical Ses...
Build a Deep Learning Video Analytics Framework | SIGGRAPH 2019 Technical Ses...Build a Deep Learning Video Analytics Framework | SIGGRAPH 2019 Technical Ses...
Build a Deep Learning Video Analytics Framework | SIGGRAPH 2019 Technical Ses...
 
Bring Intelligent Motion Using Reinforcement Learning Engines | SIGGRAPH 2019...
Bring Intelligent Motion Using Reinforcement Learning Engines | SIGGRAPH 2019...Bring Intelligent Motion Using Reinforcement Learning Engines | SIGGRAPH 2019...
Bring Intelligent Motion Using Reinforcement Learning Engines | SIGGRAPH 2019...
 
RenderMan*: The Role of Open Shading Language (OSL) with Intel® Advanced Vect...
RenderMan*: The Role of Open Shading Language (OSL) with Intel® Advanced Vect...RenderMan*: The Role of Open Shading Language (OSL) with Intel® Advanced Vect...
RenderMan*: The Role of Open Shading Language (OSL) with Intel® Advanced Vect...
 
AIDC India - AI on IA
AIDC India  - AI on IAAIDC India  - AI on IA
AIDC India - AI on IA
 
AIDC India - Intel Movidius / Open Vino Slides
AIDC India - Intel Movidius / Open Vino SlidesAIDC India - Intel Movidius / Open Vino Slides
AIDC India - Intel Movidius / Open Vino Slides
 
AIDC India - AI Vision Slides
AIDC India - AI Vision SlidesAIDC India - AI Vision Slides
AIDC India - AI Vision Slides
 
Enhance and Accelerate Your AI and Machine Learning Solution | SIGGRAPH 2019 ...
Enhance and Accelerate Your AI and Machine Learning Solution | SIGGRAPH 2019 ...Enhance and Accelerate Your AI and Machine Learning Solution | SIGGRAPH 2019 ...
Enhance and Accelerate Your AI and Machine Learning Solution | SIGGRAPH 2019 ...
 

Dernier

From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 

Dernier (20)

From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 

Big Data Uses with Distributed Asynchronous Object Storage

  • 1. Di Wang Extreme Storage Architecture & Development (ESAD), Intel
  • 2. SPDK, PMDK & Vtune™ Summit 2 Agenda • DAOS (Distributed Asynchronous Object Storage) Overview • DAOS Architecture & features • DAOS Storage Model • DAOS with PMDK & SPDK • Current Performance & Resource
  • 3. SPDK, PMDK & Vtune™ Summit 3 Storage revolution 90 25 20 15 10 5 0 NAND SSD (4kB Read) Intel® Optane SSD (4kB Read) Legend NVM Media Read PCIe & NVMe protocol Software (File System, OS, Driver) LatencyfromApp(uS) Intel® Optane NVDIMMs (64B Read)
  • 4. SPDK, PMDK & Vtune™ Summit 4 DAOS overview DAOS Storage Engine Open Source Apache 2.0 License HDD POSIX I/O 3rd Party Applications Rich Data Models Storage Platform Storage Media Workflow HDF5 SQL … Intel® QLC 3D Nand SSD
  • 5. SPDK, PMDK & Vtune™ Summit 5 Lightweight I/O Mercury userspace function shipping § MPI equivalent communications latency § Built over libfabric Applications link directly with DAOS lib § Direct call, no context switch § Small memory footprint § No locking, caching or data copy Userspace DAOS server § Mmap non-volatile memory via PMDK § NVMe access through SPDK/Blobstore AI/Analytics/Simulation Workflow DAOS library Mercury/Libfabric NVMe SSDs Bulk transfers SPDK PMDK RPC HDF5 SCM File (No)SQL… DAOS Service
  • 6. SPDK, PMDK & Vtune™ Summit 6 Storage Model DAOS provides a rich storage API § New scalable storage model suitable for both structured & unstructured data – key-value stores, multi-dimensional arrays, columnar databases, … – Accelerate data analytic/AI frameworks § Non-blocking data & metadata operations § Ad-hoc concurrency control mechanism Pool § Reservation of distributed storage § Predictable/extendable performance/capacity Container § Aggregate related datasets into manageable entity § Unit of snapshot/transaction Object § Key-array store with own distribution/resilience schema § Multi-level key for fine-grain control over colocation of related data Record § Arbitrary binary blob from single byte to several Mbytes Storage Pool Container Object Record
  • 7. SPDK, PMDK & Vtune™ Summit 7 Fine-grained I/O Mix of storage technologies § Storage Class Memory – DAOS metadata & application metadata – Byte-granular application data § NVMe SSD (*NAND) – Cheaper storage for bulk data (e.g. checkpoints) – Multi-KB I/Os are logged & inserted into persistent index § Non-destructive write & consistent read § No alignment constraints § No read-modify-write v1 v2 v3 read@v3 Application Buffer Server-side Index Bulk descriptor segments
  • 8. SPDK, PMDK & Vtune™ Summit 8 DATA Management Data Security & Reduction § Online real-time data encryption & compression § Hardware acceleration Data Distribution § Algorithmic placement Data Protection § Declustered replication & erasure code § Fault-domain aware placement § Self-healing § End-to-end data integrity Hash (object.Dkey) Hash (object.Dkey) Fault domain separation
  • 9. SPDK, PMDK & Vtune™ Summit 9 Pool Storage on DAOS Server DAOS Service Argobots Xstream PMDK pmemobj SPDK Blob SCM NVMe SSD PMDK pmemobj PMDK pmemobj PMDK pmemobj PMDK pmemobj SPDK Blob SPDK Blob SPDK Blob SPDK Blob NVMe block allocation Info PMDK pmemobj SPDK Blob
  • 10. SPDK, PMDK & Vtune™ Summit 10 DAOS I/O over PMDK/SPDK SCM NVMe DAOS Xstream § Reserve new buffer § Either reserve by pmemobj_reserve § Or reserve in NVME SSD
  • 11. SPDK, PMDK & Vtune™ Summit 11 DAOS I/O over PMDK/SPDK 11 SCM NVMe DAOS Xstream § Reserve new buffer § Either reserve by pmemobj_reserve § Or reserve in NVME SSD § Start RDMA transfer to newly allocated buffer § Either transfer to PMEM § Or transfer to DMA buffer then to NVME SSD § Start pmemobj transaction
  • 12. SPDK, PMDK & Vtune™ Summit 12 DAOS I/O over PMDK/SPDK SCM NVMe DAOS Xstream § Reserve new buffer § Either reserve by pmemobj_reserve § Or reserve in NVME SSD § Start RDMA transfer to newly allocated buffer § Either transfer to PMEM § Or transfer to DMA buffer then to NVME SSD § Start pmemobj transaction § Modify index to insert new extent
  • 13. SPDK, PMDK & Vtune™ Summit 13 DAOS I/O over PMDK/SPDK 13 SCM NVMe DAOS Xstream § Reserve new buffer § Either reserve by pmemobj_reserve § Or reserve in NVME SSD § Start RDMA transfer to newly allocated buffer § Either transfer to PMEM § Or transfer to DMA buffer then to NVME SSD § Start pmemobj transaction § Modify index to insert new extent § Publish the reserve the space. § Either pmemobj_tx_publish() for SCM. § Or publish the space for NVMe SSD. § Commit pmemobj transaction and reply to client
  • 14. SPDK, PMDK & Vtune™ Summit 14 DAOS Performance 34996 188782 282017 407431 469666 472509 502516 0 200000 400000 600000 800000 1000000 1200000 1 8 16 32 64 128 256 IOPS Number of Clients IOR Write - 1024 I/O size 62392 326432 434839 829526 875873 773290 1019720 0 200000 400000 600000 800000 1000000 1200000 1 8 16 32 64 128 256 IOPS Number of Clients IOR Read - 1024B I/O size • IOR runs on remote clients sending the I/O requests to the single DAOS server over the fabric • Intel Omni-Path Host Adapter 100HFA016LS • Using the DAOS MPI-IO driver with the full DAOS stack (client, network, server) • Cascade Lake CPUs, 6 Dimms 512G AEP NMA1XBD512GQSE
  • 15. SPDK, PMDK & Vtune™ Summit 15 DAOS Community Roadmap All information provided in this roadmap is subject to change without notice. 1Q19 2Q19 3Q19 4Q19 1Q20 2Q20 3Q20 4Q20 1Q21 2Q21 3Q21 4Q21 1Q22 2Q22 3Q22 Pre-1.0 releases & RCs 1.0 1.2 1.4 2.0 2.2 2.4 DAOS: - Replication with self-healing - Persistent Memory support - NVMe SSD support - Self monitoring & bootstrap - Initial control plane - python/golang API bindings I/O Middleware: - MPI-IO driver - HDF5 DAOS Connector (proto) - POSIX I/O (proto) DAOS: - Per-pool ACL - Lustre integration I/O Middleware: - HDF5 DAOS Connector - POSIX I/O support - Spark DAOS: - End-to-end data integrity - Per-container ACL - SmartNICs & accelerators - Improved control plane DAOS: - Online server addition - Advanced control plane I/O Middleware: - POSIX data mover - Async HDF5 operations over DAOS DAOS: - Erasure code - Telemetry & per-job statistics - Multi OFI provider support I/O Middleware: - Advanced POSIX I/O support - Advanced data mover Partner engagement & PoCs DAOS: - Progressive layout / GIGA+ - Placement optimizations - Checksum scrubbing I/O Middleware: - Apache Arrow (not POR) DAOS: - Catastrophic recovery tools
  • 16. SPDK, PMDK & Vtune™ Summit 16 Resource Source code on GitHub https://github.com/daos-stack/daos Community mailing list on Groups.io daos@daos.groups.io or https://daos.groups.io/g/daos Wiki http://daos.io or https://wiki.hpdd.intel.com Bug tracker https://jira.hpdd.intel.com