2. $> whois kyle bader
Senior Solution Architect
Red Hat
3. DATA CHALLENGES
Exponential growth in digital content increases pressure on
capacity, scalability, and cost.
The need for access to data from anywhere, anytime, on
any device requires unprecedented agility.
Modern services require the flexibility to store data on-
premises or in the cloud.
Growing content requires advanced data protection that
ensures integrity & high availability at very large scale.
1
2
3
4
4. Traditional Storage
Complex proprietary silos
Open, Software-Defined Storage
Standardized, unified, open platforms
Custom GUI
Proprietary Software
Proprietary
Hardware
Standard
Computers
and Disks
Standard
Hardware
OpenSource
Software
Ceph Gluster +++
Control Plane (API, GUI)
ADMIN USER
THE FUTURE OF STORAGE
ADMIN
USER
ADMIN
USER
ADMIN
USER
Custom GUI
Proprietary Software
Proprietary
Hardware
Custom GUI
Proprietary Software
Proprietary
Hardware
6. Server-based storage uses software and standard hardware to provide services
traditionally provided by single-purpose storage appliances, providing increased
agility and efficiency.
DISTRIBUTED CLUSTER OF
SERVERS
MEDIA MEDIA MEDIA MEDIA MEDIA MEDIA MEDIA
APPLIANCE
MEDIA MEDIA
APPLIANCE
MEDIA MEDIA
APPLIANCE
MEDIA MEDIA
USER USER USER
SERVER-BASED STORAGE
USER USER USER
8. STANDARD SAN/NAS IS ON THE DECLINE
Server-based storage is “will account for over 60%
of shipments long term.”
“By 2016, server-based storage solutions will lower
storage hardware costs by 50% or more.”
Gartner: “IT Leaders Can Benefit From Disruptive Innovation in the Storage Industry”
Credit Suisse Storage Update, September 3, 2015
Changing workloads drive the need for
flexible, economical server-based storage.
WW DEPLOYED CAPACITY (TB)
2010 2011 2012 2013 2014 2015
(est)
2016
(est)
Source: IDC
0%
60%
20%
40%
80%
100
%
Internal CapacityExternal
Capacity
9. STORAGE ORCHESTRATION
Storage orchestration is the ability to provision, grow, shrink, and decommission
storage resources on-demand and programmatically, providing increased control
and integration of storage into a software-defined data center.
WEB CONSOLE
A browser interface designed for
managing distributed storage
API
A full API for automation and
integration with outside systems
COMMAND LINE
A robust, scriptable command-line
interface for expert operators
PROVISION INSTALL CONFIGURE TUNE MONITOR
Full lifecycle management for distributed, software-defined data services
10. A RISING TIDE
“By 2020, between 70-80% of unstructured data will
be held on lower-cost storage managed by SDS”
“By 2019, 70% of existing storage array products
will also be available as software only versions”
Innovation Insight: Separating Hype From Hope for Software-Defined Storage
Innovation Insight: Separating Hype From Hope for Software-Defined Storage
2013 2014 2015 2016 2017 2018 2019
$1,349M
$1,195
M
$1,029M
$859M
$706M
$592M
SDS-P MARKET SIZE BY SEGMENT
$457M
Block Storage
File Storage
Object Storage
Hyperconverged
Source: IDC
Software-Defined Storage is leading a
shift in the global storage industry, with
far-reaching effects.
11. THE BALANCE
Inflexible
Expensive at large scale
Durable
Convenient
Flexible
Economical at large scale
Durable
Powerful
Appliances are suitable for small-
scale, workloads, but they do not scale
economically.
Software-defined storage has a
learning curve, but bring performance
and economy at petabyte scale.
12. THE ROBUSTNESS OF SOFTWARE
Software is more flexible than hardware
Software can do things hardware appliances can’t. SDS brings the
flexibility of software to the enterprise storage world.
• Can be deployed on bare metal, inside containers, inside VMs, or
in the public cloud.
• Can deploy on a single server, or thousands, and can be
upgraded and reconfigured on the fly.
• Grows and shrinks programmatically to meet changing demands
13. BUILDING ON PROVEN HARDWARE
Hardware is hard, and we got you covered
Tested software defined storage solutions, for repeatable success.
• Ceph Hardware Configuration Guide
• Ceph Hardware Selection Guide
• Ceph Performance and Sizing Guide - Supermicro
• Ceph Performance and Sizing Guide - Quanta QCT
14. OPTIMIZATION CRITERIA
IOPS
Optimized
Throughput
Optimized
Capacity
Optimized
• Lowest cost per IO
• Highest IOPS
• Meet minimum fault domain
requirement
• Lowest cost per unit of throughput
• Highest throughput
• Highest throughput per watt/BTU
• Meet minimum fault domain
requirement
• Lowest cost per TB
• Lowest watt/BTU per TB
• Meet minimum fault domain
requirement
• Typically block storage
• Replication
• MySQL for OpenStack tenants
• Block and object storage
• Replication or erasure coded
• Active performance storage for video,
audio, and images
• Streaming media
• Typically object storage
• Erasure coding dominant
• Media archives
• Data lake
15.
16.
17. • Shared, elastic storage pool
• Dynamic DB placement
• Flexible volume resizing
• Live instance migration
• Backup to object pool
• Read replicas via copy-on-write snapshots
MySQL ON CEPH STORAGE CLOUD
OPS EFFICIENCY
20. HEAD-TO-HEAD LAB
TEST ENVIRONMENTS
• EC2 r3.2xlarge and m4.4xlarge
• EBS Provisioned IOPS and GPSSD
• Percona Server
• Supermicro servers
• Red Hat Ceph Storage RBD
• Percona Server
21. OSD Storage Server Systems
5x SuperStorage SSG-6028R-OSDXXX
Dual Intel Xeon E5-2650v3 (10x core)
32GB SDRAM DDR3
2x 80GB boot drives
4x 800GB Intel DC P3700 (hot-swap U.2 NVMe)
1x dual port 10GbE network adaptors AOC-STGN-i2S
8x Seagate 6TB 7200 RPM SAS (unused in this lab)
Mellanox 40GbE network adaptor(unused in this lab)
MySQL Client Systems
12x Super Server 2UTwin2 nodes
Dual Intel Xeon E5-2670v2
(cpuset limited to 8 or 16 vCPUs)
64GB SDRAM DDR3
Storage Server Software:
Red Hat Ceph Storage 1.3.2
Red Hat Enterprise Linux 7.2
Percona Server
5x OSD Nodes 12x Client Nodes
Shared10GSFP+Networking
Monitor Nodes
SUPERMICRO CEPH
LAB ENVIRONMENT