DeepDB for MySQL® Overview
July 2014
The World We Live In… 
• According to IDC, the Database software market has a CAGR of 34.2% 
• Wal-Mart generates 1 million new database records every hour 
• Chevron generates data at a rate of 2TB/day! 
• According to the Data Warehousing Institute 46% of companies plan to 
replace their existing data warehousing platforms 
• Every day, we create 2.5 quintillion bytes of data — so much that 
90% of the data in the world today has been created in the last two 
years alone.
MySQL Challenges 
• Performance degrades as table sizes get larger 
– Limitations of the underlying computer science 
• Highly indexed schemas negatively impact performance 
– More indexes help query performance but hurt transactions 
• Poor performance with complex queries 
– Many table joins 
• Data loading times are slow due to poor concurrency 
– Table locking and single threaded operations 
• Backup time and performance impact 
– Big databases are slow to back up, and backups affect system 
performance
Technology Limitations 
Most relational databases use Traditional B+ Trees which have architectural 
limitations that become apparent with large data sets or heavy indexing 
[Chart: InnoDB inserting rows into a table with 7 indexes. Insert rate (y-axis, 0 to 120,000) versus row count (x-axis, roughly 1 million to 90 million rows). Elapsed time: 40,600 seconds.]
Test setup: iiBench with 10 clients, insert-only with secondary indexes, running on a SoftLayer 32-core system with 32GB RAM and a 4-drive HDD RAID5. MySQL 5.5.35 running on Ubuntu 12.04.
Key InnoDB parameters:
innodb_buffer_pool_size=4G
innodb_flush_log_at_trx_commit=0
innodb_flush_method=O_DIRECT
innodb_log_file_size=100M
innodb_log_files_in_group=2
innodb_log_buffer_size=16M
Cache Ahead Summary Index Tree 
Derived from the 
classic B+ Tree 
Embedded statistics and other meta-data 
in the nodes improves both tree 
navigation and indexing 
Branch node segments 
can vary in size based on 
actual data values 
Summary nodes provide a 
mechanism to navigate 
extremely large tables by 
minimizing the number of 
branches walked 
Wider trees with embedded 
meta-data to enhance 
search and modification 
operations 
CASI Tree Instantiations: 
• A CASI Tree exists in both memory and on 
disk for each table and index 
• The structures of the Tree on disk and in 
memory are different 
• The (re)organization of the Tree on disk 
happens asynchronously from the one in 
memory based on adaptive algorithms, to 
yield improved disk I/O and CPU concurrency 
[Diagram: CASI Tree with a root node, branch nodes, and summary nodes]
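The way summary nodes cut down the number of branches walked can be illustrated with a small sketch. This is not DeepDB's implementation: the names `Segment` and `SummaryIndex`, and the choice of summarizing each segment by its maximum key only, are assumptions made for illustration.

```python
from bisect import bisect_left
from dataclasses import dataclass


@dataclass
class Segment:
    """A variable-length run of sorted keys (hypothetical structure)."""
    keys: list

    @property
    def max_key(self):
        return self.keys[-1]


class SummaryIndex:
    """One summary level over ordered segments: keeps only each segment's max key."""

    def __init__(self, segments):
        # Assumes segments are non-empty, sorted internally, and ordered by key range.
        self.segments = segments
        self.summary = [seg.max_key for seg in segments]

    def find_segment(self, key):
        """Return the index of the only segment that could hold `key`, or None."""
        i = bisect_left(self.summary, key)
        return i if i < len(self.segments) else None

    def contains(self, key):
        i = self.find_segment(key)
        if i is None:
            return False
        keys = self.segments[i].keys
        j = bisect_left(keys, key)
        return j < len(keys) and keys[j] == key


index = SummaryIndex([
    Segment([1, 4, 9]),
    Segment([12, 15]),
    Segment([20, 31, 44, 58]),
])
print(index.find_segment(15))
print(index.contains(31))
print(index.contains(13))
```

A lookup consults the small summary list first and then touches a single segment, so segments that cannot hold the key are never read.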
CASI Tree Benefits 
CONSTANT TIME INDEXING 
Lightning fast indexing at extreme 
scale 
SEGMENTED COLUMN STORE 
Accelerates analytic operations and 
data management at scale 
STREAMING I/O 
Maximizes disk throughput with 
highly efficient use of IOPS 
EXTREME CONCURRENCY 
Minimizes locks and wait states to 
maximize CPU throughput 
INTELLIGENT CACHING 
Uses adaptive segment sizes and 
summaries to eliminate many disk reads 
BUILT FOR THE CLOUD 
Adaptive configuration and continuous 
optimization eliminate scheduled 
downtime 
CASI Tree Principles: 
• Always try to append data to file (i.e. don't seek, use the current seek position) 
• Read data sequentially (i.e. don't seek, use the current seek position for next sequence of reads) 
• Continually rewrite and reorder data so that the two principles above are met
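The three principles can be sketched with a toy append-only key-value file. This is an illustrative sketch only, not DeepDB's file format: the class `AppendOnlyStore`, the record layout, and the `compact` step are all assumptions.

```python
import os
import struct
import tempfile

HEADER = struct.Struct("<II")  # record header: key length, value length


class AppendOnlyStore:
    """Toy store: writes only append, and the newest record for a key wins."""

    def __init__(self, path):
        self.path = path
        self.offsets = {}  # key -> offset of its newest record
        self.end = 0       # current end of file, tracked in memory
        open(path, "ab").close()  # create the file if it does not exist
        self._scan()

    def _scan(self):
        """Rebuild the in-memory index with one sequential pass over the file."""
        self.offsets.clear()
        offset = 0
        with open(self.path, "rb") as f:
            while True:
                header = f.read(HEADER.size)
                if len(header) < HEADER.size:
                    break
                klen, vlen = HEADER.unpack(header)
                key = f.read(klen)
                f.read(vlen)  # skip the value by reading forward
                self.offsets[key] = offset
                offset += HEADER.size + klen + vlen
        self.end = offset

    def put(self, key, value):
        record = HEADER.pack(len(key), len(value)) + key + value
        with open(self.path, "ab") as f:  # append mode: never overwrites
            f.write(record)
        self.offsets[key] = self.end
        self.end += len(record)

    def get(self, key):
        offset = self.offsets.get(key)
        if offset is None:
            return None
        with open(self.path, "rb") as f:
            f.seek(offset)  # a point read is the one place this toy seeks
            klen, vlen = HEADER.unpack(f.read(HEADER.size))
            f.read(klen)
            return f.read(vlen)

    def compact(self):
        """Rewrite live records in key order so later scans stay sequential."""
        live = [(k, self.get(k)) for k in sorted(self.offsets)]
        tmp = self.path + ".compact"
        with open(tmp, "wb") as f:
            for k, v in live:
                f.write(HEADER.pack(len(k), len(v)) + k + v)
        os.replace(tmp, self.path)
        self._scan()


path = os.path.join(tempfile.mkdtemp(), "data.log")
store = AppendOnlyStore(path)
store.put(b"a", b"1")
store.put(b"b", b"2")
store.put(b"a", b"3")  # supersedes the first record without overwriting it
size_before = os.path.getsize(path)
store.compact()
size_after = os.path.getsize(path)
print(store.get(b"a"), size_before, size_after)
```

In this sketch an update never modifies existing bytes; the stale record is dropped only when `compact` rewrites the live data in key order.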
Constant Time Indexing 
Minimizes index cost, enabling high-performance, heavily indexed tables 
Different data structures on disk and in memory 
All work is performed in constant time, eliminating the need for periodic flushing 
Streaming 
File I/O 
(No memory map page 
size limitations) 
In Memory: Enhanced B+ Tree 
• Optimized for ‘wide’ nodes with 
accelerated operations 
• Stores index summaries to achieve 
great scale while maximizing cache 
effectiveness 
• Values are stored independently of 
the tree 
• Tree rebalancing occurs only in 
memory – no impact on data stored 
on disk 
• No fixed page/block sizes 
On Disk: Segmented Column Store 
• Highly optimized for on-disk 
read/write access 
• Never requires operational/in-place 
rebalancing 
• All previous database states are 
available 
• Efficiently supports variable size 
keys, values and ‘point reads’ 
• Utilizes segmented column store 
technology for indexes and columns 
Key Benefits: Increases maximum practical table sizes and improves analytic 
performance by allowing for more indexing
Segmented Column Store 
Structure of the index files for the database 
– Provides the functional capabilities of a column store 
– Simultaneously read and write optimized 
– Instantaneous database start up/shut down 
– Columns are updated in tandem with value changes 
– Consistent performance and latency; optimized in real time 
– Columns consist of variable length segments 
– Each segment is a block of ordered keys, references to rows and 
meta-data 
– Changes to the key space require only delta updates 
Optimized for real-time analytics 
– Embedded statistical data in each segment 
– Allows for heavy indexing to improve query performance 
– Enables continuous transactional data feed 
Suited for high levels of compression 
– Compact representation of keys with summarization 
– Flexible segment and delta compression 
Segmented Column Store 
[Diagram: file layout of a segmented column store. A header is followed by Segment A, Segment B, and a delta segment holding changes to Segment A with a back reference. Each segment contains meta-data and keys and/or values, framed by "Segment Type & Size" fields.]
Key Benefits: Excellent compression facilities and improved query performance. 
Supports continuous streaming backups with snapshots
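A minimal sketch of the segment idea described above: ordered keys with row references, embedded statistics, and delta updates recorded against a base segment. The names `Segment`, `Delta`, and `count_in_range` are hypothetical and do not come from DeepDB.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Segment:
    """A non-empty block of ordered keys with row references and statistics."""
    keys: tuple      # sorted keys
    row_refs: tuple  # row_refs[i] is the row holding keys[i]

    @property
    def meta(self):
        # Embedded statistics, derived here for brevity.
        return {"min": self.keys[0], "max": self.keys[-1], "count": len(self.keys)}

    def overlaps(self, lo, hi):
        m = self.meta
        return m["min"] <= hi and lo <= m["max"]


@dataclass
class Delta:
    """Changes recorded against a base segment instead of rewriting it."""
    base: Segment                                # back reference
    upserts: dict = field(default_factory=dict)  # key -> row_ref
    deletes: set = field(default_factory=set)

    def merged(self):
        """Fold the delta into a new segment; the base is left untouched."""
        rows = dict(zip(self.base.keys, self.base.row_refs))
        rows.update(self.upserts)
        for k in self.deletes:
            rows.pop(k, None)
        keys = tuple(sorted(rows))
        return Segment(keys, tuple(rows[k] for k in keys))


def count_in_range(segments, lo, hi):
    """Count keys in [lo, hi], skipping segments whose statistics rule them out."""
    total = 0
    scanned = 0
    for seg in segments:
        if not seg.overlaps(lo, hi):
            continue
        scanned += 1
        total += sum(lo <= k <= hi for k in seg.keys)
    return total, scanned


seg_a = Segment((10, 20, 30), ("r1", "r2", "r3"))
seg_b = Segment((40, 50), ("r4", "r5"))
delta = Delta(base=seg_a, upserts={25: "r6"}, deletes={10})
seg_a2 = delta.merged()
print(seg_a2.keys)
print(count_in_range([seg_a2, seg_b], 20, 30))
```

The range count reads only the one segment whose minimum and maximum overlap the query, which is the kind of saving embedded per-segment statistics are meant to provide.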
Streaming I/O 
• Massively optimized, delivering near-wire-speed throughput 
• Append-only file structures virtually eliminate disk seeks 
• Concurrent operations for updates in memory & on disk 
• Optimizations for SSD, HDD, and in-memory-only operation 
• Minimizes IO wait states 
[Diagram: data streams flow into DeepDB, which performs streaming transactional state logging and streaming indexing]
Key Benefits: Achieves near-SSD performance with magnetic HDDs. Extends 
the life expectancy of SSDs with built-in wear leveling and no write 
amplification
Extreme Concurrency 
Running the Sysbench test on a 32 CPU core system with 32 attached clients 
InnoDB: strands system resources and takes longer to complete the test 
– Load time: 8m59s 
– Test time: 54.09s 
– Transaction rate: 1.4k/sec 
DeepDB: utilizes ~100% of available system resources to complete the test 
– Load time: 23.96s 
– Test time: 5.82s 
– Transaction rate: 15k/sec 
Key Benefits: Database operations take full advantage of all allocated system 
resources and dramatically improve system performance
Intelligent Caching 
• Adaptive algorithms manage cache usage 
– Dynamically sized data segments 
– Point read capable: no page operations 
• In-memory compression 
– Maximizes cache effectiveness 
– Adaptive operation manages compression vs. performance 
• Summary indexing reduces cache ‘thrashing’ 
– Only pull in the data that is relevant 
– No need to pull ‘pages’ in to cache 
Key Benefits: Improves overall system performance by staying in cache more 
often than standard MySQL
Built for the Cloud 
• Designed for easy deployments with virtually no 
configuration required in most cases 
• No off-line operations 
– Continuous defragmentation & optimization 
– No downtime for scheduled maintenance 
• Linear performance and consistent low latency 
• Instantaneous startup and shutdown 
• No performance degradations due to B+ Tree 
rebalancing or log flushing 
Key Benefits: Rapid deployment with almost no configuration and no offline 
maintenance operations. Delivers greatly enhanced performance 
when using network-based storage
DeepDB for MySQL 
A storage engine that breaks through current 
performance and scaling limitations 
– Easy-to-install plugin replacement for the 
InnoDB storage engine 
– Requires no application or schema changes 
– Scales up performance of existing systems 
– Increases practical data sizes and complexity 
– Billions of rows with high index densities 
– High performance index creation/maintenance 
– High performance ACID transactions with 
consistently low latency 
– Reduced query latencies 
Application Examples: 
Wordpress | SugarCRM | Drupal 
PHP | Perl | Python | Etc. 
Apache Server MySQL 
DeepDB InnoDB 
CentOS | RHEL | Ubuntu 
Bare metal | Virtualized | Cloud
Benefits The Entire Data 
Lifecycle 
Load 
- Delimited files 
- Dump files 
Operate 
- Transactions 
- Compress 
Analyze 
- Replicate 
- Query 
Protect 
- Backup 
- Recover 
DeepDB 
Provides enhanced 
scaling and 
performance across 
a broad set of use 
cases 
Compatible with all 
existing MySQL 
applications and 
tool chains 
Designed to fully 
leverage today's 
powerful 
computing 
systems 
Optimized for 
deployment in the 
cloud with adaptive 
behavior and on-line 
maintenance
Data Loading 
DeepDB 
Reduces data 
loading times by 
20x or more 
Whether you are 
loading delimited 
files or restoring 
MySQL dump files 
DeepDB can 
dramatically reduce 
your load times 
DeepDB’s data 
loading advantage 
can be seen in both 
dedicated bare-metal 
and cloud based 
deployments
Transactional Performance Use Cases 
(All tests performed on MySQL 5.5) 
Test | MySQL with DeepDB | MySQL with InnoDB | Improvement 
Streaming Data test (Machine-to-Machine; iiBench maximum transactions/second with a single index) | 3.795M/sec | 217k/sec | 17x 
Transactional Workload test (Financial; Sysbench transaction rate) | 15,083/sec | 1,381/sec | 11x 
Complex Transactional test (e-Commerce; DBT-2 transaction rate using HDD) | 205,184/min | 15,086/min | 13.6x 
Social Media Transactional test (Twitter; iiBench with 250M rows, 7 indexes with composite keys): 
– Database creation | 15 minutes | 24 hours | 96x 
– First query from cold start | 50 seconds | 5.5 minutes | 6.6x 
– Second query from cold start | 1 second | 240 seconds | 240x 
– Disk storage footprint (uncompressed) | 29GB | 50GB | 42% smaller 
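The improvement column can be checked with simple arithmetic on the figures quoted above. The sketch below only recomputes ratios from the slide's own numbers; the helper name `ratio` and the dictionary labels are illustrative.

```python
def ratio(deepdb, innodb, higher_is_better=True):
    """Improvement factor of DeepDB over InnoDB for one measurement."""
    return deepdb / innodb if higher_is_better else innodb / deepdb


results = {
    "iiBench single index (tx/sec)": ratio(3_795_000, 217_000),
    "Sysbench (tx/sec)": ratio(15_083, 1_381),
    "DBT-2 on HDD (tx/min)": ratio(205_184, 15_086),
    "Database creation (minutes)": ratio(15, 24 * 60, higher_is_better=False),
    "First query, cold start (seconds)": ratio(50, 5.5 * 60, higher_is_better=False),
    "Second query, cold start (seconds)": ratio(1, 240, higher_is_better=False),
}

# Footprint is reported as a reduction rather than a multiple: 29GB versus 50GB.
footprint_reduction = 1 - 29 / 50

for name, value in results.items():
    print(f"{name}: {value:.1f}x")
print(f"Disk footprint reduction: {footprint_reduction:.0%}")
```

The recomputed factors agree with the rounded values on the slide, for example about 17.5 for the streaming test shown as 17x and about 10.9 for Sysbench shown as 11x.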
Advantage in the Cloud 
Reduces Disk Size Requirements 
[Chart: On Disk Data Size, in GB (y-axis, 0 to 6,000), for InnoDB and DeepDB, uncompressed and compressed. Data labels as extracted: 5,400; 2,800; 3,780; 640.]
Cut Your Query Times in Half 
DeepDB improves query speed by 1.5 to 2 times when measured against the DBT3 benchmark 
DBT3 Performance Comparison Summary 
Average query performance across various configurations (DeepDB versus InnoDB) 
Configuration | Times faster 
SF=1, 2G, Avg 2 runs | 1.75 
SF=1, 16G, Avg 5 runs | 1.86 
SF=1, 16G, Key Comp, Avg 5 runs | 2.00 
SF=2, 16G, Avg 5 runs | 1.88 
SF=2, 16G, Key Comp, Avg 5 runs | 1.93 
SF=5, 16G, Avg 2 runs | 2.06 
SF=5, 16G, Key Comp, Avg 5 runs | 1.62 
Overall Average | 1.87 
Protect Your Data 
DeepDB architecture eliminates potential data 
integrity problems, and patent-pending error 
recovery completes in just seconds 
• No updates in place 
• No memory map 
Unique data structures support real-time and 
continuous streaming backups to ensure data is 
always protected 
• Append-only files provide natural incremental 
backups 
DeepDB 
Ensures your 
data is 
continually 
backed up and 
available 
DeepDB Advantages 
The Ultimate 
MySQL 
Storage Engine 
50% Smaller Data 
Footprint 
Reduces compressed or 
uncompressed data to 
less than half the size of 
InnoDB 
5x-10x 
Improvement in ACID 
transactional throughput 
Plug-in 
Replacement for 
InnoDB 
Install DeepDB without 
any changes to existing 
MySQL Applications 
HDD=SSD 
Increases effective HDD 
throughput to near SSD 
levels and extends SSD 
life up to 10x 
1B+ Rows 
Provides high 
performance support for 
very large tables 
20x Faster Data 
Loading 
Concurrent operations 
and IO optimizations 
reduce load times 
Run Queries Twice 
as Fast 
Summary Indexing 
techniques enable ultra 
low latency queries 
Real-Time Backups 
Create streaming 
backups with 
snapshotting 
Low Latency 
Replicas 
Efficiently scale out 
analytics and read-heavy 
workloads
Try DeepDB 
yourself! 
http://deep.is/downloads/
Thank You! 

Contenu connexe

Tendances

SKILLWISE-SSIS DESIGN PATTERN FOR DATA WAREHOUSING
SKILLWISE-SSIS DESIGN PATTERN FOR DATA WAREHOUSINGSKILLWISE-SSIS DESIGN PATTERN FOR DATA WAREHOUSING
SKILLWISE-SSIS DESIGN PATTERN FOR DATA WAREHOUSINGSkillwise Group
 
Designing, Building, and Maintaining Large Cubes using Lessons Learned
Designing, Building, and Maintaining Large Cubes using Lessons LearnedDesigning, Building, and Maintaining Large Cubes using Lessons Learned
Designing, Building, and Maintaining Large Cubes using Lessons LearnedDenny Lee
 
Advanced integration services on microsoft ssis 1
Advanced integration services on microsoft ssis 1Advanced integration services on microsoft ssis 1
Advanced integration services on microsoft ssis 1Skillwise Group
 
A Scalable Data Transformation Framework using the Hadoop Ecosystem
A Scalable Data Transformation Framework using the Hadoop EcosystemA Scalable Data Transformation Framework using the Hadoop Ecosystem
A Scalable Data Transformation Framework using the Hadoop EcosystemSerendio Inc.
 
Oracle12c Database in-memory Data Sheet
Oracle12c Database in-memory Data SheetOracle12c Database in-memory Data Sheet
Oracle12c Database in-memory Data SheetOracle
 
Capacity Management of an ETL System
Capacity Management of an ETL SystemCapacity Management of an ETL System
Capacity Management of an ETL SystemASHOK BHATLA
 
Oracle-12c Online Training by Quontra Solutions
 Oracle-12c Online Training by Quontra Solutions Oracle-12c Online Training by Quontra Solutions
Oracle-12c Online Training by Quontra SolutionsQuontra Solutions
 
DATASTAGE AND QUALITY STAGE 9.1 ONLINE TRAINING
DATASTAGE AND QUALITY STAGE 9.1 ONLINE TRAININGDATASTAGE AND QUALITY STAGE 9.1 ONLINE TRAINING
DATASTAGE AND QUALITY STAGE 9.1 ONLINE TRAININGDatawarehouse Trainings
 
ORACLE 12C-New-Features
ORACLE 12C-New-FeaturesORACLE 12C-New-Features
ORACLE 12C-New-FeaturesNavneet Upneja
 
A Scalable Data Transformation Framework using Hadoop Ecosystem
A Scalable Data Transformation Framework using Hadoop EcosystemA Scalable Data Transformation Framework using Hadoop Ecosystem
A Scalable Data Transformation Framework using Hadoop EcosystemDataWorks Summit
 
Data protection for oracle backup & recovery for oracle databases
Data protection for oracle  backup & recovery for oracle databasesData protection for oracle  backup & recovery for oracle databases
Data protection for oracle backup & recovery for oracle databasessolarisyougood
 
Tuning data warehouse
Tuning data warehouseTuning data warehouse
Tuning data warehouseSrinivasan R
 
Oracle real application_cluster
Oracle real application_clusterOracle real application_cluster
Oracle real application_clusterPrabhat gangwar
 
Partitioning your Oracle Data Warehouse - Just a simple task?
Partitioning your Oracle Data Warehouse - Just a simple task?Partitioning your Oracle Data Warehouse - Just a simple task?
Partitioning your Oracle Data Warehouse - Just a simple task?Trivadis
 
Scaling Your Database In The Cloud
Scaling Your Database In The CloudScaling Your Database In The Cloud
Scaling Your Database In The CloudCory Isaacson
 
The Plan Cache Whisperer - Performance Tuning SQL Server
The Plan Cache Whisperer - Performance Tuning SQL ServerThe Plan Cache Whisperer - Performance Tuning SQL Server
The Plan Cache Whisperer - Performance Tuning SQL ServerJason Strate
 

Tendances (20)

SKILLWISE-SSIS DESIGN PATTERN FOR DATA WAREHOUSING
SKILLWISE-SSIS DESIGN PATTERN FOR DATA WAREHOUSINGSKILLWISE-SSIS DESIGN PATTERN FOR DATA WAREHOUSING
SKILLWISE-SSIS DESIGN PATTERN FOR DATA WAREHOUSING
 
Designing, Building, and Maintaining Large Cubes using Lessons Learned
Designing, Building, and Maintaining Large Cubes using Lessons LearnedDesigning, Building, and Maintaining Large Cubes using Lessons Learned
Designing, Building, and Maintaining Large Cubes using Lessons Learned
 
Advanced integration services on microsoft ssis 1
Advanced integration services on microsoft ssis 1Advanced integration services on microsoft ssis 1
Advanced integration services on microsoft ssis 1
 
A Scalable Data Transformation Framework using the Hadoop Ecosystem
A Scalable Data Transformation Framework using the Hadoop EcosystemA Scalable Data Transformation Framework using the Hadoop Ecosystem
A Scalable Data Transformation Framework using the Hadoop Ecosystem
 
Oracle12c Database in-memory Data Sheet
Oracle12c Database in-memory Data SheetOracle12c Database in-memory Data Sheet
Oracle12c Database in-memory Data Sheet
 
Capacity Management of an ETL System
Capacity Management of an ETL SystemCapacity Management of an ETL System
Capacity Management of an ETL System
 
Oracle-12c Online Training by Quontra Solutions
 Oracle-12c Online Training by Quontra Solutions Oracle-12c Online Training by Quontra Solutions
Oracle-12c Online Training by Quontra Solutions
 
Oracle: DW Design
Oracle: DW DesignOracle: DW Design
Oracle: DW Design
 
DATASTAGE AND QUALITY STAGE 9.1 ONLINE TRAINING
DATASTAGE AND QUALITY STAGE 9.1 ONLINE TRAININGDATASTAGE AND QUALITY STAGE 9.1 ONLINE TRAINING
DATASTAGE AND QUALITY STAGE 9.1 ONLINE TRAINING
 
Oltp vs olap
Oltp vs olapOltp vs olap
Oltp vs olap
 
ORACLE 12C-New-Features
ORACLE 12C-New-FeaturesORACLE 12C-New-Features
ORACLE 12C-New-Features
 
A Scalable Data Transformation Framework using Hadoop Ecosystem
A Scalable Data Transformation Framework using Hadoop EcosystemA Scalable Data Transformation Framework using Hadoop Ecosystem
A Scalable Data Transformation Framework using Hadoop Ecosystem
 
Data protection for oracle backup & recovery for oracle databases
Data protection for oracle  backup & recovery for oracle databasesData protection for oracle  backup & recovery for oracle databases
Data protection for oracle backup & recovery for oracle databases
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousing
 
Tuning data warehouse
Tuning data warehouseTuning data warehouse
Tuning data warehouse
 
Teradata training
Teradata trainingTeradata training
Teradata training
 
Oracle real application_cluster
Oracle real application_clusterOracle real application_cluster
Oracle real application_cluster
 
Partitioning your Oracle Data Warehouse - Just a simple task?
Partitioning your Oracle Data Warehouse - Just a simple task?Partitioning your Oracle Data Warehouse - Just a simple task?
Partitioning your Oracle Data Warehouse - Just a simple task?
 
Scaling Your Database In The Cloud
Scaling Your Database In The CloudScaling Your Database In The Cloud
Scaling Your Database In The Cloud
 
The Plan Cache Whisperer - Performance Tuning SQL Server
The Plan Cache Whisperer - Performance Tuning SQL ServerThe Plan Cache Whisperer - Performance Tuning SQL Server
The Plan Cache Whisperer - Performance Tuning SQL Server
 

En vedette

Amazon S3 storage engine plugin for MySQL
Amazon S3 storage engine plugin for MySQLAmazon S3 storage engine plugin for MySQL
Amazon S3 storage engine plugin for MySQLKapil Mohan
 
Openfest15 MySQL Plugin Development
Openfest15 MySQL Plugin DevelopmentOpenfest15 MySQL Plugin Development
Openfest15 MySQL Plugin DevelopmentGeorgi Kodinov
 
MySQL Performance Tuning. Part 1: MySQL Configuration (includes MySQL 5.7)
MySQL Performance Tuning. Part 1: MySQL Configuration (includes MySQL 5.7)MySQL Performance Tuning. Part 1: MySQL Configuration (includes MySQL 5.7)
MySQL Performance Tuning. Part 1: MySQL Configuration (includes MySQL 5.7)Aurimas Mikalauskas
 
MariaDB: in-depth (hands on training in Seoul)
MariaDB: in-depth (hands on training in Seoul)MariaDB: in-depth (hands on training in Seoul)
MariaDB: in-depth (hands on training in Seoul)Colin Charles
 
Ora mysql bothGetting the best of both worlds with Oracle 11g and MySQL Enter...
Ora mysql bothGetting the best of both worlds with Oracle 11g and MySQL Enter...Ora mysql bothGetting the best of both worlds with Oracle 11g and MySQL Enter...
Ora mysql bothGetting the best of both worlds with Oracle 11g and MySQL Enter...Ivan Zoratti
 
MySQL Storage Engines Landscape
MySQL Storage Engines LandscapeMySQL Storage Engines Landscape
MySQL Storage Engines LandscapeColin Charles
 
MySQL Storage Engines
MySQL Storage EnginesMySQL Storage Engines
MySQL Storage EnginesKarthik .P.R
 

En vedette (7)

Amazon S3 storage engine plugin for MySQL
Amazon S3 storage engine plugin for MySQLAmazon S3 storage engine plugin for MySQL
Amazon S3 storage engine plugin for MySQL
 
Openfest15 MySQL Plugin Development
Openfest15 MySQL Plugin DevelopmentOpenfest15 MySQL Plugin Development
Openfest15 MySQL Plugin Development
 
MySQL Performance Tuning. Part 1: MySQL Configuration (includes MySQL 5.7)
MySQL Performance Tuning. Part 1: MySQL Configuration (includes MySQL 5.7)MySQL Performance Tuning. Part 1: MySQL Configuration (includes MySQL 5.7)
MySQL Performance Tuning. Part 1: MySQL Configuration (includes MySQL 5.7)
 
MariaDB: in-depth (hands on training in Seoul)
MariaDB: in-depth (hands on training in Seoul)MariaDB: in-depth (hands on training in Seoul)
MariaDB: in-depth (hands on training in Seoul)
 
Ora mysql bothGetting the best of both worlds with Oracle 11g and MySQL Enter...
Ora mysql bothGetting the best of both worlds with Oracle 11g and MySQL Enter...Ora mysql bothGetting the best of both worlds with Oracle 11g and MySQL Enter...
Ora mysql bothGetting the best of both worlds with Oracle 11g and MySQL Enter...
 
MySQL Storage Engines Landscape
MySQL Storage Engines LandscapeMySQL Storage Engines Landscape
MySQL Storage Engines Landscape
 
MySQL Storage Engines
MySQL Storage EnginesMySQL Storage Engines
MySQL Storage Engines
 

Similaire à Best storage engine for MySQL

Building a High Performance Analytics Platform
Building a High Performance Analytics PlatformBuilding a High Performance Analytics Platform
Building a High Performance Analytics PlatformSantanu Dey
 
Meta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinarMeta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinarMichael Hiskey
 
Optimize Your Reporting In Less Than 10 Minutes
Optimize Your Reporting In Less Than 10 MinutesOptimize Your Reporting In Less Than 10 Minutes
Optimize Your Reporting In Less Than 10 MinutesAlexandra Sasha Blumenfeld
 
Best Practices – Extreme Performance with Data Warehousing on Oracle Databa...
Best Practices –  Extreme Performance with Data Warehousing  on Oracle Databa...Best Practices –  Extreme Performance with Data Warehousing  on Oracle Databa...
Best Practices – Extreme Performance with Data Warehousing on Oracle Databa...Edgar Alejandro Villegas
 
Presentation cloud control enterprise manager 12c
Presentation   cloud control enterprise manager 12cPresentation   cloud control enterprise manager 12c
Presentation cloud control enterprise manager 12cxKinAnx
 
Business Insight 2014 - Microsofts nye BI og database platform - Erling Skaal...
Business Insight 2014 - Microsofts nye BI og database platform - Erling Skaal...Business Insight 2014 - Microsofts nye BI og database platform - Erling Skaal...
Business Insight 2014 - Microsofts nye BI og database platform - Erling Skaal...Microsoft
 
InfiniFlux vs_RDBMS
InfiniFlux vs_RDBMSInfiniFlux vs_RDBMS
InfiniFlux vs_RDBMSInfiniFlux
 
Meta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinarMeta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinarKognitio
 
Solving Office 365 Big Challenges using Cassandra + Spark
Solving Office 365 Big Challenges using Cassandra + Spark Solving Office 365 Big Challenges using Cassandra + Spark
Solving Office 365 Big Challenges using Cassandra + Spark Anubhav Kale
 
Azure SQL DB V12 at your service by Pieter Vanhove
Azure SQL DB V12 at your service by Pieter VanhoveAzure SQL DB V12 at your service by Pieter Vanhove
Azure SQL DB V12 at your service by Pieter VanhoveITProceed
 
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...DATAVERSITY
 
Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)James Serra
 
Using SAS GRID v 9 with Isilon F810
Using SAS GRID v 9 with Isilon F810Using SAS GRID v 9 with Isilon F810
Using SAS GRID v 9 with Isilon F810Boni Bruno
 
Revolutionary Storage for Modern Databases, Applications and Infrastrcture
Revolutionary Storage for Modern Databases, Applications and InfrastrctureRevolutionary Storage for Modern Databases, Applications and Infrastrcture
Revolutionary Storage for Modern Databases, Applications and Infrastrcturesabnees
 
SharePoint 2010 database maintenance
SharePoint 2010 database maintenanceSharePoint 2010 database maintenance
SharePoint 2010 database maintenanceMatt Ranlett
 
Building Your Data Warehouse with Amazon Redshift
Building Your Data Warehouse with Amazon RedshiftBuilding Your Data Warehouse with Amazon Redshift
Building Your Data Warehouse with Amazon RedshiftAmazon Web Services
 

Similaire à Best storage engine for MySQL (20)

Building a High Performance Analytics Platform
Building a High Performance Analytics PlatformBuilding a High Performance Analytics Platform
Building a High Performance Analytics Platform
 
Meta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinarMeta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinar
 
Optimize Your Reporting In Less Than 10 Minutes
Optimize Your Reporting In Less Than 10 MinutesOptimize Your Reporting In Less Than 10 Minutes
Optimize Your Reporting In Less Than 10 Minutes
 
Best Practices – Extreme Performance with Data Warehousing on Oracle Databa...
Best Practices –  Extreme Performance with Data Warehousing  on Oracle Databa...Best Practices –  Extreme Performance with Data Warehousing  on Oracle Databa...
Best Practices – Extreme Performance with Data Warehousing on Oracle Databa...
 
Presentation cloud control enterprise manager 12c
Presentation   cloud control enterprise manager 12cPresentation   cloud control enterprise manager 12c
Presentation cloud control enterprise manager 12c
 
Business Insight 2014 - Microsofts nye BI og database platform - Erling Skaal...
Business Insight 2014 - Microsofts nye BI og database platform - Erling Skaal...Business Insight 2014 - Microsofts nye BI og database platform - Erling Skaal...
Business Insight 2014 - Microsofts nye BI og database platform - Erling Skaal...
 
InfiniFlux vs_RDBMS
InfiniFlux vs_RDBMSInfiniFlux vs_RDBMS
InfiniFlux vs_RDBMS
 
Meta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinarMeta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinar
 
Deep Dive on Amazon Redshift
Deep Dive on Amazon RedshiftDeep Dive on Amazon Redshift
Deep Dive on Amazon Redshift
 
Solving Office 365 Big Challenges using Cassandra + Spark
Solving Office 365 Big Challenges using Cassandra + Spark Solving Office 365 Big Challenges using Cassandra + Spark
Solving Office 365 Big Challenges using Cassandra + Spark
 
Redshift overview
Redshift overviewRedshift overview
Redshift overview
 
Azure data platform overview
Azure data platform overviewAzure data platform overview
Azure data platform overview
 
Azure SQL DB V12 at your service by Pieter Vanhove
Azure SQL DB V12 at your service by Pieter VanhoveAzure SQL DB V12 at your service by Pieter Vanhove
Azure SQL DB V12 at your service by Pieter Vanhove
 
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
 
Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)
 
Using SAS GRID v 9 with Isilon F810
Using SAS GRID v 9 with Isilon F810Using SAS GRID v 9 with Isilon F810
Using SAS GRID v 9 with Isilon F810
 
Revolutionary Storage for Modern Databases, Applications and Infrastrcture
Revolutionary Storage for Modern Databases, Applications and InfrastrctureRevolutionary Storage for Modern Databases, Applications and Infrastrcture
Revolutionary Storage for Modern Databases, Applications and Infrastrcture
 
SharePoint 2010 database maintenance
SharePoint 2010 database maintenanceSharePoint 2010 database maintenance
SharePoint 2010 database maintenance
 
Deep Dive on Amazon Redshift
Deep Dive on Amazon RedshiftDeep Dive on Amazon Redshift
Deep Dive on Amazon Redshift
 
Building Your Data Warehouse with Amazon Redshift
Building Your Data Warehouse with Amazon RedshiftBuilding Your Data Warehouse with Amazon Redshift
Building Your Data Warehouse with Amazon Redshift
 

Dernier

Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Delhi Call girls
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Delhi Call girls
 


Best storage engine for MySQL

  • 1. DeepDB for MySQL® Overview, July 2014
  • 2. The World We Live In… • According to IDC, the Database software market has a CAGR of 34.2% • Wal-Mart generates 1 million new database records every hour • Chevron generates data at a rate of 2TB/day! • According to the Data Warehousing Institute 46% of companies plan to replace their existing data warehousing platforms • Every day, we create 2.5 quintillion bytes of data — so much that 90% of the data in the world today has been created in the last two years alone.
  • 3. MySQL Challenges • Performance degrades as table sizes get larger – Limitations of the underlying computer science • Highly indexed schemas negatively impact performance – More indexes help query performance but hurt transactions • Poor performance with complex queries – Many table joins • Data loading times are slow due to poor concurrency – Table locking and single-threaded operations • Backup time and performance impact – Big databases are slow to back up, and backups affect system performance
  • 4. Technology Limitations Most relational databases use traditional B+ Trees, which have architectural limitations that become apparent with large data sets or heavy indexing. [Chart: InnoDB insert rate versus row count while inserting rows into a table with 7 indexes (using iiBench with 10 clients, insert only, with secondary indexes). Insert rate axis: 0 to 120,000. Row count axis: 997,000 to 90,652,000. Elapsed time: 40,600 seconds.] Running on a SoftLayer 32-core system with 32GB RAM and a 4-drive HDD RAID5. MySQL 5.5.35 running on Ubuntu 12.04. Key InnoDB parameters: innodb_buffer_pool_size=4G, innodb_flush_log_at_trx_commit=0, innodb_flush_method=O_DIRECT, innodb_log_file_size=100M, innodb_log_files_in_group=2, innodb_log_buffer_size=16M
  • 5. Cache Ahead Summary Index Tree • Derived from the classic B+ Tree • Embedded statistics and other meta-data in the nodes improve both tree navigation and indexing • Branch node segments can vary in size based on actual data values • Summary nodes provide a mechanism to navigate extremely large tables by minimizing the number of branches walked • Wider trees with embedded meta-data enhance search and modification operations CASI Tree Instantiations: • A CASI Tree exists both in memory and on disk for each table and index • The structures of the Tree on disk and in memory are different • The (re)organization of the Tree on disk happens asynchronously from the one in memory, based on adaptive algorithms, to yield improved disk I/O and CPU concurrency [Diagram: tree with a root node, branch nodes, and summary nodes]
  • 6. CASI Tree Benefits • CONSTANT TIME INDEXING: Lightning-fast indexing at extreme scale • SEGMENTED COLUMN STORE: Accelerates analytic operations and data management at scale • STREAMING I/O: Maximizes disk throughput with highly efficient use of IOPS • EXTREME CONCURRENCY: Minimizes locks and wait states to maximize CPU throughput • INTELLIGENT CACHING: Uses adaptive segment sizes and summaries to eliminate many disk reads • BUILT FOR THE CLOUD: Adaptive configuration and continuous optimization eliminate scheduled downtime CASI Tree Principles: • Always try to append data to the file (i.e. don't seek, use the current seek position) • Read data sequentially (i.e. don't seek, use the current seek position for the next sequence of reads) • Continually rewrite and reorder data so that the two principles above are met
  • 7. Constant Time Indexing Minimizes index cost enabling high performance heavily indexed tables Different data structures on disk and in memory All work is performed in constant-time eliminating the need for periodic flushing Streaming File I/O (No memory map page size limitations) In Memory: Enhanced B+ Tree • Optimized for ‘wide’ nodes with accelerated operations • Stores index summaries to achieve great scale while maximizing cache effectiveness • Values are stored independently of the tree • Tree rebalancing occurs only in memory – no impact on data stored on disk • No fixed page/block sizes On Disk: Segmented Column Store • Highly optimized for on-disk read/write access • Never requires operational/in-place rebalancing • All previous database states are available • Efficiently supports variable size keys, values and ‘point reads’ • Utilizes segmented column store technology for indexes and columns Key Benefits: Increases maximum practical table sizes and improves analytic performance by allowing for more indexing
  • 8. Segmented Column Store Structure of the index files for the database – Provides the functional capabilities of a column store – Simultaneously read and write optimized – Instantaneous database start up/shut down – Columns are updated in tandem with value changes – Consistent performance and latency; optimized in real time – Columns consist of variable length segments – Each segment is a block of ordered keys, references to rows and meta-data – Changes to the key space require only delta updates Optimized for real-time analytics – Embedded statistical data in each segment – Allows for heavy indexing to improve query performance – Enables continuous transactional data feed Suited for high levels of compression – Compact representation of keys with summarization – Flexible segment and delta compression [Diagram: segmented column store file layout with a header, Segment A, Segment B, and a delta segment holding changes to Segment A with a back reference; each segment carries segment type & size fields, meta-data, and keys and/or values] Key Benefits: Excellent compression facilities and improved query performance. Supports continuous streaming backups with snapshots
  • 9. Streaming I/O • Massively optimized, delivering near wire speed throughput • Append-only file structures virtually eliminate disk seeks • Concurrent operations for updates in memory & on disk • Optimizations for SSD, HDD, and in-memory-only operation • Minimizes I/O wait states [Diagram: data streams entering DeepDB, with streaming transactional state logging and streaming indexing] Key Benefits: Achieves near-SSD performance with magnetic HDDs. Extends the life expectancy of SSDs with built-in wear leveling and no write amplification
  • 10. Extreme Concurrency Running the Sysbench test on a 32 CPU core system with 32 attached clients • MySQL with InnoDB: Strands system resources and takes longer to complete the test. Load time 8m59s. Test time 54.09s. Transaction rate: 1.4k/sec • MySQL with DeepDB: Utilizes ~100% of available system resources to complete the test. Load time 23.96s. Test time 5.82s. Transaction rate: 15k/sec Key Benefits: Database operations take full advantage of all allocated system resources and dramatically improve system performance
  • 11. Intelligent Caching • Adaptive algorithms manage cache usage – Dynamically sized data segments – Point read capable: no page operations • In-memory compression – Maximizes cache effectiveness – Adaptive operation manages compression vs. performance • Summary indexing reduces cache ‘thrashing’ – Only pull in the data that is relevant – No need to pull ‘pages’ into cache Key Benefits: Improves overall system performance by staying in cache more often than standard MySQL
  • 12. Built for the Cloud • Designed for easy deployments with virtually no configuration required in most cases • No off-line operations – Continuous defragmentation & optimization – No downtime for scheduled maintenance • Linear performance and consistent low latency • Instantaneous startup and shutdown • No performance degradations due to B+ Tree rebalancing or log flushing Key Benefits: Rapid deployment with almost no configuration and no off-line maintenance operations. Delivers greatly enhanced performance when using network-based storage
  • 13. DeepDB for MySQL A storage engine that breaks through current performance and scaling limitations – Easy-to-install plugin replacement for the InnoDB storage engine – Requires no application or schema changes – Scales up performance of existing systems – Increases practical data sizes and complexity – Billions of rows with high index densities – High performance index creation/maintenance – High performance ACID transactions with consistently low latency – Reduced query latencies [Stack diagram, top to bottom: Application Examples: Wordpress | SugarCRM | Drupal; PHP | Perl | Python | Etc.; Apache Server; MySQL; DeepDB | InnoDB; CentOS | RHEL | Ubuntu; Bare metal | Virtualized | Cloud]
  • 14. Benefits the Entire Data Lifecycle • Load: delimited files, dump files • Operate: transactions, compress • Analyze: replicate, query • Protect: backup, recover DeepDB: • Provides enhanced scaling and performance across a broad set of use cases • Compatible with all existing MySQL applications and tool chains • Designed to fully leverage today's powerful computing systems • Optimized for deployment in the cloud with adaptive behavior and on-line maintenance
  • 15. Data Loading DeepDB reduces data loading times by 20x or more. Whether you are loading delimited files or restoring MySQL dump files, DeepDB can dramatically reduce your load times. DeepDB’s data loading advantage can be seen in both dedicated bare-metal and cloud-based deployments.
  • 16. Transactional Performance Use Cases (all tests performed on MySQL 5.5)
      Test | MySQL with DeepDB | MySQL with InnoDB | Improvement
      Streaming Data test (Machine-to-Machine; iiBench maximum transactions/second with single index) | 3.795M/sec | 217k/sec | 17x
      Transactional Workload Test (Financial; Sysbench transaction rate) | 15,083/sec | 1,381/sec | 11x
      Complex Transactional Test (e-Commerce; DBT-2 transaction rate using HDD) | 205,184/min | 15,086/min | 13.6x
      Social Media Transactional Test (Twitter; iiBench with 250M rows, 7 indexes w/ composite keys):
      Database creation | 15 minutes | 24 hours | 96x
      First query from cold start | 50 seconds | 5.5 minutes | 6.6x
      Second query from cold start | 1 second | 240 seconds | 240x
      Disk storage footprint (uncompressed) | 29GB | 50GB | 42% smaller
  • 17. Advantage in the Cloud
  • 18. Reduces Disk Size Requirements [Bar chart: On Disk Data Size, size in GB (axis 0 to 6,000), InnoDB vs. DeepDB, uncompressed and compressed. Bar labels as extracted: 5,400, 2,800, 3,780, 640]
  • 19. Cut Your Query Times in Half DeepDB improves query speed by 1.5 to 2 times when measured against the DBT3 benchmark. [Bar chart: DBT3 Performance Comparison Summary, average query performance across various configurations, InnoDB vs. DeepDB, in times faster (axis 0 to 2.5). Configurations: SF=1, 2G, avg 2 runs; SF=1, 16G, avg 5 runs; SF=1, 16G, key comp, avg 5 runs; SF=2, 16G, avg 5 runs; SF=2, 16G, key comp, avg 5 runs; SF=5, 16G, avg 2 runs; SF=5, 16G, key comp, avg 5 runs; overall average. Bar labels as extracted: 1.75, 1.86, 2.00, 1.88, 1.93, 2.06, 1.62, 1.87]
  • 20. Protect Your Data The DeepDB architecture eliminates potential data integrity problems, and patent-pending error recovery completes in just seconds • No updates in place • No memory map Unique data structures support real-time and continuous streaming backups to ensure data is always protected • Append-only files provide natural incremental backups DeepDB ensures your data is continually backed up and available
  • 21. DeepDB Advantages: The Ultimate MySQL Storage Engine
      – 50% Smaller Data Footprint: Reduces compressed or uncompressed data to less than half the size of InnoDB
      – 5x-10x: Improvement in ACID transactional throughput
      – Plug-in Replacement for InnoDB: Install DeepDB without any changes to existing MySQL applications
      – HDD=SSD: Increases effective HDD throughput to near SSD levels and extends SSD life up to 10x
      – 1B+ Rows: Provides high performance support for very large tables
      – 20x Faster Data Loading: Concurrent operations and I/O optimizations reduce load times
      – Run Queries Twice as Fast: Summary indexing techniques enable ultra-low-latency queries
      – Real-Time Backups: Create streaming backups with snapshotting
      – Low Latency Replicas: Efficiently scale out analytics and read-heavy workloads
  • 22. Try DeepDB yourself! http://deep.is/downloads/
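
The three CASI Tree principles on slide 6 (append instead of seeking, read sequentially, continually rewrite and reorder) can be illustrated with a toy key-value log. This is only a sketch of the general append-only technique, not DeepDB's implementation; the class `AppendOnlyLog`, its methods, and the JSON-lines file format are all hypothetical names and choices made for illustration.

```python
import json
import os
import tempfile


class AppendOnlyLog:
    """Toy key-value log: writes only append, reads only scan forward."""

    def __init__(self, path):
        self.path = path
        open(self.path, "a").close()  # create the file if it is missing

    def put(self, key, value):
        # Principle 1: append at the end of the file, never seek back.
        with open(self.path, "a", encoding="utf-8") as f:
            f.write(json.dumps([key, value]) + "\n")

    def scan(self):
        # Principle 2: read records front to back in one sequential pass.
        with open(self.path, "r", encoding="utf-8") as f:
            for line in f:
                key, value = json.loads(line)
                yield key, value

    def latest(self):
        # Later records supersede earlier ones for the same key.
        state = {}
        for key, value in self.scan():
            state[key] = value
        return state

    def compact(self):
        # Principle 3: rewrite the live data in key order into a fresh file
        # (itself written front to back), then swap it in.
        tmp_path = self.path + ".compact"
        with open(tmp_path, "w", encoding="utf-8") as f:
            for key, value in sorted(self.latest().items()):
                f.write(json.dumps([key, value]) + "\n")
        os.replace(tmp_path, self.path)


with tempfile.TemporaryDirectory() as d:
    log = AppendOnlyLog(os.path.join(d, "table.log"))
    log.put("b", 1)
    log.put("a", 2)
    log.put("b", 3)  # supersedes the first record for "b"
    before = list(log.scan())
    log.compact()
    after = list(log.scan())
```

After `compact()`, the superseded record is gone and the remaining records sit in key order, so a later range read is a single forward pass. In this toy the rewrite is synchronous; the slides describe the on-disk reorganization as happening asynchronously.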
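
Slides 8 and 11 describe segments of ordered keys that carry embedded meta-data, so that only relevant data is pulled into cache. One common way to get that effect is to keep a minimum and maximum key per segment and skip any segment whose range excludes the lookup key. The sketch below shows that general idea only; `Segment`, `build_segments`, and `lookup` are hypothetical names, and the fixed segment size is a simplification (the slides say segments are variable length).

```python
from bisect import bisect_left
from dataclasses import dataclass, field


@dataclass
class Segment:
    """Hypothetical segment: ordered keys, row references, summary meta-data."""
    keys: list
    row_refs: list
    min_key: int = field(init=False)
    max_key: int = field(init=False)

    def __post_init__(self):
        # Summary meta-data derived from the ordered keys.
        self.min_key = self.keys[0]
        self.max_key = self.keys[-1]


def build_segments(sorted_pairs, segment_size):
    """Split (key, row_ref) pairs, already sorted by key, into segments."""
    segments = []
    for start in range(0, len(sorted_pairs), segment_size):
        chunk = sorted_pairs[start:start + segment_size]
        segments.append(Segment([k for k, _ in chunk], [r for _, r in chunk]))
    return segments


def lookup(segments, key):
    """Return (row_ref, segments_opened), skipping segments by summary."""
    opened = 0
    for seg in segments:
        if key < seg.min_key or key > seg.max_key:
            continue  # the summary alone rules this segment out
        opened += 1
        i = bisect_left(seg.keys, key)  # binary search inside the segment
        if i < len(seg.keys) and seg.keys[i] == key:
            return seg.row_refs[i], opened
    return None, opened


pairs = [(k, f"row-{k}") for k in range(0, 100, 2)]  # even keys 0..98
segments = build_segments(pairs, segment_size=10)
hit = lookup(segments, 42)
miss = lookup(segments, 43)
```

Both lookups open only one of the five segments; the other four are ruled out from their summaries without touching their keys.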
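
The improvement column on slide 16 can be rechecked from the two measured columns. The sketch below copies the figures from that slide, converts durations to common units, and computes each ratio; the variable and function names are illustrative only.

```python
# Figures copied from slide 16 as (test, DeepDB, InnoDB, higher_is_better).
# Throughput rows are higher-is-better; duration rows are lower-is-better.
rows = [
    ("iiBench max transactions/sec", 3_795_000, 217_000, True),
    ("Sysbench transactions/sec", 15_083, 1_381, True),
    ("DBT-2 transactions/min", 205_184, 15_086, True),
    ("Database creation (minutes)", 15, 24 * 60, False),
    ("First query, cold start (seconds)", 50, 5.5 * 60, False),
    ("Second query, cold start (seconds)", 1, 240, False),
]


def improvement(deepdb, innodb, higher_is_better):
    """Return how many times better the DeepDB figure is than InnoDB's."""
    return deepdb / innodb if higher_is_better else innodb / deepdb


ratios = {name: improvement(d, i, hib) for name, d, i, hib in rows}
for name, ratio in ratios.items():
    print(f"{name}: {ratio:.1f}x")

# The disk footprint row is a percentage reduction rather than a ratio.
footprint_reduction = (50 - 29) / 50
print(f"Disk footprint reduction: {footprint_reduction:.0%}")
```

The computed ratios agree with the slide to within rounding: the iiBench row works out to about 17.5x against the slide's 17x, and the Sysbench row to about 10.9x against 11x.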