SlideShare une entreprise Scribd logo
1  sur  26
1
Data & Analytics Convergence
Keith Manthey, CTO Analytics
Property of EMC. Not for further distribution
2
Source: EMC Digital Universe with Research and Analysis by IDC, The Digital Universe of
Opportunities: Rich Data and the Increasing Value of the Internet of Things, April 2014.
2020
4.4ZETTABYTES
44ZETTABYTES
10xMORE
DigitalUniverse 2014
2013
ZETTABYTE = 1,000,000,000,000,000,000,000 bytes
34.4 Billion 32GB Smartphones =1 ZETTABYTE
34.4 Billion Samsung S5’s end-to-end would circle the Earth 121.8 times
Property of EMC. Not for further distribution
3
4
© Copyright 2015 EMC Corporation. All rights reserved.
30B
DEVICES
7B
PEOPLE
1M+
NEW BUSINESSES
Source: Gartner Group, 2014
© Copyright 2015 EMC Corporation. All rights reserved.
2020: A NEW DIGITAL WORLD
Property of EMC. Not for further distribution
5
PRECISION
FARMING
DRESS THAT
DISPLAYS HOW
WE FEEL
CONTACT LENS
THAT CONTROLS
BLOOD SUGAR
THERMOSTAT
THAT KNOWS
YOU’RE AWAY
FITNESS BAND
THAT MEASURES
ACTIVITY LEVEL
GLASSES THAT
DIRECT US
WHERE TO GO
DRONES THAT
DELIVER OUR
GROCERIES
DIGITIZATION IS ALREADY BEGINNING
Property of EMC. Not for further distribution
6
© Copyright 2015 EMC Corporation. All rights reserved.
Many Industries Face Structural Change
Property of EMC. Not for further distribution
7
Analytics is about Data & Outcomes…
Property of EMC. Not for further distribution
8
Macro Market Trends
Courtesy of Wikibon
Courtesy of Infoworld
Property of EMC. Not for further distribution
9
Reasons for Change?
SKILLS
Operations
Growth of Data Type
Property of EMC. Not for further distribution
10
Philosophical - Database
Cache Logs
System Processes
(including Logical - Catalog + Physical Structures –
Reader/Writers)
Data
Storage
Instance
Traditional DB
Assumes:
• Query < 5% of Data
• Schema on Write
(Structured)
• All data confirms to
Schema (changes to
versioned data if
schema changes)
• Limited to compute
methods (SQL, UDF,
and R soon*)
Property of EMC. Not for further distribution
11
Philosophical - Hadoop
Spark MapReduce
HDFS
(including Logical – Name Node+ Physical Structures – Data
Node)
Data
Storage
YARN
Hadoop
Built for:
• Query 100% of Data
each time
• Schema on Read
(including multiple
versions over time)
• Unlimited in compute
methods (SQL,
Programmatic,
Tools(Spark, Storm,
R…))
Property of EMC. Not for further distribution
12
Comparison
Spark MapReduce
HDFS
(including Logical – Name Node+ Physical Structures – Data
Node)
Data
Storage
YARN
Cache Logs
System Processes
(including Logical - Catalog + Physical Structures –
Reader/Writers)
Data
Storage
Instance
SCALE UP – More CPUS/Memory
Vs
SCALE OUT – More Nodes
SCALE OUT – More Nodes
Property of EMC. Not for further distribution
13
Convergence
Single
Execution
Query Across
Structured
and
Unstructured
(“push down
processing”)
&
Hadoop
Query
Integration
Erasure
Coding
(HDFS-EC)
(enterprise
patterns)
&
Better
Operational
Support
&
Skills gaps
Property of EMC. Not for further distribution
14
What is the DB Convergence Play?
Per Microsoft, “PolyBase is
a T-SQL front end that
allows customers to query
data stored in HDFS”
Microsoft Polybase - Click here for original
IBM's Big SQL Product Overview
Property of EMC. Not for further distribution
15
But… Hadoop is about DAS
Property of EMC. Not for further distribution
16
Data Locality – Per Eric Brewer…
MSFT
Research Link
U. Cal
Berkeley
Original Link /
Paper
Property of EMC. Not for further distribution
17
Who is Eric Brewer?
• Eric Brewer is a UC
Berkeley Professor who
happens to be currently on
sabbatical working with
Google (VP of
Infrastructure).
• He proposed the CAP
Theorem in 1990
• Google records 40K hits on
“Brewer’s Theorem Proofs”
Property of EMC. Not for further distribution
18
It’s all Hadoop?
• Per Mike Olson at 2015 Strataconf, Hadoop is really
disappearing, with the real importance of discussion
on the applications on top of the platform
• It’s about Outcomes and use cases. As a result,
Machine Learning & Spark are gaining all the glory
– “How Old” Presentation from Strataconf
– IBM commits 3.5K associates to Apache Spark
– Microsoft buy Revolution Analytics to bring Machine
Learning to Databases
Property of EMC. Not for further distribution
19
What has transpired with Hadoop?
• Cloudera has cracked into the Operational Data Store
and Data Warehouse Gartner Quads. This has long
been held by traditional RDBMS entrants.
• Increased investment from Hadoop vendors around
items like Kudu and LLAP targeting OLTP workloads.
• Creation of a converged ACID Compliant RDBMS on
Hadoop
Property of EMC. Not for further distribution
20
Keith’s Predictions
• More Enterprise Patterns for Hadoop:
– Companies are running out of data center and network
space. The push for denser footprints are emerging
– Operations drives better reference architectures that match
their support model
– More focus on Interactive Queries and real time processing
– More converged pushes from other parties like Splice
– More use cases driving more adoption, but less about
Hadoop
• More Unstructured Data Support / Analytics for
Databases & ACID compliance upon Hadoop.
– To Quote Willie Sutton: “It’s where the money is…”
Property of EMC. Not for further distribution
21
Why does EMC Care?
• Enterprise Standard Storage Technology supporting
the World’s Databases
• Largest Enterprise Storage Vendor for Hadoop
Platforms (Isilon)
– Certified with Hortonworks and Cloudera, along with Pivotal
and IBM Big Insights
• Bring ease of use to difficult platform and ease of
convergence on products like Polybase.
Property of EMC. Not for further distribution
22
Appendix
Property of EMC. Not for further distribution
23
© Copyright 2015 EMC Corporation. All rights reserved.
Ethernet
Hadoop Architecture – DAS vs Isilon
NameNode
Ethernet
Compute Node Compute Node Compute Node
Compute NodeCompute Node Compute Node
name
node
name
node
name
node
datanode
Data Node + Compute Node Data Node + Compute Node Data Node + Compute Node
Data Node + Compute Node Data Node + Compute Node Data Node + Compute Node
Property of EMC. Not for further distribution
24
Traditional Hadoop POD
18 racks
Extended Time-to-Results
•Requires Additional “Data Staging” Storage
•Iterative Testing is Time Consuming
•Requires Copying of Data Several Times
Rigid Architecture
•Inefficient Floor Space
•Must Purchase Compute & Storage Together
•Storage Efficiency < 25%
Lacks Enterprise Features
•No Disaster Recovery, Snapshots
•Single Protocol (HDFS Only)
•Lacks Full Security Features
42U
SERIES
SERIES
SERIES
SERIES
SERIES
SERIES
SERIES
SERIES
SERIES
SERIES
=
~ 5PB Usable
Hadoop Storage
Isilon vHadoop
(no staging needed)
Hadoop POD:
Compute with Staging Storage
Isilon vHadoop
8 racks
Faster Time-to-Results
•Data Stays on the Isilon Cluster
•Allows for Rapid Iterative Testing Process
•Simplifies Hosting Workflow
Flexible Architecture
•Efficient Floor Space, Power & Cooling
•Leverage VMs for Flexible Deployments
•Storage Efficiency > 78%
Enterprise Capabilities
•Disaster Recovery, Snapshots
•SyncIQ-Data Replication Offsite
•Highly Secure Hosting Environment
42U
SERIES
SERIES
SERIES
SERIES
SERIES
SERIES
SERIES
SERIES
SERIES
SERIES
42U
SERIES
SERIES
SERIES
SERIES
SERIES
SERIES
SERIES
SERIES
SERIES
SERIES
42U
SERIES
SERIES
SERIES
SERIES
SERIES
SERIES
SERIES
SERIES
SERIES
SERIES
42U
SERIES
SERIES
SERIES
SERIES
SERIES
SERIES
SERIES
SERIES
SERIES
SERIES
Property of EMC. Not for further distribution
25
© Copyright 2015 EMC Corporation. All rights reserved.
1TB Hadoop Job Cycle Comparison
Isilon Significantly Reduces Time To Results
Traditional Hadoop+DAS
17:32 30:18 20:5020:50
Isilon Enabled Hadoop
18:51
Terasort Test on 1TB
DAS Isilon Benefit
MB/s Per Node 55.00 85.00 55%
Compute Min 30.18 18.51 -39%
TTR Min 89.30 18.51 -79%
Isilon Advantages
• Eliminates All Data Movement
• Allows for Virtualized Compute
• Significantly Less Cost
• 79% Faster TTR!
TTR- 89.3
Minutes!
Property of EMC. Not for further distribution
EMC Isilon Database Converged deck

Contenu connexe

Tendances

Real Time Interactive Queries IN HADOOP: Big Data Warehousing Meetup
Real Time Interactive Queries IN HADOOP: Big Data Warehousing MeetupReal Time Interactive Queries IN HADOOP: Big Data Warehousing Meetup
Real Time Interactive Queries IN HADOOP: Big Data Warehousing MeetupCaserta
 
Interactive query in hadoop
Interactive query in hadoopInteractive query in hadoop
Interactive query in hadoopRommel Garcia
 
Overview of stinger interactive query for hive
Overview of stinger   interactive query for hiveOverview of stinger   interactive query for hive
Overview of stinger interactive query for hiveDavid Kaiser
 
Hadoop in the Cloud: Real World Lessons from Enterprise Customers
Hadoop in the Cloud: Real World Lessons from Enterprise CustomersHadoop in the Cloud: Real World Lessons from Enterprise Customers
Hadoop in the Cloud: Real World Lessons from Enterprise CustomersDataWorks Summit/Hadoop Summit
 
Big Data Warehousing: Pig vs. Hive Comparison
Big Data Warehousing: Pig vs. Hive ComparisonBig Data Warehousing: Pig vs. Hive Comparison
Big Data Warehousing: Pig vs. Hive ComparisonCaserta
 
Wrangling Customer Usage Data with Hadoop
Wrangling Customer Usage Data with HadoopWrangling Customer Usage Data with Hadoop
Wrangling Customer Usage Data with HadoopDataWorks Summit
 
Hadoop-as-a-Service for Lifecycle Management Simplicity
Hadoop-as-a-Service for Lifecycle Management SimplicityHadoop-as-a-Service for Lifecycle Management Simplicity
Hadoop-as-a-Service for Lifecycle Management SimplicityDataWorks Summit
 
The Car of the Future - Autonomous, Connected, and Data Centric
The Car of the Future - Autonomous, Connected, and Data CentricThe Car of the Future - Autonomous, Connected, and Data Centric
The Car of the Future - Autonomous, Connected, and Data CentricDataWorks Summit
 
The Fundamentals Guide to HDP and HDInsight
The Fundamentals Guide to HDP and HDInsightThe Fundamentals Guide to HDP and HDInsight
The Fundamentals Guide to HDP and HDInsightGert Drapers
 
Unify Stream and Batch Processing using Dataflow, a Portable Programmable Mod...
Unify Stream and Batch Processing using Dataflow, a Portable Programmable Mod...Unify Stream and Batch Processing using Dataflow, a Portable Programmable Mod...
Unify Stream and Batch Processing using Dataflow, a Portable Programmable Mod...DataWorks Summit
 
Is Cloud a right Companion for Hadoop
Is Cloud a right Companion for HadoopIs Cloud a right Companion for Hadoop
Is Cloud a right Companion for HadoopDataWorks Summit
 
What it takes to run Hadoop at Scale: Yahoo! Perspectives
What it takes to run Hadoop at Scale: Yahoo! PerspectivesWhat it takes to run Hadoop at Scale: Yahoo! Perspectives
What it takes to run Hadoop at Scale: Yahoo! PerspectivesDataWorks Summit
 
Demystify Big Data Breakfast Briefing: Herb Cunitz, Hortonworks
Demystify Big Data Breakfast Briefing:  Herb Cunitz, HortonworksDemystify Big Data Breakfast Briefing:  Herb Cunitz, Hortonworks
Demystify Big Data Breakfast Briefing: Herb Cunitz, HortonworksHortonworks
 
HDInsight Hadoop on Windows Azure
HDInsight Hadoop on Windows AzureHDInsight Hadoop on Windows Azure
HDInsight Hadoop on Windows AzureLynn Langit
 
Apache Spark Workshop at Hadoop Summit
Apache Spark Workshop at Hadoop SummitApache Spark Workshop at Hadoop Summit
Apache Spark Workshop at Hadoop SummitSaptak Sen
 
Hadoop and Friends as Key Enabler of the IoE - Continental's Dynamic eHorizon
Hadoop and Friends as Key Enabler of the IoE - Continental's Dynamic eHorizonHadoop and Friends as Key Enabler of the IoE - Continental's Dynamic eHorizon
Hadoop and Friends as Key Enabler of the IoE - Continental's Dynamic eHorizonDataWorks Summit/Hadoop Summit
 
Enabling big data & AI workloads on the object store at DBS
Enabling big data & AI workloads on the object store at DBS Enabling big data & AI workloads on the object store at DBS
Enabling big data & AI workloads on the object store at DBS Alluxio, Inc.
 
How Big Data and Hadoop Integrated into BMC ControlM at CARFAX
How Big Data and Hadoop Integrated into BMC ControlM at CARFAXHow Big Data and Hadoop Integrated into BMC ControlM at CARFAX
How Big Data and Hadoop Integrated into BMC ControlM at CARFAXBMC Software
 
Big Data Architecture and Deployment
Big Data Architecture and DeploymentBig Data Architecture and Deployment
Big Data Architecture and DeploymentCisco Canada
 

Tendances (20)

Real Time Interactive Queries IN HADOOP: Big Data Warehousing Meetup
Real Time Interactive Queries IN HADOOP: Big Data Warehousing MeetupReal Time Interactive Queries IN HADOOP: Big Data Warehousing Meetup
Real Time Interactive Queries IN HADOOP: Big Data Warehousing Meetup
 
Interactive query in hadoop
Interactive query in hadoopInteractive query in hadoop
Interactive query in hadoop
 
Overview of stinger interactive query for hive
Overview of stinger   interactive query for hiveOverview of stinger   interactive query for hive
Overview of stinger interactive query for hive
 
Hadoop in the Cloud: Real World Lessons from Enterprise Customers
Hadoop in the Cloud: Real World Lessons from Enterprise CustomersHadoop in the Cloud: Real World Lessons from Enterprise Customers
Hadoop in the Cloud: Real World Lessons from Enterprise Customers
 
Big Data Warehousing: Pig vs. Hive Comparison
Big Data Warehousing: Pig vs. Hive ComparisonBig Data Warehousing: Pig vs. Hive Comparison
Big Data Warehousing: Pig vs. Hive Comparison
 
50 Shades of SQL
50 Shades of SQL50 Shades of SQL
50 Shades of SQL
 
Wrangling Customer Usage Data with Hadoop
Wrangling Customer Usage Data with HadoopWrangling Customer Usage Data with Hadoop
Wrangling Customer Usage Data with Hadoop
 
Hadoop-as-a-Service for Lifecycle Management Simplicity
Hadoop-as-a-Service for Lifecycle Management SimplicityHadoop-as-a-Service for Lifecycle Management Simplicity
Hadoop-as-a-Service for Lifecycle Management Simplicity
 
The Car of the Future - Autonomous, Connected, and Data Centric
The Car of the Future - Autonomous, Connected, and Data CentricThe Car of the Future - Autonomous, Connected, and Data Centric
The Car of the Future - Autonomous, Connected, and Data Centric
 
The Fundamentals Guide to HDP and HDInsight
The Fundamentals Guide to HDP and HDInsightThe Fundamentals Guide to HDP and HDInsight
The Fundamentals Guide to HDP and HDInsight
 
Unify Stream and Batch Processing using Dataflow, a Portable Programmable Mod...
Unify Stream and Batch Processing using Dataflow, a Portable Programmable Mod...Unify Stream and Batch Processing using Dataflow, a Portable Programmable Mod...
Unify Stream and Batch Processing using Dataflow, a Portable Programmable Mod...
 
Is Cloud a right Companion for Hadoop
Is Cloud a right Companion for HadoopIs Cloud a right Companion for Hadoop
Is Cloud a right Companion for Hadoop
 
What it takes to run Hadoop at Scale: Yahoo! Perspectives
What it takes to run Hadoop at Scale: Yahoo! PerspectivesWhat it takes to run Hadoop at Scale: Yahoo! Perspectives
What it takes to run Hadoop at Scale: Yahoo! Perspectives
 
Demystify Big Data Breakfast Briefing: Herb Cunitz, Hortonworks
Demystify Big Data Breakfast Briefing:  Herb Cunitz, HortonworksDemystify Big Data Breakfast Briefing:  Herb Cunitz, Hortonworks
Demystify Big Data Breakfast Briefing: Herb Cunitz, Hortonworks
 
HDInsight Hadoop on Windows Azure
HDInsight Hadoop on Windows AzureHDInsight Hadoop on Windows Azure
HDInsight Hadoop on Windows Azure
 
Apache Spark Workshop at Hadoop Summit
Apache Spark Workshop at Hadoop SummitApache Spark Workshop at Hadoop Summit
Apache Spark Workshop at Hadoop Summit
 
Hadoop and Friends as Key Enabler of the IoE - Continental's Dynamic eHorizon
Hadoop and Friends as Key Enabler of the IoE - Continental's Dynamic eHorizonHadoop and Friends as Key Enabler of the IoE - Continental's Dynamic eHorizon
Hadoop and Friends as Key Enabler of the IoE - Continental's Dynamic eHorizon
 
Enabling big data & AI workloads on the object store at DBS
Enabling big data & AI workloads on the object store at DBS Enabling big data & AI workloads on the object store at DBS
Enabling big data & AI workloads on the object store at DBS
 
How Big Data and Hadoop Integrated into BMC ControlM at CARFAX
How Big Data and Hadoop Integrated into BMC ControlM at CARFAXHow Big Data and Hadoop Integrated into BMC ControlM at CARFAX
How Big Data and Hadoop Integrated into BMC ControlM at CARFAX
 
Big Data Architecture and Deployment
Big Data Architecture and DeploymentBig Data Architecture and Deployment
Big Data Architecture and Deployment
 

Similaire à EMC Isilon Database Converged deck

20150704 benchmark and user experience in sahara weiting
20150704 benchmark and user experience in sahara weiting20150704 benchmark and user experience in sahara weiting
20150704 benchmark and user experience in sahara weitingWei Ting Chen
 
How the Development Bank of Singapore solves on-prem compute capacity challen...
How the Development Bank of Singapore solves on-prem compute capacity challen...How the Development Bank of Singapore solves on-prem compute capacity challen...
How the Development Bank of Singapore solves on-prem compute capacity challen...Alluxio, Inc.
 
Rain stor isilon_emc_real_Examine the Real Cost of Storing & Analyzing Your M...
Rain stor isilon_emc_real_Examine the Real Cost of Storing & Analyzing Your M...Rain stor isilon_emc_real_Examine the Real Cost of Storing & Analyzing Your M...
Rain stor isilon_emc_real_Examine the Real Cost of Storing & Analyzing Your M...RainStor
 
Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It! Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It! Cécile Poyet
 
Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It! Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It! Hortonworks
 
Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It! Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It! Cécile Poyet
 
2016 August POWER Up Your Insights - IBM System Summit Mumbai
2016 August POWER Up Your Insights - IBM System Summit Mumbai2016 August POWER Up Your Insights - IBM System Summit Mumbai
2016 August POWER Up Your Insights - IBM System Summit MumbaiAnand Haridass
 
Modernize Your Existing EDW with IBM Big SQL & Hortonworks Data Platform
Modernize Your Existing EDW with IBM Big SQL & Hortonworks Data PlatformModernize Your Existing EDW with IBM Big SQL & Hortonworks Data Platform
Modernize Your Existing EDW with IBM Big SQL & Hortonworks Data PlatformHortonworks
 
IBM Data Centric Systems & OpenPOWER
IBM Data Centric Systems & OpenPOWERIBM Data Centric Systems & OpenPOWER
IBM Data Centric Systems & OpenPOWERinside-BigData.com
 
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)Denodo
 
Big Data Goes Airborne. Propelling Your Big Data Initiative with Ironcluster ...
Big Data Goes Airborne. Propelling Your Big Data Initiative with Ironcluster ...Big Data Goes Airborne. Propelling Your Big Data Initiative with Ironcluster ...
Big Data Goes Airborne. Propelling Your Big Data Initiative with Ironcluster ...Precisely
 
EMC Big Data Solutions Overview
EMC Big Data Solutions OverviewEMC Big Data Solutions Overview
EMC Big Data Solutions Overviewwalshe1
 
Hadoop Summit Amsterdam 2013 - Making Hadoop Ready for Prime Time - Syncsort ...
Hadoop Summit Amsterdam 2013 - Making Hadoop Ready for Prime Time - Syncsort ...Hadoop Summit Amsterdam 2013 - Making Hadoop Ready for Prime Time - Syncsort ...
Hadoop Summit Amsterdam 2013 - Making Hadoop Ready for Prime Time - Syncsort ...Steven Totman
 
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization Denodo
 
Accelerating Big Data Insights
Accelerating Big Data InsightsAccelerating Big Data Insights
Accelerating Big Data InsightsDataWorks Summit
 
Webinar future dataintegration-datamesh-and-goldengatekafka
Webinar future dataintegration-datamesh-and-goldengatekafkaWebinar future dataintegration-datamesh-and-goldengatekafka
Webinar future dataintegration-datamesh-and-goldengatekafkaJeffrey T. Pollock
 
Azure Cafe Marketplace with Hortonworks March 31 2016
Azure Cafe Marketplace with Hortonworks March 31 2016Azure Cafe Marketplace with Hortonworks March 31 2016
Azure Cafe Marketplace with Hortonworks March 31 2016Joan Novino
 
LF Collab Summit 2015: ARM Servers for the Next Generation Date Center and Cl...
LF Collab Summit 2015: ARM Servers for the Next Generation Date Center and Cl...LF Collab Summit 2015: ARM Servers for the Next Generation Date Center and Cl...
LF Collab Summit 2015: ARM Servers for the Next Generation Date Center and Cl...The Linux Foundation
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureDATAVERSITY
 

Similaire à EMC Isilon Database Converged deck (20)

20150704 benchmark and user experience in sahara weiting
20150704 benchmark and user experience in sahara weiting20150704 benchmark and user experience in sahara weiting
20150704 benchmark and user experience in sahara weiting
 
How the Development Bank of Singapore solves on-prem compute capacity challen...
How the Development Bank of Singapore solves on-prem compute capacity challen...How the Development Bank of Singapore solves on-prem compute capacity challen...
How the Development Bank of Singapore solves on-prem compute capacity challen...
 
Rain stor isilon_emc_real_Examine the Real Cost of Storing & Analyzing Your M...
Rain stor isilon_emc_real_Examine the Real Cost of Storing & Analyzing Your M...Rain stor isilon_emc_real_Examine the Real Cost of Storing & Analyzing Your M...
Rain stor isilon_emc_real_Examine the Real Cost of Storing & Analyzing Your M...
 
Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It! Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It!
 
Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It! Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It!
 
Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It! Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It!
 
2016 August POWER Up Your Insights - IBM System Summit Mumbai
2016 August POWER Up Your Insights - IBM System Summit Mumbai2016 August POWER Up Your Insights - IBM System Summit Mumbai
2016 August POWER Up Your Insights - IBM System Summit Mumbai
 
Modernize Your Existing EDW with IBM Big SQL & Hortonworks Data Platform
Modernize Your Existing EDW with IBM Big SQL & Hortonworks Data PlatformModernize Your Existing EDW with IBM Big SQL & Hortonworks Data Platform
Modernize Your Existing EDW with IBM Big SQL & Hortonworks Data Platform
 
IBM Data Centric Systems & OpenPOWER
IBM Data Centric Systems & OpenPOWERIBM Data Centric Systems & OpenPOWER
IBM Data Centric Systems & OpenPOWER
 
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
 
Big Data Goes Airborne. Propelling Your Big Data Initiative with Ironcluster ...
Big Data Goes Airborne. Propelling Your Big Data Initiative with Ironcluster ...Big Data Goes Airborne. Propelling Your Big Data Initiative with Ironcluster ...
Big Data Goes Airborne. Propelling Your Big Data Initiative with Ironcluster ...
 
EMC Big Data Solutions Overview
EMC Big Data Solutions OverviewEMC Big Data Solutions Overview
EMC Big Data Solutions Overview
 
Hadoop Summit Amsterdam 2013 - Making Hadoop Ready for Prime Time - Syncsort ...
Hadoop Summit Amsterdam 2013 - Making Hadoop Ready for Prime Time - Syncsort ...Hadoop Summit Amsterdam 2013 - Making Hadoop Ready for Prime Time - Syncsort ...
Hadoop Summit Amsterdam 2013 - Making Hadoop Ready for Prime Time - Syncsort ...
 
Ibm db2 big sql
Ibm db2 big sqlIbm db2 big sql
Ibm db2 big sql
 
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
 
Accelerating Big Data Insights
Accelerating Big Data InsightsAccelerating Big Data Insights
Accelerating Big Data Insights
 
Webinar future dataintegration-datamesh-and-goldengatekafka
Webinar future dataintegration-datamesh-and-goldengatekafkaWebinar future dataintegration-datamesh-and-goldengatekafka
Webinar future dataintegration-datamesh-and-goldengatekafka
 
Azure Cafe Marketplace with Hortonworks March 31 2016
Azure Cafe Marketplace with Hortonworks March 31 2016Azure Cafe Marketplace with Hortonworks March 31 2016
Azure Cafe Marketplace with Hortonworks March 31 2016
 
LF Collab Summit 2015: ARM Servers for the Next Generation Date Center and Cl...
LF Collab Summit 2015: ARM Servers for the Next Generation Date Center and Cl...LF Collab Summit 2015: ARM Servers for the Next Generation Date Center and Cl...
LF Collab Summit 2015: ARM Servers for the Next Generation Date Center and Cl...
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
 

Dernier

BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...amitlee9823
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramMoniSankarHazra
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...amitlee9823
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...amitlee9823
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceDelhi Call girls
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...amitlee9823
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxolyaivanovalion
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangaloreamitlee9823
 

Dernier (20)

BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptx
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 

EMC Isilon Database Converged deck

  • 1. 1 Data & Analytics Convergence Keith Manthey, CTO Analytics Property of EMC. Not for further distribution
  • 2. 2 Source: EMC Digital Universe with Research and Analysis by IDC, The Digital Universe of Opportunities: Rich Data and the Increasing Value of the Internet of Things, April 2014. 2020 4.4ZETTABYTES 44ZETTABYTES 10xMORE DigitalUniverse 2014 2013 ZETTABYTE = 1,000,000,000,000,000,000,000 bytes 34.4 Billion 32GB Smartphones =1 ZETTABYTE 34.4 Billion Samsung S5’s end-to-end would circle the Earth 121.8 times Property of EMC. Not for further distribution
  • 3. 3
  • 4. 4 © Copyright 2015 EMC Corporation. All rights reserved. 30B DEVICES 7B PEOPLE 1M+ NEW BUSINESSES Source: Gartner Group, 2014 © Copyright 2015 EMC Corporation. All rights reserved. 2020: A NEW DIGITAL WORLD Property of EMC. Not for further distribution
  • 5. 5 PRECISION FARMING DRESS THAT DISPLAYS HOW WE FEEL CONTACT LENS THAT CONTROLS BLOOD SUGAR THERMOSTAT THAT KNOWS YOU’RE AWAY FITNESS BAND THAT MEASURES ACTIVITY LEVEL GLASSES THAT DIRECT US WHERE TO GO DRONES THAT DELIVER OUR GROCERIES DIGITIZATION IS ALREADY BEGINNING Property of EMC. Not for further distribution
  • 6. 6 © Copyright 2015 EMC Corporation. All rights reserved. Many Industries Face Structural Change Property of EMC. Not for further distribution
  • 7. 7 Analytics is about Data & Outcomes… Property of EMC. Not for further distribution
  • 8. 8 Macro Market Trends Courtesy of Wikibon Courtesy of Infoworld Property of EMC. Not for further distribution
  • 9. 9 Reasons for Change? SKILLS Operations Growth of Data Type Property of EMC. Not for further distribution
  • 10. 10 Philosophical - Database Cache Logs System Processes (including Logical - Catalog + Physical Structures – Reader/Writers) Data Storage Instance Traditional DB Assumes: • Query < 5% of Data • Schema on Write (Structured) • All data confirms to Schema (changes to versioned data if schema changes) • Limited to compute methods (SQL, UDF, and R soon*) Property of EMC. Not for further distribution
  • 11. 11 Philosophical - Hadoop Spark MapReduce HDFS (including Logical – Name Node+ Physical Structures – Data Node) Data Storage YARN Hadoop Built for: • Query 100% of Data each time • Schema on Read (including multiple versions over time) • Unlimited in compute methods (SQL, Programmatic, Tools(Spark, Storm, R…)) Property of EMC. Not for further distribution
  • 12. 12 Comparison Spark MapReduce HDFS (including Logical – Name Node+ Physical Structures – Data Node) Data Storage YARN Cache Logs System Processes (including Logical - Catalog + Physical Structures – Reader/Writers) Data Storage Instance SCALE UP – More CPUS/Memory Vs SCALE OUT – More Nodes SCALE OUT – More Nodes Property of EMC. Not for further distribution
  • 14. 14 What is the DB Convergence Play? Per Microsoft, “PolyBase is a T-SQL front end that allows customers to query data stored in HDFS” Microsoft Polybase - Click here for original IBM's Big SQL Product Overview Property of EMC. Not for further distribution
  • 15. 15 But… Hadoop is about DAS Property of EMC. Not for further distribution
  • 16. 16 Data Locality – Per Eric Brewer… MSFT Research Link U. Cal Berkeley Original Link / Paper Property of EMC. Not for further distribution
  • 17. 17 Who is Eric Brewer? • Eric Brewer is a UC Berkeley Professor who happens to be currently on sabbatical working with Google (VP of Infrastructure). • He proposed the CAP Theorem in 1990 • Google records 40K hits on “Brewer’s Theorem Proofs” Property of EMC. Not for further distribution
  • 18. 18 It’s all Hadoop? • Per Mike Olson at 2015 Strataconf, Hadoop is really disappearing, with the real importance of discussion on the applications on top of the platform • It’s about Outcomes and use cases. As a result, Machine Learning & Spark are gaining all the glory – “How Old” Presentation from Strataconf – IBM commits 3.5K associates to Apache Spark – Microsoft buy Revolution Analytics to bring Machine Learning to Databases Property of EMC. Not for further distribution
  • 19. 19 What has transpired with Hadoop? • Cloudera has cracked into the Operational Data Store and Data Warehouse Gartner Quads. This has long been held by traditional RDBMS entrants. • Increased investment from Hadoop vendors around items like Kudu and LLAP targeting OLTP workloads. • Creation of a converged ACID Compliant RDBMS on Hadoop Property of EMC. Not for further distribution
  • 20. 20 Keith’s Predictions • More Enterprise Patterns for Hadoop: – Companies are running out of data center and network space. The push for denser footprints are emerging – Operations drives better reference architectures that match their support model – More focus on Interactive Queries and real time processing – More converged pushes from other parties like Splice – More use cases driving more adoption, but less about Hadoop • More Unstructured Data Support / Analytics for Databases & ACID compliance upon Hadoop. – To Quote Willie Sutton: “It’s where the money is…” Property of EMC. Not for further distribution
  • 21. 21 Why does EMC Care? • Enterprise Standard Storage Technology supporting the World’s Databases • Largest Enterprise Storage Vendor for Hadoop Platforms (Isilon) – Certified with Hortonworks and Cloudera, along with Pivotal and IBM Big Insights • Bring ease of use to difficult platform and ease of convergence on products like Polybase. Property of EMC. Not for further distribution
  • 22. 22 Appendix Property of EMC. Not for further distribution
  • 23. 23 © Copyright 2015 EMC Corporation. All rights reserved. Ethernet Hadoop Architecture – DAS vs Isilon NameNode Ethernet Compute Node Compute Node Compute Node Compute NodeCompute Node Compute Node name node name node name node datanode Data Node + Compute Node Data Node + Compute Node Data Node + Compute Node Data Node + Compute Node Data Node + Compute Node Data Node + Compute Node Property of EMC. Not for further distribution
  • 24. 24 Traditional Hadoop POD 18 racks Extended Time-to-Results •Requires Additional “Data Staging” Storage •Iterative Testing is Time Consuming •Requires Copying of Data Several Times Rigid Architecture •Inefficient Floor Space •Must Purchase Compute & Storage Together •Storage Efficiency < 25% Lacks Enterprise Features •No Disaster Recovery, Snapshots •Single Protocol (HDFS Only) •Lacks Full Security Features 42U SERIES SERIES SERIES SERIES SERIES SERIES SERIES SERIES SERIES SERIES = ~ 5PB Usable Hadoop Storage Isilon vHadoop (no staging needed) Hadoop POD: Compute with Staging Storage Isilon vHadoop 8 racks Faster Time-to-Results •Data Stays on the Isilon Cluster •Allows for Rapid Iterative Testing Process •Simplifies Hosting Workflow Flexible Architecture •Efficient Floor Space, Power & Cooling •Leverage VMs for Flexible Deployments •Storage Efficiency > 78% Enterprise Capabilities •Disaster Recovery, Snapshots •SyncIQ-Data Replication Offsite •Highly Secure Hosting Environment 42U SERIES SERIES SERIES SERIES SERIES SERIES SERIES SERIES SERIES SERIES 42U SERIES SERIES SERIES SERIES SERIES SERIES SERIES SERIES SERIES SERIES 42U SERIES SERIES SERIES SERIES SERIES SERIES SERIES SERIES SERIES SERIES 42U SERIES SERIES SERIES SERIES SERIES SERIES SERIES SERIES SERIES SERIES Property of EMC. Not for further distribution
  • 25. 25 © Copyright 2015 EMC Corporation. All rights reserved. 1TB Hadoop Job Cycle Comparison Isilon Significantly Reduces Time To Results Traditional Hadoop+DAS 17:32 30:18 20:5020:50 Isilon Enabled Hadoop 18:51 Terasort Test on 1TB DAS Isilon Benefit MB/s Per Node 55.00 85.00 55% Compute Min 30.18 18.51 -39% TTR Min 89.30 18.51 -79% Isilon Advantages • Eliminates All Data Movement • Allows for Virtualized Compute • Significantly Less Cost • 79% Faster TTR! TTR- 89.3 Minutes! Property of EMC. Not for further distribution

Notes de l'éditeur

  1. Just to put the exponential growth of the digital universe into context… Like the physical universe, the digital universe is large – doubling in size every two years, and by 2020 the digital universe – the data we create and copy annually – will reach 44 zettabytes, or 44 trillion gigabytes – containing nearly as many digital bits as there are stars in the universe. If the Digital Universe were represented by the memory in a stack of tablets, in 2013 it would have stretched two-thirds of the way to the Moon. By 2020, there would be 6.6 stacks from the Earth to the Moon. With this much data floating around, we need structure to sort it, make sense of it, and tell the story. That’s where data visualization comes in.
  2. We live in an amazing time, but looking forward to 2020… Estimated 30B – 200B devices 7 billion people 1 million new businesses from where we are today These people using these devices within these businesses are constantly connected This gives rise to new ways of doing business: new disruptive technology, new disruptive business models <CLICK>
  3. We’re already starting to see this today Looking at the likes of Nest: a thermostat that knows when you are in and out of your house, and can regulate the temperature in your home much more efficiently than ever done in the past Wearables such as Fitbits, Jawbones and the like There’s sports clothing companies that come to us that say they think in 10 years they will be more of a software company, with clothing that contains embedded telemetric devices, that communicate not just who they are, and where they are, but what time they get up, when they eat, when they sweat. Sports companies will know almost everything about you, whereas in the past they’ve known almost nothing about you. Another example: contact lenses that regulate blood sugar And another: intelligent machines. Let’s drill in on that for a bit… <CLICK>
  4. Many industries are facing massive change. The thing that is driving the change is software and new applications – mobile and web applications – that create new possibilities. These are just a few examples: Nest is a software-defined thermostat. Thermostat’s entire job is measure temperature in a range and send a current to turn on and off the furnace / ac when the temperature is out of range. But, Nest built a thermostat with a web application to control it from anywhere and intelligence in the thermostat to recognize patterns and even know when you are home, so it can automatically adjust the temperature for you. That innovation is why Google bought Nest for $3.2B in Case in 2014. Tesla is a software-defined car. A mobile app allows you to control the car from anywhere, turning on the AC/heat before you arrive, opening and closing doors. They can also improve the car’s capabilities and efficiency by upgrading the car’s software instead of forcing you to get a new car. Uber allows you to call a towncar from any location. You call the car, the car shows up, you get in, tell them where you are going and get out. Your credit card is automatically hit, then you rate the driver and they rate you. This has turned the taxi industry on it’s ear. The entertainment industry is another big change. In the 80s, we all went to Blockbuster and hoped our new release was available. In the 2000s, they started redbox, which really hurt Blockbuster. Now you simply sit at home and everyone can watch the new release on the same day it comes out, streaming in to the home. Blockbuster is gone. Redboxes are fading. It’s all online streaming to your TV, your phone, your tablet…
  5. What all products have figured out is its about Outcomes. A client won’t install Hadoop just to buy some servers. They are hoping (realistically or not) that they can improve something of their business
  6. Why would Hadoop want to move towards Enterprise Storage Reference Architectures or Why would Databases w/ Enterprise Storage Reference Architectures move towards Hadoop Vendors Adoption for Hadoop are based upon available skills and operational support
  7. For SQL Databases, all of the data growth is mainly unstructured For Hadoop, its hard for companies to get started due to operations and lack of skills with their incumbent talent pool.