SlideShare une entreprise Scribd logo
1  sur  21
1
Intel and Cloudera:
Accelerating Enterprise Big Data Success
Alan Saldich | VP of Marketing | Cloudera
Ron Kasabian | VP of Big Data Solutions | Intel
2
Big Picture: Datacenter Inflection
Cluster to Cloud
ASIC to IA/Fabric3
Big Data4
Physical to Virtual
SW-only to HW-assisted2
2010 2011 2012 2013
Public
Private
2008 2009 2010 2011 2012 2013
Virtualized
Nonvirtualized
RISC to IA
UNIX to Linux
1
Linux/x86 Units
UNIX/RISC units
2000 20132001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012
0
“In 2000 Intel saw Linux coming &
invested in heavily in Red Hat; in 2005
we saw virtualization happening and
invested in VMware; in 2008 we
started investing heavily in hyper-scale
computing.
We think big data & Hadoop
will dwarf all of them.”
Diane Bryant, SVP & GM
Data Center Group, Intel
3
Intel, the Open-Source Software Company
~10k SW Developers, 35+ Sites
3
Commercial
ecosystem
Academic
research
Customer
solutions
#2 Linux
contributor
100010
011011
Open building
blocks
Industry
standards
Tools and
resources
4
Research
Benchmarking
Tuning
Optimization
Product
History of Intel and Apache Hadoop*
2009 2014
Open Cirrus*
HiBench
Release IDH 1.0
(2011)
* Other names and brands may be claimed as the property of others.
Release IDH 2.0
(2012)Telco Smart City
Web
RetailHealthcare
Release IDP 3.1
(2014)
5
Delivering Big Data Solutions
Consumer Behavior Security &
Risk Management
Operational
Efficiency
Location Aware
Ad Placement
Buyer Protection
Program
Personalized
Preventive Care
Claim Fraud
Reduction
Traffic
Optimization
Smart Energy
Grid
6
The Big Data Platform
Analytic Tools and Utilities
Data
Servers Storage Network
* Other names and brands may be claimed as the property of others.
Services
Big Data Platform
Ecosystem of
Verticals
Scalable Data &
Analytics Platform
Composable
Resource
Pools
7
+
8
Accelerating Enterprise
Big Data Success
9
Intel-Cloudera Strategic Alliance
Advance the data management industry by:
1. Combining the strengths’ of IDH and CDH
2. Driving adoption through standardization and open
source innovation
3. Delivering a Moore’s Law multiplier => 10X
improvement in price / performance above and beyond
Moore’s law
Industry leading CDH is superset of CDH and IDH features
10
Cloudera Company Snapshot
©2014 Cloudera, Inc. All rights reserved.
Founded 2008, by former employees of
Employees Today Over 600
World Class Support 24x7 Global Staff
Pro-active & Predictive Support Programs
Mission Critical Thousands of Enterprise Users
Over 350 Paying Subscription Customers
The Largest Ecosystem Over 1,000 Partners
Cloudera University Over 40,000 Trained
Open Source Leaders Cloudera Employees are Leading Developers & Contributors.
Our collaboration with Intel has helped define the market.
11
A Strong Track Record of Innovation
2008
CLOUDERA FOUNDED
BY MIKE OLSON
AMR AWADALLAH &
JEFF HAMMERBACHER
2009
HADOOP CREATOR
DOUG CUTTING
JOINS CLOUDERA
2009
CLOUDERA RELEASES CDH
THE FIRST COMMERCIAL
APACHE HADOOP
DISTRIBUTION
2010
CLOUDERA MANAGER:
FIRST MANAGEMENT
APPLICATION FOR
HADOOP
2011
CLOUDERA
REACHES 100
PRODUCTION
CUSTOMERS
2011
CLOUDERA
UNIVERSITY
EXPANDS TO 140
COUNTRIES
2012
CLOUDERA ENTERPRISE 4
THE STANDARD FOR
HADOOP IN THE
ENTERPRISE
2012
CLOUDERA
CONNECT REACHES
300 PARTNERS
2014
THE ENTERPRISE
DATA HUB
LAUNCHED
2013
CLOUDERA IMPALA
CLOUDERA NAVIGATOR
CLOUDERA SEARCH
2013
TOM REILLY JOINS AS CEO
OVER 800 PARTNERS
IN CLOUDERA CONNECT
2014
SERIES F FUNDING WITH
INTEL AS KEY PARTNER
OVER 900 PARTNERS
IN CLOUDERA CONNECT
2014
CLOUDERA
ENTERPRISE 5
CDH
Cloudera
Manager
CLOUDERA
ENTERPRISE
4
ASK BIGGER
QUESTIONS
ENTERPRISE
DATA HUB
CLOUDERA
ENTERPRISE
5
12
Expanding Big Data Requires A New Approach
1980s
Bring Data to Compute
Now
Bring Compute to Data
Relative size & complexity
Data
Information-centric
businesses use all data:
Multi-structured,
internal & external data
of all types
Compute
Compute
Compute
Process-centric
businesses use:
• Structured data mainly
• Internal data only
• “Important” data only
Compute
Compute
Compute
Data
Data
Data
Data
13
Hadoop Changes the Game:
Storage and Compute on One Platform
The Old Way
Expensive & Unattainable
The Hadoop Way
Affordable & Attainable
14
The Old Way: Bringing Data to Compute
Complex Architecture
• Many special-purpose
systems
• Moving data around
• No complete views
4
Missing Data
• Leaving data behind
• Risk and compliance
• High cost of storage
1
Time to Data
• Up-front modeling
• Transforms slow
• Transforms lose data
2
Cost of Analytics
• Existing systems strained
• No agility
• “BI backlog”
3
SERVERSMARTSEDWS DOCUMENTS STORAGE SEARCH ARCHIVE
ERP, CRM, RDBMS, MACHINES FILES, IMAGES, VIDEOS, LOGS, CLICKSTREAMS EXTERNAL DATA SOURCES
15
SERVERS MARTS EDWS DOCUMENTS STORAGE SEARCH ARCHIVE
ERP, CRM, RDBMS, MACHINES FILES, IMAGES, VIDEOS, LOGS, CLICKSTREAMS EXTERNAL DATA SOURCES
The New Way: Bringing Compute to Data
Diverse Analytic Platform
• Bring applications to data
• Combine different workloads on
common data (i.e. SQL + Search)
• True analytic agility
4
1
2
3 4
Active Compliance Archive
• Full fidelity original data
• Indefinite time, any source
• Lowest cost storage
1
Persistent Staging
• One source of data for all analytics
• Persist state of transformed data
• Significantly faster & cheaper
2
Self-Service Exploratory BI
• Simple search + BI tools
• “Schema on read” agility
• Reduce BI user backlog requests
3
16
Open Source
Scalable
Flexible
Cost-Effective
✔
Managed
Open Architecture
Secure and
Governed
From Hadoop to an Enterprise Data Hub
✔
✔
✔
BATCH
PROCESSING
ANALYTIC
SQL
SEARCH
ENGINE
MACHINE
LEARNING
STREAM
PROCESSING
3RD PARTY
APPS
WORKLOAD MANAGEMENT
STORAGE FOR ANY TYPE OF DATA
UNIFIED, ELASTIC, RESILIENT, SECURE
DATA
MANAGEMENT
SYSTEM
MANAGEMENT
CLOUDERA’S ENTERPRISE DATA HUB
Filesystem Online NoSQL
✔
17
Discover New Use Cases
ON-LINE SERVICES /
SOCIAL MEDIA
People & career
matching
Website
optimization
HEALTH CARE
Patient sensors,
monitoring,
EHRs Quality
of care
FINANCIAL SERVICES
Risk & portfolio
analysis
New products
MEDIA /
ENTERTAINMENT
Viewers /
advertising
effectiveness
CONSUMER
PACKAGED GOODS
Sentiment
analysis of
what’s hot,
customer service
TRAVEL & TRANSPORTATION
Sensor analysis for
optimal traffic flows
Customer
sentiment
RETAIL
Consumer sentiment
Optimized
marketing
LAW ENFORCEMENT
& DEFENSE
Threat analysis,
Social media
monitoring,
Photo analysis
EDUCATION
& RESEARCH
Experiment
sensor
analysis
LIFE SCIENCES
Clinical trials
Genomics
AUTOMOTIVE
Auto sensors
reporting location,
problems
COMMUNICATIONS
Location-
based
advertising
HIGH TECHNOLOGY /
INDUSTRIAL MFG.
Mfg quality
Warranty
analysis
UTILITIES
Smart Meter
analysis for
network
capacity
OIL & GAS
Drilling
exploration
sensor
analysis
18
Converge on One Open Source Platform
©2014 Cloudera, Inc. All rights reserved.
• Most stable, compatible, and mature
Hadoop distribution
• Leading SQL functionality &
performance (Impala)
• Deepest management and governance
capabilities
• 150 Hadoop developers
• More than 80 committers
• The only distribution with performance
and security enhanced from the silicon
up
• Leading security capabilities including
encryption, access control, and auditing
• Long-standing commitment to open
source with 1000 developers working on
Linux, KVM, Xen, Java, OpenStack,
Hadoop
• 50 Hadoop developers and 12
committers
19
Ensuring Cloudera runs best on Intel Architecture
• Encryption (AES-NI)
• Compression (SSE 4.2)
• Math (MKL)
Software & Silicon co-evolve to deliver dramatic gains
1 Push compute-
intensive work down
to the silicon
Increase main
memory utilization up
to 20X
Design for rack-
scale architecture
200:1
10:1
Improve Disk:Memory
2 3
20
Faster Insights, Better Security, & Less Complexity
•Maintain an open horizontal platform for big data
•Continue to enhance Apache Hadoop and related projects
Accelerate innovation via open source software
•Optimize performance across compute, storage, & network
•Ensure platform security, enhanced by hardware
Enable Hadoop to run best on IA
•Establish usage models and industry standard benchmarks
•Develop reference architectures and industry-wide solutions
Foster evolution of big data ecosystem
21
Thank You!
Alan Saldich | Cloudera
Ron Kasabian | Intel

Contenu connexe

Tendances

Simplifying Hadoop with RecordService, A Secure and Unified Data Access Path ...
Simplifying Hadoop with RecordService, A Secure and Unified Data Access Path ...Simplifying Hadoop with RecordService, A Secure and Unified Data Access Path ...
Simplifying Hadoop with RecordService, A Secure and Unified Data Access Path ...
Cloudera, Inc.
 
Actian Vector on Hadoop: First Industrial-strength DBMS to Truly Leverage Hadoop
Actian Vector on Hadoop: First Industrial-strength DBMS to Truly Leverage HadoopActian Vector on Hadoop: First Industrial-strength DBMS to Truly Leverage Hadoop
Actian Vector on Hadoop: First Industrial-strength DBMS to Truly Leverage Hadoop
DataWorks Summit
 
Configuring a Secure, Multitenant Cluster for the Enterprise
Configuring a Secure, Multitenant Cluster for the EnterpriseConfiguring a Secure, Multitenant Cluster for the Enterprise
Configuring a Secure, Multitenant Cluster for the Enterprise
Cloudera, Inc.
 

Tendances (20)

Cloudera Data Science Workbench: sparklyr, implyr, and More - dplyr Interfac...
 Cloudera Data Science Workbench: sparklyr, implyr, and More - dplyr Interfac... Cloudera Data Science Workbench: sparklyr, implyr, and More - dplyr Interfac...
Cloudera Data Science Workbench: sparklyr, implyr, and More - dplyr Interfac...
 
快速数据快速分析引擎-Kudu
快速数据快速分析引擎-Kudu快速数据快速分析引擎-Kudu
快速数据快速分析引擎-Kudu
 
Where to Deploy Hadoop: Bare Metal or Cloud?
Where to Deploy Hadoop: Bare Metal or Cloud? Where to Deploy Hadoop: Bare Metal or Cloud?
Where to Deploy Hadoop: Bare Metal or Cloud?
 
Five Tips for Running Cloudera on AWS
Five Tips for Running Cloudera on AWSFive Tips for Running Cloudera on AWS
Five Tips for Running Cloudera on AWS
 
YARN Containerized Services: Fading The Lines Between On-Prem And Cloud
YARN Containerized Services: Fading The Lines Between On-Prem And CloudYARN Containerized Services: Fading The Lines Between On-Prem And Cloud
YARN Containerized Services: Fading The Lines Between On-Prem And Cloud
 
Getting Apache Spark Customers to Production
Getting Apache Spark Customers to ProductionGetting Apache Spark Customers to Production
Getting Apache Spark Customers to Production
 
Simplifying Hadoop with RecordService, A Secure and Unified Data Access Path ...
Simplifying Hadoop with RecordService, A Secure and Unified Data Access Path ...Simplifying Hadoop with RecordService, A Secure and Unified Data Access Path ...
Simplifying Hadoop with RecordService, A Secure and Unified Data Access Path ...
 
Solr consistency and recovery internals
Solr consistency and recovery internalsSolr consistency and recovery internals
Solr consistency and recovery internals
 
Hadoop on Cloud: Why and How?
Hadoop on Cloud: Why and How?Hadoop on Cloud: Why and How?
Hadoop on Cloud: Why and How?
 
Apache Hadoop 3
Apache Hadoop 3Apache Hadoop 3
Apache Hadoop 3
 
One Hadoop, Multiple Clouds - NYC Big Data Meetup
One Hadoop, Multiple Clouds - NYC Big Data MeetupOne Hadoop, Multiple Clouds - NYC Big Data Meetup
One Hadoop, Multiple Clouds - NYC Big Data Meetup
 
Data Science and Machine Learning for the Enterprise
Data Science and Machine Learning for the EnterpriseData Science and Machine Learning for the Enterprise
Data Science and Machine Learning for the Enterprise
 
What the Enterprise Requires - Business Continuity and Visibility
What the Enterprise Requires - Business Continuity and VisibilityWhat the Enterprise Requires - Business Continuity and Visibility
What the Enterprise Requires - Business Continuity and Visibility
 
Hybrid is the New Normal
Hybrid is the New NormalHybrid is the New Normal
Hybrid is the New Normal
 
大数据数据治理及数据安全
大数据数据治理及数据安全大数据数据治理及数据安全
大数据数据治理及数据安全
 
Cloudbreak - Technical Deep Dive
Cloudbreak - Technical Deep DiveCloudbreak - Technical Deep Dive
Cloudbreak - Technical Deep Dive
 
Actian Vector on Hadoop: First Industrial-strength DBMS to Truly Leverage Hadoop
Actian Vector on Hadoop: First Industrial-strength DBMS to Truly Leverage HadoopActian Vector on Hadoop: First Industrial-strength DBMS to Truly Leverage Hadoop
Actian Vector on Hadoop: First Industrial-strength DBMS to Truly Leverage Hadoop
 
A deep dive into running data analytic workloads in the cloud
A deep dive into running data analytic workloads in the cloudA deep dive into running data analytic workloads in the cloud
A deep dive into running data analytic workloads in the cloud
 
Cloudera Altus: Big Data in the Cloud Made Easy
Cloudera Altus: Big Data in the Cloud Made EasyCloudera Altus: Big Data in the Cloud Made Easy
Cloudera Altus: Big Data in the Cloud Made Easy
 
Configuring a Secure, Multitenant Cluster for the Enterprise
Configuring a Secure, Multitenant Cluster for the EnterpriseConfiguring a Secure, Multitenant Cluster for the Enterprise
Configuring a Secure, Multitenant Cluster for the Enterprise
 

En vedette

En vedette (20)

Women in Big Data | Mike Olson
Women in Big Data | Mike OlsonWomen in Big Data | Mike Olson
Women in Big Data | Mike Olson
 
Alberto Degradi - Big Data: grande sfida e grande opportunità - Digital for B...
Alberto Degradi - Big Data: grande sfida e grande opportunità - Digital for B...Alberto Degradi - Big Data: grande sfida e grande opportunità - Digital for B...
Alberto Degradi - Big Data: grande sfida e grande opportunità - Digital for B...
 
IOT
IOTIOT
IOT
 
Chicago Data Summit: Cloudera's Distribution including Apache Hadoop & Cloude...
Chicago Data Summit: Cloudera's Distribution including Apache Hadoop & Cloude...Chicago Data Summit: Cloudera's Distribution including Apache Hadoop & Cloude...
Chicago Data Summit: Cloudera's Distribution including Apache Hadoop & Cloude...
 
How to tackle big data from a security
How to tackle big data from a securityHow to tackle big data from a security
How to tackle big data from a security
 
Big Data
Big DataBig Data
Big Data
 
7 Characteristics of a Bad (Big) Data Platform
7 Characteristics of a Bad (Big) Data Platform7 Characteristics of a Bad (Big) Data Platform
7 Characteristics of a Bad (Big) Data Platform
 
AWS Big Data Platform
AWS Big Data PlatformAWS Big Data Platform
AWS Big Data Platform
 
The role of Big Data and Modern Data Management in Driving a Customer 360 fro...
The role of Big Data and Modern Data Management in Driving a Customer 360 fro...The role of Big Data and Modern Data Management in Driving a Customer 360 fro...
The role of Big Data and Modern Data Management in Driving a Customer 360 fro...
 
Hadoop benchmark: Evaluating Cloudera, Hortonworks, and MapR
Hadoop benchmark: Evaluating Cloudera, Hortonworks, and MapRHadoop benchmark: Evaluating Cloudera, Hortonworks, and MapR
Hadoop benchmark: Evaluating Cloudera, Hortonworks, and MapR
 
Leveraging SAP, Hadoop, and Big Data to Redefine Business
Leveraging SAP, Hadoop, and Big Data to Redefine BusinessLeveraging SAP, Hadoop, and Big Data to Redefine Business
Leveraging SAP, Hadoop, and Big Data to Redefine Business
 
Leverage Big Data to Enhance Customer Experience in Telecommunications – with...
Leverage Big Data to Enhance Customer Experience in Telecommunications – with...Leverage Big Data to Enhance Customer Experience in Telecommunications – with...
Leverage Big Data to Enhance Customer Experience in Telecommunications – with...
 
Big Data Solutions Executive Overview
Big Data Solutions Executive OverviewBig Data Solutions Executive Overview
Big Data Solutions Executive Overview
 
Big Data : Risks and Opportunities
Big Data : Risks and OpportunitiesBig Data : Risks and Opportunities
Big Data : Risks and Opportunities
 
Big data analysis concepts and references by Cloud Security Alliance
Big data analysis concepts and references by Cloud Security AllianceBig data analysis concepts and references by Cloud Security Alliance
Big data analysis concepts and references by Cloud Security Alliance
 
WSO2 Big Data Analytics Platform
WSO2 Big Data Analytics PlatformWSO2 Big Data Analytics Platform
WSO2 Big Data Analytics Platform
 
How to design a linear control system
How to design a linear control systemHow to design a linear control system
How to design a linear control system
 
Hadoop - Introduzione all’architettura ed approcci applicativi
Hadoop - Introduzione all’architettura ed approcci applicativiHadoop - Introduzione all’architettura ed approcci applicativi
Hadoop - Introduzione all’architettura ed approcci applicativi
 
Introduzione ai Big Data e alla scienza dei dati - Big Data
Introduzione ai Big Data e alla scienza dei dati - Big DataIntroduzione ai Big Data e alla scienza dei dati - Big Data
Introduzione ai Big Data e alla scienza dei dati - Big Data
 
FANTIN BIG DATA (1)
FANTIN BIG DATA (1)FANTIN BIG DATA (1)
FANTIN BIG DATA (1)
 

Similaire à Intel and Cloudera: Accelerating Enterprise Big Data Success

Gab Genai Cloudera - Going Beyond Traditional Analytic
Gab Genai Cloudera - Going Beyond Traditional Analytic Gab Genai Cloudera - Going Beyond Traditional Analytic
Gab Genai Cloudera - Going Beyond Traditional Analytic
IntelAPAC
 
The Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubThe Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data Hub
Cloudera, Inc.
 

Similaire à Intel and Cloudera: Accelerating Enterprise Big Data Success (20)

MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, ClouderaMongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
 
The Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubThe Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data Hub
 
Gab Genai Cloudera - Going Beyond Traditional Analytic
Gab Genai Cloudera - Going Beyond Traditional Analytic Gab Genai Cloudera - Going Beyond Traditional Analytic
Gab Genai Cloudera - Going Beyond Traditional Analytic
 
Hadoop and Manufacturing
Hadoop and ManufacturingHadoop and Manufacturing
Hadoop and Manufacturing
 
The Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubThe Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data Hub
 
Big Data: Myths and Realities
Big Data: Myths and RealitiesBig Data: Myths and Realities
Big Data: Myths and Realities
 
Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8
 
Cloudera Big Data Integration Speedpitch at TDWI Munich June 2017
Cloudera Big Data Integration Speedpitch at TDWI Munich June 2017Cloudera Big Data Integration Speedpitch at TDWI Munich June 2017
Cloudera Big Data Integration Speedpitch at TDWI Munich June 2017
 
Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016
 
SQL + Hadoop: The High Performance Advantage�
SQL + Hadoop:  The High Performance Advantage�SQL + Hadoop:  The High Performance Advantage�
SQL + Hadoop: The High Performance Advantage�
 
Getting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with BluemixGetting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with Bluemix
 
Forecast deploy3 100_ak2
Forecast deploy3 100_ak2Forecast deploy3 100_ak2
Forecast deploy3 100_ak2
 
Yahoo! Hack Europe
Yahoo! Hack EuropeYahoo! Hack Europe
Yahoo! Hack Europe
 
Build and Manage Hadoop & Oracle NoSQL DB Solutions- Impetus Webinar
Build and Manage Hadoop & Oracle NoSQL DB Solutions- Impetus WebinarBuild and Manage Hadoop & Oracle NoSQL DB Solutions- Impetus Webinar
Build and Manage Hadoop & Oracle NoSQL DB Solutions- Impetus Webinar
 
Actian Analytics Platform - Hadoop SQL Edition
Actian Analytics Platform - Hadoop SQL EditionActian Analytics Platform - Hadoop SQL Edition
Actian Analytics Platform - Hadoop SQL Edition
 
Cloud the current future v6
Cloud   the current future v6Cloud   the current future v6
Cloud the current future v6
 
CoreLogic Innovation Fueled By Cloud Foundry (Cloud Foundry Summit 2014)
CoreLogic Innovation Fueled By Cloud Foundry (Cloud Foundry Summit 2014)CoreLogic Innovation Fueled By Cloud Foundry (Cloud Foundry Summit 2014)
CoreLogic Innovation Fueled By Cloud Foundry (Cloud Foundry Summit 2014)
 
Get started with Cloudera's cyber solution
Get started with Cloudera's cyber solutionGet started with Cloudera's cyber solution
Get started with Cloudera's cyber solution
 
How to Build Continuous Ingestion for the Internet of Things
How to Build Continuous Ingestion for the Internet of ThingsHow to Build Continuous Ingestion for the Internet of Things
How to Build Continuous Ingestion for the Internet of Things
 
Bridging the Big Data Gap in the Software-Driven World
Bridging the Big Data Gap in the Software-Driven WorldBridging the Big Data Gap in the Software-Driven World
Bridging the Big Data Gap in the Software-Driven World
 

Plus de Cloudera, Inc.

Plus de Cloudera, Inc. (20)

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptx
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the Platform
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18
 

Dernier

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Dernier (20)

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 

Intel and Cloudera: Accelerating Enterprise Big Data Success

  • 1. 1 Intel and Cloudera: Accelerating Enterprise Big Data Success Alan Saldich | VP of Marketing | Cloudera Ron Kasabian | VP of Big Data Solutions | Intel
  • 2. 2 Big Picture: Datacenter Inflection Cluster to Cloud ASIC to IA/Fabric3 Big Data4 Physical to Virtual SW-only to HW-assisted2 2010 2011 2012 2013 Public Private 2008 2009 2010 2011 2012 2013 Virtualized Nonvirtualized RISC to IA UNIX to Linux 1 Linux/x86 Units UNIX/RISC units 2000 20132001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 0 “In 2000 Intel saw Linux coming & invested in heavily in Red Hat; in 2005 we saw virtualization happening and invested in VMware; in 2008 we started investing heavily in hyper-scale computing. We think big data & Hadoop will dwarf all of them.” Diane Bryant, SVP & GM Data Center Group, Intel
  • 3. 3 Intel, the Open-Source Software Company ~10k SW Developers, 35+ Sites 3 Commercial ecosystem Academic research Customer solutions #2 Linux contributor 100010 011011 Open building blocks Industry standards Tools and resources
  • 4. 4 Research Benchmarking Tuning Optimization Product History of Intel and Apache Hadoop* 2009 2014 Open Cirrus* HiBench Release IDH 1.0 (2011) * Other names and brands may be claimed as the property of others. Release IDH 2.0 (2012)Telco Smart City Web RetailHealthcare Release IDP 3.1 (2014)
  • 5. 5 Delivering Big Data Solutions Consumer Behavior Security & Risk Management Operational Efficiency Location Aware Ad Placement Buyer Protection Program Personalized Preventive Care Claim Fraud Reduction Traffic Optimization Smart Energy Grid
  • 6. 6 The Big Data Platform Analytic Tools and Utilities Data Servers Storage Network * Other names and brands may be claimed as the property of others. Services Big Data Platform Ecosystem of Verticals Scalable Data & Analytics Platform Composable Resource Pools
  • 7. 7 +
  • 9. 9 Intel-Cloudera Strategic Alliance Advance the data management industry by: 1. Combining the strengths’ of IDH and CDH 2. Driving adoption through standardization and open source innovation 3. Delivering a Moore’s Law multiplier => 10X improvement in price / performance above and beyond Moore’s law Industry leading CDH is superset of CDH and IDH features
  • 10. 10 Cloudera Company Snapshot ©2014 Cloudera, Inc. All rights reserved. Founded 2008, by former employees of Employees Today Over 600 World Class Support 24x7 Global Staff Pro-active & Predictive Support Programs Mission Critical Thousands of Enterprise Users Over 350 Paying Subscription Customers The Largest Ecosystem Over 1,000 Partners Cloudera University Over 40,000 Trained Open Source Leaders Cloudera Employees are Leading Developers & Contributors. Our collaboration with Intel has helped define the market.
  • 11. 11 A Strong Track Record of Innovation 2008 CLOUDERA FOUNDED BY MIKE OLSON AMR AWADALLAH & JEFF HAMMERBACHER 2009 HADOOP CREATOR DOUG CUTTING JOINS CLOUDERA 2009 CLOUDERA RELEASES CDH THE FIRST COMMERCIAL APACHE HADOOP DISTRIBUTION 2010 CLOUDERA MANAGER: FIRST MANAGEMENT APPLICATION FOR HADOOP 2011 CLOUDERA REACHES 100 PRODUCTION CUSTOMERS 2011 CLOUDERA UNIVERSITY EXPANDS TO 140 COUNTRIES 2012 CLOUDERA ENTERPRISE 4 THE STANDARD FOR HADOOP IN THE ENTERPRISE 2012 CLOUDERA CONNECT REACHES 300 PARTNERS 2014 THE ENTERPRISE DATA HUB LAUNCHED 2013 CLOUDERA IMPALA CLOUDERA NAVIGATOR CLOUDERA SEARCH 2013 TOM REILLY JOINS AS CEO OVER 800 PARTNERS IN CLOUDERA CONNECT 2014 SERIES F FUNDING WITH INTEL AS KEY PARTNER OVER 900 PARTNERS IN CLOUDERA CONNECT 2014 CLOUDERA ENTERPRISE 5 CDH Cloudera Manager CLOUDERA ENTERPRISE 4 ASK BIGGER QUESTIONS ENTERPRISE DATA HUB CLOUDERA ENTERPRISE 5
  • 12. 12 Expanding Big Data Requires A New Approach 1980s Bring Data to Compute Now Bring Compute to Data Relative size & complexity Data Information-centric businesses use all data: Multi-structured, internal & external data of all types Compute Compute Compute Process-centric businesses use: • Structured data mainly • Internal data only • “Important” data only Compute Compute Compute Data Data Data Data
  • 13. 13 Hadoop Changes the Game: Storage and Compute on One Platform The Old Way Expensive & Unattainable The Hadoop Way Affordable & Attainable
  • 14. 14 The Old Way: Bringing Data to Compute Complex Architecture • Many special-purpose systems • Moving data around • No complete views 4 Missing Data • Leaving data behind • Risk and compliance • High cost of storage 1 Time to Data • Up-front modeling • Transforms slow • Transforms lose data 2 Cost of Analytics • Existing systems strained • No agility • “BI backlog” 3 SERVERSMARTSEDWS DOCUMENTS STORAGE SEARCH ARCHIVE ERP, CRM, RDBMS, MACHINES FILES, IMAGES, VIDEOS, LOGS, CLICKSTREAMS EXTERNAL DATA SOURCES
  • 15. 15 SERVERS MARTS EDWS DOCUMENTS STORAGE SEARCH ARCHIVE ERP, CRM, RDBMS, MACHINES FILES, IMAGES, VIDEOS, LOGS, CLICKSTREAMS EXTERNAL DATA SOURCES The New Way: Bringing Compute to Data Diverse Analytic Platform • Bring applications to data • Combine different workloads on common data (i.e. SQL + Search) • True analytic agility 4 1 2 3 4 Active Compliance Archive • Full fidelity original data • Indefinite time, any source • Lowest cost storage 1 Persistent Staging • One source of data for all analytics • Persist state of transformed data • Significantly faster & cheaper 2 Self-Service Exploratory BI • Simple search + BI tools • “Schema on read” agility • Reduce BI user backlog requests 3
  • 16. 16 Open Source Scalable Flexible Cost-Effective ✔ Managed Open Architecture Secure and Governed From Hadoop to an Enterprise Data Hub ✔ ✔ ✔ BATCH PROCESSING ANALYTIC SQL SEARCH ENGINE MACHINE LEARNING STREAM PROCESSING 3RD PARTY APPS WORKLOAD MANAGEMENT STORAGE FOR ANY TYPE OF DATA UNIFIED, ELASTIC, RESILIENT, SECURE DATA MANAGEMENT SYSTEM MANAGEMENT CLOUDERA’S ENTERPRISE DATA HUB Filesystem Online NoSQL ✔
  • 17. 17 Discover New Use Cases ON-LINE SERVICES / SOCIAL MEDIA People & career matching Website optimization HEALTH CARE Patient sensors, monitoring, EHRs Quality of care FINANCIAL SERVICES Risk & portfolio analysis New products MEDIA / ENTERTAINMENT Viewers / advertising effectiveness CONSUMER PACKAGED GOODS Sentiment analysis of what’s hot, customer service TRAVEL & TRANSPORTATION Sensor analysis for optimal traffic flows Customer sentiment RETAIL Consumer sentiment Optimized marketing LAW ENFORCEMENT & DEFENSE Threat analysis, Social media monitoring, Photo analysis EDUCATION & RESEARCH Experiment sensor analysis LIFE SCIENCES Clinical trials Genomics AUTOMOTIVE Auto sensors reporting location, problems COMMUNICATIONS Location- based advertising HIGH TECHNOLOGY / INDUSTRIAL MFG. Mfg quality Warranty analysis UTILITIES Smart Meter analysis for network capacity OIL & GAS Drilling exploration sensor analysis
  • 18. 18 Converge on One Open Source Platform ©2014 Cloudera, Inc. All rights reserved. • Most stable, compatible, and mature Hadoop distribution • Leading SQL functionality & performance (Impala) • Deepest management and governance capabilities • 150 Hadoop developers • More than 80 committers • The only distribution with performance and security enhanced from the silicon up • Leading security capabilities including encryption, access control, and auditing • Long-standing commitment to open source with 1000 developers working on Linux, KVM, Xen, Java, OpenStack, Hadoop • 50 Hadoop developers and 12 committers
  • 19. 19 Ensuring Cloudera runs best on Intel Architecture • Encryption (AES-NI) • Compression (SSE 4.2) • Math (MKL) Software & Silicon co-evolve to deliver dramatic gains 1 Push compute- intensive work down to the silicon Increase main memory utilization up to 20X Design for rack- scale architecture 200:1 10:1 Improve Disk:Memory 2 3
  • 20. 20 Faster Insights, Better Security, & Less Complexity •Maintain an open horizontal platform for big data •Continue to enhance Apache Hadoop and related projects Accelerate innovation via open source software •Optimize performance across compute, storage, & network •Ensure platform security, enhanced by hardware Enable Hadoop to run best on IA •Establish usage models and industry standard benchmarks •Develop reference architectures and industry-wide solutions Foster evolution of big data ecosystem
  • 21. 21 Thank You! Alan Saldich | Cloudera Ron Kasabian | Intel