SlideShare a Scribd company logo
1 of 53
Download to read offline
1© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
The New Pivotal Big Data Suite
Jacque Istok
2© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
 Store Everything
 Analyze Anything
 Build the Right Thing
3© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Pivotal Enables
Hadoop Market Adoption
Data Lakes
Unify Unstructured and
Structured Data Access
Big Data Apps
Build analytic and
transaction-led
applications impacting
top line revenue
Data-Driven
Enterprise
App Dev and Operational
Management on HDFS
Data Architecture
ETL Offload
Accommodate massive
data growth with existing
EDW investments
4© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Pivotal Full Approach
It’s More Than Just Hadoop
5© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Big Data: Industry Perspective
Retail
• CRM – Customer Scoring
• Store Siting and Layout
• Fraud Detection / Prevention
• Supply Chain Optimization
Advertising & Public Relations
• Demand Signaling
• Ad Targeting
• Sentiment Analysis
• Customer Acquisition
Financial Services
• Algorithmic Trading
• Risk Analysis
• Fraud Detection
• Portfolio Analysis
Media & Telecommunications
• Network Optimization
• Customer Scoring
• Churn Prevention
• Fraud Prevention
Manufacturing
• Product Research
• Engineering Analytics
• Process and Quality Analysis
• Distribution Optimization
Energy
• Smart Grid
• Exploration
Government
• Market Governance
• Counter-Terrorism
• Econometrics
• Health Informatics
Healthcare & Life Sciences
• Pharmaco-Genomics
• Bio-Informatics
• Pharmaceutical Research
• Clinical Outcomes Research
6© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
How Pivotal Accelerates
Value Creation
70% of data
generated by
customers
80% of data
being stored
3% being
prepared for
analysis
0.5% being
analyzed
<0.5% being
operationalized
First
Movers
Smart
Enterprises
~20X
$2.9B
~30X$4
B
~7X
$290B
~20X
$120B
Average Enterprises
SOLVE THE BIG DATA UTILITY GAP
7© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Market Dynamics: Big Data Technologies
Applications
Analytics and
Discovery
Data Organization
and Management
Infrastructure
A new generation
of technologies and
architectures
that enable economical
high-velocity capture,
discovery and analysis
Pivotal
Data Labs
Source: IDC Predictions 2013: Big Data Battle for Dominance in the Intelligent Economy
8© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Journey to Data Driven Enterprise
Archive
•Realize cost
efficiencies and extend
life of existing systems
and
•Data migration
Insights
•Integrate all existing
data to generate
business insights
•Data Analysis
Apps
•Build Apps to
assist/take
(automated) actions
from the insights
generated
•Data Driven Apps
Business
Models
•Create new revenue
streams leveraging
new data and new
insights
•Business
Transformation
Repeatable
Framework
• Platform for
experimenting data
driven business
models and innovation
•Experimentation
Platform
Data Lake
Platform as a Service
Manager IT Leaders Business Leader CEO
STEPSTECHNOLOGYTARGET
9© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Data Driven: Harder Than it Sounds
Operationalize
Ingest
Distill
Interface
Process
Analytical Transactional
Operationalize
Ingest
Distill
Interface
Process
Analytical Transactional
Operationalize
Ingest
Distill
Interface
Process
Analytical Transactional
Real Time Near Real Time Batch
Predictive Call Routing, Fraud
Prediction, Dynamic Pricing,
Re-Marketing, Stream Analytics
Analytic Model Designs, Transaction
Analysis, Trend Analysis
ETL, Archive, Trending, Monthly and
Weekly Jobs
10© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Data Driven: Impossible in Silos
Finance Manufacturing Marketing IT
Data Growth Over 60%
Floods These Silos
11© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Generic Business Data Lake Architecture
Ingestion
Tier
Insights
Tier
Unified Operations Tier
System monitoring System management
Unified Data Management Tier
Data mgmt.
services
MDM
RDM
Audit and
policy mgmt.
Processing Tier
Workflow management
Distillation Tier
HDFS storage
Unstructured and structured data
In-memory
MPP database
Real-time
Micro batch
Mega batch
SQL
NoSQL
SQL
MapReduce
Query interfaces
SQL
Sources Action Tier
Real-time
ingestion
Micro batch
ingestion
Batch
ingestion
Real-time
insights
Interactive
insights
Batch
insights
12© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Pivotal Business Data Lake
Govern where it
matters
 Focus on MDM and RDM
 Enforce only when sharing
 Treat corporate as aggregation of local
Encourage local
requirements
 Let the business decide what they need
 Build from the bottom
 Enable traceability to source
 Disposable data views
Distill on demand
 Select only what you want
 Business friendly tooling
 Re-usable information maps
 Rapid change cycle
Store everything
 Store everything ‘as is’
 Include structured and unstructured data
 Store it cheaply
13© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Pivotal Business Data Lake Architecture
Ingestion
Tier
Insights
Tier
Unified Operations Tier
Pivotal Command Center
Unified Data Management Tier
Pivotal Data
Dispatch
MDM
RDM
Pivotal Data
Dispatch
Processing Tier
Spring XD, Oozie
Distillation Tier
Pivotal HD
Unstructured and structured data
Pivotal GemFire XD
GPDB / HAWQ
Pivotal
GemFire XD
Spring XD
Spring XD
Pivotal
GemFire XD
Data Loader
Sqoop
Flume
Spring XD
Data Loader
Pivotal
GemFire XD
HAWQ
HBase
HAWQ
MapReduce
Hive
Pig
Query interfaces
HAWQ
Pivotal
GemFire XD
HBase
Sources Action Tier
Clickstream
Sensor Data
Weblogs
Network
Data
CRM Data
ERP Data
Pivotal
GemFire
GPDB/HAWQ
Pivotal
RabbitMQ
Redis
Pivotal CF
14© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Pivotal Business Data Lake
Govern where it
matters
 Information governance
 MDM & RDM data integrated
 Information RADAR approach to identification
Encourage local
requirements
 HAWQ – Traditional disk-based structured SQL
 Pivotal GemFire XD – Fast in-memory database
 Pivotal GemFire XD – Real-time analytics and integration
Distill on demand
 HAWQ
 Structured SQL on Pivotal HD
 Pivotal Data Dispatch
 Data movement and transformation
Store everything
 Pivotal HD
 Low cost
 Simplified deployment
15© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
How is a Business Data Lake Different?
Business Data LakeCriteria EDW
Common data
model
Base class = standard data
Derived classes = local data
Single class = single view across the
enterprise
Data quality Full spectrum 1 0
0 1 01 0
0 1
0 1
1 1 0
Data integration
Multiple interfaces SQL, SAS, R, MapReduce, NoSQL
SQL access integration with SAS, R
and other analytical interfaces
Mixed workload
with varying QoS
Support low latency, interactive and
batch
Limited QoS separation required
16© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Components of a Business Data Lake
• Action
– Redis / Pivotal RabbitMQ
– Pivotal GemFire
– Pivotal CF
• Unified Data Management
– Pivotal Data Dispatch
• Unified Operations
– Pivotal Command Center
• Storage
– Structured
– Unstructured
• Ingestion
– Pivotal GemFire XD
– Spring XD
– Pivotal HD
• Distillation
– Pivotal Data Dispatch
– ETL
• Processing
– Pivotal HD
– HAWQ
– Pivotal GemFire XD
17© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Components of a Business Data Lake
• Action
– Redis / Pivotal RabbitMQ
– Pivotal GemFire
– Pivotal CF
• Unified Data Management
– Pivotal Data Dispatch
• Unified Operations
– Pivotal Command Center
• Storage
– Structured
– Unstructured
• Ingestion
– Pivotal GemFire XD
– Spring XD
– Pivotal HD
• Distillation
– Pivotal Data Dispatch
– ETL
• Processing
– Pivotal HD
– HAWQ
– Pivotal GemFire XD
18© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Components of a Business Data Lake
• Action
– Redis / Pivotal RabbitMQ
– Pivotal GemFire
– Pivotal CF
• Unified Data Management
– Pivotal Data Dispatch
• Unified Operations
– Pivotal Command Center
• Storage
– Structured
– Unstructured
• Ingestion
– Pivotal GemFire XD
– Spring XD
– Pivotal HD
• Distillation
– Pivotal Data Dispatch
– ETL
• Processing
– Pivotal HD
– HAWQ
– Pivotal GemFire XD
19© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Components of a Business Data Lake
• Action
– Redis / Pivotal RabbitMQ
– Pivotal GemFire
– Pivotal CF
• Unified Data Management
– Pivotal Data Dispatch
• Unified Operations
– Pivotal Command Center
• Storage
– Structured
– Unstructured
• Ingestion
– Pivotal GemFire XD
– Spring XD
– Pivotal HD
• Distillation
– Pivotal Data Dispatch
– ETL
• Processing
– Pivotal HD
– HAWQ
– Pivotal GemFire XD
20© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Components of a Business Data Lake
• Action
– Redis / Pivotal RabbitMQ
– Pivotal GemFire
– Pivotal CF
• Unified Data Management
– Pivotal Data Dispatch
• Unified Operations
– Pivotal Command Center
• Storage
– Structured
– Unstructured
• Ingestion
– Pivotal GemFire XD
– Spring XD
– Pivotal HD
• Distillation
– Pivotal Data Dispatch
– ETL
• Processing
– Pivotal HD
– HAWQ
– Pivotal GemFire XD
21© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Components of a Business Data Lake
• Action
– Redis / Pivotal RabbitMQ
– Pivotal GemFire
– Pivotal CF
• Unified Data Management
– Pivotal Data Dispatch
• Unified Operations
– Pivotal Command Center
• Storage
– Structured
– Unstructured
• Ingestion
– Pivotal GemFire XD
– Spring XD
– Pivotal HD
• Distillation
– Pivotal Data Dispatch
– ETL
• Processing
– Pivotal HD
– HAWQ
– Pivotal GemFire XD
22© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Components of a Business Data Lake
• Action
– Redis / Pivotal RabbitMQ
– Pivotal GemFire
– Pivotal CF
• Unified Data Management
– Pivotal Data Dispatch
• Unified Operations
– Pivotal Command Center
• Storage
– Structured
– Unstructured
• Ingestion
– Pivotal GemFire XD
– Spring XD
– Pivotal HD
• Distillation
– Pivotal Data Dispatch
– ETL
• Processing
– Pivotal HD
– HAWQ
– Pivotal GemFire XD
23© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Components of a Business Data Lake
• Action
– Redis / Pivotal RabbitMQ
– Pivotal GemFire
– Pivotal CF
• Unified Data Management
– Pivotal Data Dispatch
• Unified Operations
– Pivotal Command Center
• Storage
– Structured
– Unstructured
• Ingestion
– Pivotal GemFire XD
– Spring XD
– Pivotal HD
• Distillation
– Pivotal Data Dispatch
– ETL
• Processing
– Pivotal HD
– HAWQ
– Pivotal GemFire XD
29© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Business Data Lake Terminology
• Streaming
• Micro Batch
• Batch
• Mega Batch
• Real Time Response
• Interactive Response
• Near Real-time Response
30© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Business Data Lake Terminology
• Streaming
• Micro Batch
• Batch
• Mega Batch
• Real Time Response
• Interactive Response
• Near Real-time Response
31© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Business Data Lake Terminology
• Streaming
• Micro Batch
• Batch
• Mega Batch
• Real Time Response
• Interactive Response
• Near Real-time Response
32© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Business Data Lake Terminology
• Streaming
• Micro Batch
• Batch
• Mega Batch
• Real Time Response
• Interactive Response
• Near Real-time Response
33© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Business Data Lake Terminology
• Streaming
• Micro Batch
• Batch
• Mega Batch
• Real Time Response
• Interactive Response
• Near Real-time Response
34© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Business Data Lake Terminology
• Streaming
• Micro Batch
• Batch
• Mega Batch
• Real Time Response
• Interactive Response
• Near Real-time Response
35© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Business Data Lake Terminology
• Streaming
• Micro Batch
• Batch
• Mega Batch
• Real Time Response
• Interactive Response
• Near Real-time Response
36© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Pivotal HD Architecture
HDFS
HBas
e Pig, Hive,
Mahout
Map
Reduce
Sqoop Flume
Resource
Management
& Workflow
YARN
ZooKeeper
Apache Pivotal
Command
Center
Configure,
Deploy,
Monitor,
Manage
Spring XD
Pivotal HD
Enterprise
Spring
Xtension
Framework
Catalog
Services
Query
Optimizer
Dynamic Pipelining
ANSI SQL + Analytics
HAWQ – Advanced
Database Services
Distributed
In-memory
Store
Query
Transactions
Ingestion
Processing
Hadoop Driver –
Parallel with Compaction
ANSI SQL + In-Memory
Pivotal GemFire XD –
Real-Time Database Services
MADlib Algorithms
Oozie
Virtual
Extensions
GraphLab,
Open MPI
37© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Pivotal HD Value
• Cost-based Query Optimizer
• ANSI SQL Compliant
• Linear, incremental scalability on
COTS hardware
• Deep Analytic OLAP Queries
• Petabyte Data Storage &
Management
• Low latency updates and
transactions
• Partitioned Events in situ w/ data
• Active-active deployment across
WAN
OLAP OLTP
SQL
HDFS
38© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Data Lake Interfaces
Ingestion Streaming Micro batch Batch Mega batch
Data Loader Yes Yes Yes
GemFire XD Yes
PDD
Spring XD Yes Yes Yes Yes
Sqoop Yes Yes
Distcp Yes Yes
Flume Yes Yes Yes
HDFS put Yes Yes
Talend Yes Yes
Informatica Yes Yes
Interface Real time Interactive Batch
GemFire XD (SQL) Yes Yes
HAWQ (SQL) Yes Yes Yes
Hive (HiveQL) Yes
HBase (NoSQL) Yes Yes
MapReduce Yes
Pig Yes
Impala (SQL) Yes Yes
BI Tools GemFire XD HAWQ Hive
MicroStrategy Yes Yes
BusinessObjects Yes Yes
Spotfire Yes Yes
Tableau Yes Yes
Microsoft Excel Yes Yes
Datameer Yes Yes
Karmasphere Yes Yes
Pivotal Data Dispatch
Legend:
Pivotal
Apache
Partner
Competition
Monitoring Data
Management
Configuration
Install
Pivotal
command
center
Pivotal
command
center
Data access
Ingestion Analytics+
Analytics
39© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Data Ingestion
Event processing
Eventcollection
File
s
Event
s
Event
s
File
s
Streaming
Mega batch
Pivotal GemFire XD
Spring XD
Micro batch
N/A
Data Loader
Spring XD
High
throughput
Low
throughput
Batc
h
Real
time
Pivotal GemFire XD
Data Loader
Spring XD
Out of the box support for HTTP, Tail, Mail, Twitter, Pivotal GemFire, TCP, JMS, Pivotal RabbitMQ,
Time, MQTT, …
Move massive amounts of data at wire speed with throttling capabilities.
SQL Insert data into a Pivotal GemFire XD and API to send data to Pivotal GemFire XD.
Pivotal
GemFire XD
Spring XD
Data Loader
40© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Data Access
SQL Query for interactive data access. Connectivity with industry standard BI tools.
HiveQL and MapReduce for batch data access. HBase for real-time looking and simple data queries.
SQL queries, NoSQL and alerting APIs for real-time data. Data persisted on HDFS immediately
available for interactive queries.
Pivotal
GemFire XD
HAWQ
Hive HBase
MapReduce
Analytic
s
Looku
p
Batc
h
Real
time
Interactive
Query
HAWQ
Hive
MapReduce
Pivotal
GemFire XD
HBaseMapReduce
Pig
Data distillation
MapReduce
Pig
Use connectors, programs,
models to convert to
structured data
Event access methods
Eventstorage
Unstructure
d
Structured interfaces
Unstructure
d
Structured
SQL
HiveQL
Hbase APIs
MapReduce
Pig
41© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Data Distillation
SQL Query for interactive data access. Connectivity with industry standard BI tools.
HiveQL and MapReduce for batch data access. HBase for real-time looking and simple data queries.
SQL queries, NoSQL and alerting APIs for real-time data. Data persisted on HDFS immediately
available for interactive queries.
Pivotal
GemFire XD
HAWQ
Hive HBase
MapReduce
Analytic
s
Looku
p
Batc
h
Real
time
Interactive
Query
HAWQ
Hive
MapReduce
Pivotal
GemFire XD
HBaseMapReduce
Pig
Connectors from
Hadoop
Pivotal Greenplum
Database
Pivotal GemFire/SQL Fire
Processing platform
Datastorage
Native
Hadoo
p
Native
HDF
S
HAWQ
Pivotal GemFire XD
PXF connectors
42© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
The Scenario Yesterday…
Application Type
Database
Hadoop Distributed File
System
Parallel Query Engine
In-Memory Data Grid for
Hadoop
In-Memory Data Grid with
SQL Layer
In-Memory Data Grid
Pricing Metric
Pivotal
Component
Data storage:
tiered
terabytes
Nodes
Nodes
TBD
CPUs and Add Ons
with restrictions
CPUs and Add Ons
with restrictions
Other add-on products: Pivotal Data Dispatch, Alpine Chorus
1
3
4
2
5
6
Greenplum DB
Pivotal HD
HAWQ
GemFire XD
SQLFire
GemFire
* GemFire XD will be included upon GA-Est. Q2-2014
43© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
The Scenario Yesterday…
Application Type
Greenplum DBDatabase
Pivotal HDHadoop Distributed File
System
HAWQParallel Query Engine
In-Memory Data Grid for
Hadoop
SQLFireIn-Memory Data Grid with
SQL Layer
GemFireIn-Memory Data Grid
Pricing Metric:
Pivotal
Component
SKU
1
3
4
2
5
6
Unit of
Measure
Price
GemFire XD*
* GemFire XD will be included upon GA. Est Q2-2014
44© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
World’s Leading Experts
Pivotal Labs – Pivotal Data Labs
On Demand Services
Pivotal Data Dispatch
BATCH BATCH
INTERACTIVE INTERACTIVEHAWQGreenplum DB
Unlimited Pivotal HD
REAL-TIME REAL-TIMEGemFire XDGemFire | SQLFire
45© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Customer Centric Model
UNLIMITED PIVOTAL HD INCLUDED
Software
Only
Core
Based
Subscription
Based
Flexible
Licensing
Customer
Incentives
46© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Store
Everything
47© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
How Does it Work in Practice?
• Obsessively collect
data
• Keep it forever
• Put the data in
one place
Store
Everything
48© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Analyze
Anything
49© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
How Does it Work in Practice?
• Cleanse, organize, and
manage your data lake
• Make the right tools
available
• Use the resources wisely
to compute, analyze,
and understand data
• Obsessively collect
data
• Keep it forever
• Put the data in
one place
Analyze
Anything
Store
Everything
50© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Build
the Right
Thing
51© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
How Does it Work in Practice?
• Use insights to
iteratively improve
your product
Build the
Right Thing
• Cleanse, organize, and
manage your data lake
• Make the right tools
available
• Use the resources wisely
to compute, analyze,
and understand data
• Obsessively collect
data
• Keep it forever
• Put the data in
one place
Analyze
Anything
Store
Everything
52© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
 Store Everything
 Analyze Anything
 Build the Right Thing
53© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Measure the Value
http://www.gopivotal.com/big-
data/pivotal-big-data-suite/value-
tool
54© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Compare the Status Quo
http://www.gopivotal.com/big-
data/pivotal-big-data-suite/value-
tool
55© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
Forecast the Growth
http://www.gopivotal.com/big-
data/pivotal-big-data-suite/value-
tool
56© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
http://www.gopivotal.com/big-
data/pivotal-big-data-suite/value-
tool
57© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
http://www.gopivotal.com/big-
data/pivotal-big-data-suite/value-
tool
Pivotal Big Data Suite Enables Data-Driven Enterprises

More Related Content

What's hot

Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...Precisely
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards FinalistsCloudera, Inc.
 
Extending Data Lake using the Lambda Architecture June 2015
Extending Data Lake using the Lambda Architecture June 2015Extending Data Lake using the Lambda Architecture June 2015
Extending Data Lake using the Lambda Architecture June 2015DataWorks Summit
 
Breakout: Operational Analytics with Hadoop
Breakout: Operational Analytics with HadoopBreakout: Operational Analytics with Hadoop
Breakout: Operational Analytics with HadoopCloudera, Inc.
 
Cloudera + Syncsort: Fuel Business Insights, Analytics, and Next Generation T...
Cloudera + Syncsort: Fuel Business Insights, Analytics, and Next Generation T...Cloudera + Syncsort: Fuel Business Insights, Analytics, and Next Generation T...
Cloudera + Syncsort: Fuel Business Insights, Analytics, and Next Generation T...Precisely
 
Becoming Data-Driven Through Cultural Change
Becoming Data-Driven Through Cultural ChangeBecoming Data-Driven Through Cultural Change
Becoming Data-Driven Through Cultural ChangeCloudera, Inc.
 
Global Data Management – a practical framework to rethinking enterprise, oper...
Global Data Management – a practical framework to rethinking enterprise, oper...Global Data Management – a practical framework to rethinking enterprise, oper...
Global Data Management – a practical framework to rethinking enterprise, oper...DataWorks Summit
 
Moving to the Cloud: Modernizing Data Architecture in Healthcare
Moving to the Cloud: Modernizing Data Architecture in HealthcareMoving to the Cloud: Modernizing Data Architecture in Healthcare
Moving to the Cloud: Modernizing Data Architecture in HealthcarePerficient, Inc.
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformCloudera, Inc.
 
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...Cloudera, Inc.
 
Optimizing Regulatory Compliance with Big Data
Optimizing Regulatory Compliance with Big DataOptimizing Regulatory Compliance with Big Data
Optimizing Regulatory Compliance with Big DataCloudera, Inc.
 
Actian forrester- hortonworks
Actian   forrester- hortonworksActian   forrester- hortonworks
Actian forrester- hortonworksHortonworks
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Cloudera, Inc.
 
A New Day for Oracle Analytics
A New Day for Oracle AnalyticsA New Day for Oracle Analytics
A New Day for Oracle AnalyticsRich Clayton
 
Meet the experts dwo bde vds v7
Meet the experts dwo bde vds v7Meet the experts dwo bde vds v7
Meet the experts dwo bde vds v7mmathipra
 
How Data Science is Preventing College Dropouts and Advancing Student Success
How Data Science is Preventing College Dropouts and Advancing Student SuccessHow Data Science is Preventing College Dropouts and Advancing Student Success
How Data Science is Preventing College Dropouts and Advancing Student SuccessVMware Tanzu
 
Put Alternative Data to Use in Capital Markets

Put Alternative Data to Use in Capital Markets
Put Alternative Data to Use in Capital Markets

Put Alternative Data to Use in Capital Markets
Cloudera, Inc.
 
Contexti / Oracle - Big Data : From Pilot to Production
Contexti / Oracle - Big Data : From Pilot to ProductionContexti / Oracle - Big Data : From Pilot to Production
Contexti / Oracle - Big Data : From Pilot to ProductionContexti
 
Enterprise Data Science at Scale Meetup - IBM and Hortonworks - Oct 2017
Enterprise Data Science at Scale Meetup - IBM and Hortonworks - Oct 2017 Enterprise Data Science at Scale Meetup - IBM and Hortonworks - Oct 2017
Enterprise Data Science at Scale Meetup - IBM and Hortonworks - Oct 2017 Hortonworks
 
Best Practices for Building a Warehouse Quickly
Best Practices for Building a Warehouse QuicklyBest Practices for Building a Warehouse Quickly
Best Practices for Building a Warehouse QuicklyWhereScape
 

What's hot (20)

Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists
 
Extending Data Lake using the Lambda Architecture June 2015
Extending Data Lake using the Lambda Architecture June 2015Extending Data Lake using the Lambda Architecture June 2015
Extending Data Lake using the Lambda Architecture June 2015
 
Breakout: Operational Analytics with Hadoop
Breakout: Operational Analytics with HadoopBreakout: Operational Analytics with Hadoop
Breakout: Operational Analytics with Hadoop
 
Cloudera + Syncsort: Fuel Business Insights, Analytics, and Next Generation T...
Cloudera + Syncsort: Fuel Business Insights, Analytics, and Next Generation T...Cloudera + Syncsort: Fuel Business Insights, Analytics, and Next Generation T...
Cloudera + Syncsort: Fuel Business Insights, Analytics, and Next Generation T...
 
Becoming Data-Driven Through Cultural Change
Becoming Data-Driven Through Cultural ChangeBecoming Data-Driven Through Cultural Change
Becoming Data-Driven Through Cultural Change
 
Global Data Management – a practical framework to rethinking enterprise, oper...
Global Data Management – a practical framework to rethinking enterprise, oper...Global Data Management – a practical framework to rethinking enterprise, oper...
Global Data Management – a practical framework to rethinking enterprise, oper...
 
Moving to the Cloud: Modernizing Data Architecture in Healthcare
Moving to the Cloud: Modernizing Data Architecture in HealthcareMoving to the Cloud: Modernizing Data Architecture in Healthcare
Moving to the Cloud: Modernizing Data Architecture in Healthcare
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the Platform
 
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...
 
Optimizing Regulatory Compliance with Big Data
Optimizing Regulatory Compliance with Big DataOptimizing Regulatory Compliance with Big Data
Optimizing Regulatory Compliance with Big Data
 
Actian forrester- hortonworks
Actian   forrester- hortonworksActian   forrester- hortonworks
Actian forrester- hortonworks
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18
 
A New Day for Oracle Analytics
A New Day for Oracle AnalyticsA New Day for Oracle Analytics
A New Day for Oracle Analytics
 
Meet the experts dwo bde vds v7
Meet the experts dwo bde vds v7Meet the experts dwo bde vds v7
Meet the experts dwo bde vds v7
 
How Data Science is Preventing College Dropouts and Advancing Student Success
How Data Science is Preventing College Dropouts and Advancing Student SuccessHow Data Science is Preventing College Dropouts and Advancing Student Success
How Data Science is Preventing College Dropouts and Advancing Student Success
 
Put Alternative Data to Use in Capital Markets

Put Alternative Data to Use in Capital Markets
Put Alternative Data to Use in Capital Markets

Put Alternative Data to Use in Capital Markets

 
Contexti / Oracle - Big Data : From Pilot to Production
Contexti / Oracle - Big Data : From Pilot to ProductionContexti / Oracle - Big Data : From Pilot to Production
Contexti / Oracle - Big Data : From Pilot to Production
 
Enterprise Data Science at Scale Meetup - IBM and Hortonworks - Oct 2017
Enterprise Data Science at Scale Meetup - IBM and Hortonworks - Oct 2017 Enterprise Data Science at Scale Meetup - IBM and Hortonworks - Oct 2017
Enterprise Data Science at Scale Meetup - IBM and Hortonworks - Oct 2017
 
Best Practices for Building a Warehouse Quickly
Best Practices for Building a Warehouse QuicklyBest Practices for Building a Warehouse Quickly
Best Practices for Building a Warehouse Quickly
 

Similar to Pivotal Big Data Suite Enables Data-Driven Enterprises

Pivotal deep dive_on_pivotal_hd_world_class_hdfs_platform
Pivotal deep dive_on_pivotal_hd_world_class_hdfs_platformPivotal deep dive_on_pivotal_hd_world_class_hdfs_platform
Pivotal deep dive_on_pivotal_hd_world_class_hdfs_platformEMC
 
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...Capgemini
 
Oracle Big Data Appliance and Big Data SQL for advanced analytics
Oracle Big Data Appliance and Big Data SQL for advanced analyticsOracle Big Data Appliance and Big Data SQL for advanced analytics
Oracle Big Data Appliance and Big Data SQL for advanced analyticsjdijcks
 
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...MapR Technologies
 
The Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubThe Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubCloudera, Inc.
 
Create your Big Data vision and Hadoop-ify your data warehouse
Create your Big Data vision and Hadoop-ify your data warehouseCreate your Big Data vision and Hadoop-ify your data warehouse
Create your Big Data vision and Hadoop-ify your data warehouseJeff Kelly
 
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoptionHortonworks
 
Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...
Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...
Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...jdijcks
 
Redefining End-to-End Monitoring: The Foundation - High-Performance Architect...
Redefining End-to-End Monitoring: The Foundation - High-Performance Architect...Redefining End-to-End Monitoring: The Foundation - High-Performance Architect...
Redefining End-to-End Monitoring: The Foundation - High-Performance Architect...SL Corporation
 
EMC Pivotal overview deck
EMC Pivotal overview deckEMC Pivotal overview deck
EMC Pivotal overview deckmister_moun
 
Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It! Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It! Cécile Poyet
 
Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It! Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It! Hortonworks
 
Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It! Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It! Cécile Poyet
 
Apache Hadoop and its role in Big Data architecture - Himanshu Bari
Apache Hadoop and its role in Big Data architecture - Himanshu BariApache Hadoop and its role in Big Data architecture - Himanshu Bari
Apache Hadoop and its role in Big Data architecture - Himanshu Barijaxconf
 
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...Hortonworks
 
Optimizing your Modern Data Architecture - with Attunity, RCG Global Services...
Optimizing your Modern Data Architecture - with Attunity, RCG Global Services...Optimizing your Modern Data Architecture - with Attunity, RCG Global Services...
Optimizing your Modern Data Architecture - with Attunity, RCG Global Services...Hortonworks
 
MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...
MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...
MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...MongoDB
 
Eliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside HadoopEliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside HadoopHortonworks
 
Eliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside HadoopEliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside HadoopHortonworks
 
Oracle databáze – Konsolidovaná Data Management Platforma
Oracle databáze – Konsolidovaná Data Management PlatformaOracle databáze – Konsolidovaná Data Management Platforma
Oracle databáze – Konsolidovaná Data Management PlatformaMarketingArrowECS_CZ
 

Similar to Pivotal Big Data Suite Enables Data-Driven Enterprises (20)

Pivotal deep dive_on_pivotal_hd_world_class_hdfs_platform
Pivotal deep dive_on_pivotal_hd_world_class_hdfs_platformPivotal deep dive_on_pivotal_hd_world_class_hdfs_platform
Pivotal deep dive_on_pivotal_hd_world_class_hdfs_platform
 
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
 
Oracle Big Data Appliance and Big Data SQL for advanced analytics
Oracle Big Data Appliance and Big Data SQL for advanced analyticsOracle Big Data Appliance and Big Data SQL for advanced analytics
Oracle Big Data Appliance and Big Data SQL for advanced analytics
 
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
 
The Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubThe Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data Hub
 
Create your Big Data vision and Hadoop-ify your data warehouse
Create your Big Data vision and Hadoop-ify your data warehouseCreate your Big Data vision and Hadoop-ify your data warehouse
Create your Big Data vision and Hadoop-ify your data warehouse
 
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
 
Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...
Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...
Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...
 
Redefining End-to-End Monitoring: The Foundation - High-Performance Architect...
Redefining End-to-End Monitoring: The Foundation - High-Performance Architect...Redefining End-to-End Monitoring: The Foundation - High-Performance Architect...
Redefining End-to-End Monitoring: The Foundation - High-Performance Architect...
 
EMC Pivotal overview deck
EMC Pivotal overview deckEMC Pivotal overview deck
EMC Pivotal overview deck
 
Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It! Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It!
 
Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It! Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It!
 
Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It! Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It!
 
Apache Hadoop and its role in Big Data architecture - Himanshu Bari
Apache Hadoop and its role in Big Data architecture - Himanshu BariApache Hadoop and its role in Big Data architecture - Himanshu Bari
Apache Hadoop and its role in Big Data architecture - Himanshu Bari
 
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
 
Optimizing your Modern Data Architecture - with Attunity, RCG Global Services...
Optimizing your Modern Data Architecture - with Attunity, RCG Global Services...Optimizing your Modern Data Architecture - with Attunity, RCG Global Services...
Optimizing your Modern Data Architecture - with Attunity, RCG Global Services...
 
MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...
MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...
MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...
 
Eliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside HadoopEliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside Hadoop
 
Eliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside HadoopEliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside Hadoop
 
Oracle databáze – Konsolidovaná Data Management Platforma
Oracle databáze – Konsolidovaná Data Management PlatformaOracle databáze – Konsolidovaná Data Management Platforma
Oracle databáze – Konsolidovaná Data Management Platforma
 

More from EMC

INDUSTRY-LEADING TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
INDUSTRY-LEADING  TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUDINDUSTRY-LEADING  TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
INDUSTRY-LEADING TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUDEMC
 
Cloud Foundry Summit Berlin Keynote
Cloud Foundry Summit Berlin Keynote Cloud Foundry Summit Berlin Keynote
Cloud Foundry Summit Berlin Keynote EMC
 
EMC GLOBAL DATA PROTECTION INDEX
EMC GLOBAL DATA PROTECTION INDEX EMC GLOBAL DATA PROTECTION INDEX
EMC GLOBAL DATA PROTECTION INDEX EMC
 
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIOTransforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIOEMC
 
Citrix ready-webinar-xtremio
Citrix ready-webinar-xtremioCitrix ready-webinar-xtremio
Citrix ready-webinar-xtremioEMC
 
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES EMC
 
EMC with Mirantis Openstack
EMC with Mirantis OpenstackEMC with Mirantis Openstack
EMC with Mirantis OpenstackEMC
 
Modern infrastructure for business data lake
Modern infrastructure for business data lakeModern infrastructure for business data lake
Modern infrastructure for business data lakeEMC
 
Force Cyber Criminals to Shop Elsewhere
Force Cyber Criminals to Shop ElsewhereForce Cyber Criminals to Shop Elsewhere
Force Cyber Criminals to Shop ElsewhereEMC
 
Pivotal : Moments in Container History
Pivotal : Moments in Container History Pivotal : Moments in Container History
Pivotal : Moments in Container History EMC
 
Data Lake Protection - A Technical Review
Data Lake Protection - A Technical ReviewData Lake Protection - A Technical Review
Data Lake Protection - A Technical ReviewEMC
 
Mobile E-commerce: Friend or Foe
Mobile E-commerce: Friend or FoeMobile E-commerce: Friend or Foe
Mobile E-commerce: Friend or FoeEMC
 
Virtualization Myths Infographic
Virtualization Myths Infographic Virtualization Myths Infographic
Virtualization Myths Infographic EMC
 
Intelligence-Driven GRC for Security
Intelligence-Driven GRC for SecurityIntelligence-Driven GRC for Security
Intelligence-Driven GRC for SecurityEMC
 
The Trust Paradox: Access Management and Trust in an Insecure Age
The Trust Paradox: Access Management and Trust in an Insecure AgeThe Trust Paradox: Access Management and Trust in an Insecure Age
The Trust Paradox: Access Management and Trust in an Insecure AgeEMC
 
EMC Technology Day - SRM University 2015
EMC Technology Day - SRM University 2015EMC Technology Day - SRM University 2015
EMC Technology Day - SRM University 2015EMC
 
EMC Academic Summit 2015
EMC Academic Summit 2015EMC Academic Summit 2015
EMC Academic Summit 2015EMC
 
Data Science and Big Data Analytics Book from EMC Education Services
Data Science and Big Data Analytics Book from EMC Education ServicesData Science and Big Data Analytics Book from EMC Education Services
Data Science and Big Data Analytics Book from EMC Education ServicesEMC
 
Using EMC Symmetrix Storage in VMware vSphere Environments
Using EMC Symmetrix Storage in VMware vSphere EnvironmentsUsing EMC Symmetrix Storage in VMware vSphere Environments
Using EMC Symmetrix Storage in VMware vSphere EnvironmentsEMC
 
Using EMC VNX storage with VMware vSphereTechBook
Using EMC VNX storage with VMware vSphereTechBookUsing EMC VNX storage with VMware vSphereTechBook
Using EMC VNX storage with VMware vSphereTechBookEMC
 

More from EMC (20)

INDUSTRY-LEADING TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
INDUSTRY-LEADING  TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUDINDUSTRY-LEADING  TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
INDUSTRY-LEADING TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
 
Cloud Foundry Summit Berlin Keynote
Cloud Foundry Summit Berlin Keynote Cloud Foundry Summit Berlin Keynote
Cloud Foundry Summit Berlin Keynote
 
EMC GLOBAL DATA PROTECTION INDEX
EMC GLOBAL DATA PROTECTION INDEX EMC GLOBAL DATA PROTECTION INDEX
EMC GLOBAL DATA PROTECTION INDEX
 
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIOTransforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
 
Citrix ready-webinar-xtremio
Citrix ready-webinar-xtremioCitrix ready-webinar-xtremio
Citrix ready-webinar-xtremio
 
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
 
EMC with Mirantis Openstack
EMC with Mirantis OpenstackEMC with Mirantis Openstack
EMC with Mirantis Openstack
 
Modern infrastructure for business data lake
Modern infrastructure for business data lakeModern infrastructure for business data lake
Modern infrastructure for business data lake
 
Force Cyber Criminals to Shop Elsewhere
Force Cyber Criminals to Shop ElsewhereForce Cyber Criminals to Shop Elsewhere
Force Cyber Criminals to Shop Elsewhere
 
Pivotal : Moments in Container History
Pivotal : Moments in Container History Pivotal : Moments in Container History
Pivotal : Moments in Container History
 
Data Lake Protection - A Technical Review
Data Lake Protection - A Technical ReviewData Lake Protection - A Technical Review
Data Lake Protection - A Technical Review
 
Mobile E-commerce: Friend or Foe
Mobile E-commerce: Friend or FoeMobile E-commerce: Friend or Foe
Mobile E-commerce: Friend or Foe
 
Virtualization Myths Infographic
Virtualization Myths Infographic Virtualization Myths Infographic
Virtualization Myths Infographic
 
Intelligence-Driven GRC for Security
Intelligence-Driven GRC for SecurityIntelligence-Driven GRC for Security
Intelligence-Driven GRC for Security
 
The Trust Paradox: Access Management and Trust in an Insecure Age
The Trust Paradox: Access Management and Trust in an Insecure AgeThe Trust Paradox: Access Management and Trust in an Insecure Age
The Trust Paradox: Access Management and Trust in an Insecure Age
 
EMC Technology Day - SRM University 2015
EMC Technology Day - SRM University 2015EMC Technology Day - SRM University 2015
EMC Technology Day - SRM University 2015
 
EMC Academic Summit 2015
EMC Academic Summit 2015EMC Academic Summit 2015
EMC Academic Summit 2015
 
Data Science and Big Data Analytics Book from EMC Education Services
Data Science and Big Data Analytics Book from EMC Education ServicesData Science and Big Data Analytics Book from EMC Education Services
Data Science and Big Data Analytics Book from EMC Education Services
 
Using EMC Symmetrix Storage in VMware vSphere Environments
Using EMC Symmetrix Storage in VMware vSphere EnvironmentsUsing EMC Symmetrix Storage in VMware vSphere Environments
Using EMC Symmetrix Storage in VMware vSphere Environments
 
Using EMC VNX storage with VMware vSphereTechBook
Using EMC VNX storage with VMware vSphereTechBookUsing EMC VNX storage with VMware vSphereTechBook
Using EMC VNX storage with VMware vSphereTechBook
 

Recently uploaded

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 

Recently uploaded (20)

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 

Pivotal Big Data Suite Enables Data-Driven Enterprises

  • 1. 1© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. The New Pivotal Big Data Suite Jacque Istok
  • 2. 2© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.  Store Everything  Analyze Anything  Build the Right Thing
  • 3. 3© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Pivotal Enables Hadoop Market Adoption Data Lakes Unify Unstructured and Structured Data Access Big Data Apps Build analytic and transaction-led applications impacting top line revenue Data-Driven Enterprise App Dev and Operational Management on HDFS Data Architecture ETL Offload Accommodate massive data growth with existing EDW investments
  • 4. 4© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Pivotal Full Approach It’s More Than Just Hadoop
  • 5. 5© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Big Data: Industry Perspective Retail • CRM – Customer Scoring • Store Siting and Layout • Fraud Detection / Prevention • Supply Chain Optimization Advertising & Public Relations • Demand Signaling • Ad Targeting • Sentiment Analysis • Customer Acquisition Financial Services • Algorithmic Trading • Risk Analysis • Fraud Detection • Portfolio Analysis Media & Telecommunications • Network Optimization • Customer Scoring • Churn Prevention • Fraud Prevention Manufacturing • Product Research • Engineering Analytics • Process and Quality Analysis • Distribution Optimization Energy • Smart Grid • Exploration Government • Market Governance • Counter-Terrorism • Econometrics • Health Informatics Healthcare & Life Sciences • Pharmaco-Genomics • Bio-Informatics • Pharmaceutical Research • Clinical Outcomes Research
  • 6. 6© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. How Pivotal Accelerates Value Creation 70% of data generated by customers 80% of data being stored 3% being prepared for analysis 0.5% being analyzed <0.5% being operationalized First Movers Smart Enterprises ~20X $2.9B ~30X$4 B ~7X $290B ~20X $120B Average Enterprises SOLVE THE BIG DATA UTILITY GAP
  • 7. 7© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Market Dynamics: Big Data Technologies Applications Analytics and Discovery Data Organization and Management Infrastructure A new generation of technologies and architectures that enable economical high-velocity capture, discovery and analysis Pivotal Data Labs Source: IDC Predictions 2013: Big Data Battle for Dominance in the Intelligent Economy
  • 8. 8© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Journey to Data Driven Enterprise Archive •Realize cost efficiencies and extend life of existing systems and •Data migration Insights •Integrate all existing data to generate business insights •Data Analysis Apps •Build Apps to assist/take (automated) actions from the insights generated •Data Driven Apps Business Models •Create new revenue streams leveraging new data and new insights •Business Transformation Repeatable Framework • Platform for experimenting data driven business models and innovation •Experimentation Platform Data Lake Platform as a Service Manager IT Leaders Business Leader CEO STEPSTECHNOLOGYTARGET
  • 9. 9© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Data Driven: Harder Than it Sounds Operationalize Ingest Distill Interface Process Analytical Transactional Operationalize Ingest Distill Interface Process Analytical Transactional Operationalize Ingest Distill Interface Process Analytical Transactional Real Time Near Real Time Batch Predictive Call Routing, Fraud Prediction, Dynamic Pricing, Re-Marketing, Stream Analytics Analytic Model Designs, Transaction Analysis, Trend Analysis ETL, Archive, Trending, Monthly and Weekly Jobs
  • 10. 10© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Data Driven: Impossible in Silos Finance Manufacturing Marketing IT Data Growth Over 60% Floods These Silos
  • 11. 11© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Generic Business Data Lake Architecture Ingestion Tier Insights Tier Unified Operations Tier System monitoring System management Unified Data Management Tier Data mgmt. services MDM RDM Audit and policy mgmt. Processing Tier Workflow management Distillation Tier HDFS storage Unstructured and structured data In-memory MPP database Real-time Micro batch Mega batch SQL NoSQL SQL MapReduce Query interfaces SQL Sources Action Tier Real-time ingestion Micro batch ingestion Batch ingestion Real-time insights Interactive insights Batch insights
  • 12. 12© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Pivotal Business Data Lake Govern where it matters  Focus on MDM and RDM  Enforce only when sharing  Treat corporate as aggregation of local Encourage local requirements  Let the business decide what they need  Build from the bottom  Enable traceability to source  Disposable data views Distill on demand  Select only what you want  Business friendly tooling  Re-usable information maps  Rapid change cycle Store everything  Store everything ‘as is’  Include structured and unstructured data  Store it cheaply
  • 13. 13© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Pivotal Business Data Lake Architecture Ingestion Tier Insights Tier Unified Operations Tier Pivotal Command Center Unified Data Management Tier Pivotal Data Dispatch MDM RDM Pivotal Data Dispatch Processing Tier Spring XD, Oozie Distillation Tier Pivotal HD Unstructured and structured data Pivotal GemFire XD GPDB / HAWQ Pivotal GemFire XD Spring XD Spring XD Pivotal GemFire XD Data Loader Sqoop Flume Spring XD Data Loader Pivotal GemFire XD HAWQ HBase HAWQ MapReduce Hive Pig Query interfaces HAWQ Pivotal GemFire XD HBase Sources Action Tier Clickstream Sensor Data Weblogs Network Data CRM Data ERP Data Pivotal GemFire GPDB/HAWQ Pivotal RabbitMQ Redis Pivotal CF
  • 14. 14© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Pivotal Business Data Lake Govern where it matters  Information governance  MDM & RDM data integrated  Information RADAR approach to identification Encourage local requirements  HAWQ – Traditional disk-based structured SQL  Pivotal GemFire XD – Fast in-memory database  Pivotal GemFire XD – Real-time analytics and integration Distill on demand  HAWQ  Structured SQL on Pivotal HD  Pivotal Data Dispatch  Data movement and transformation Store everything  Pivotal HD  Low cost  Simplified deployment
  • 15. 15© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. How is a Business Data Lake Different? Business Data LakeCriteria EDW Common data model Base class = standard data Derived classes = local data Single class = single view across the enterprise Data quality Full spectrum 1 0 0 1 01 0 0 1 0 1 1 1 0 Data integration Multiple interfaces SQL, SAS, R, MapReduce, NoSQL SQL access integration with SAS, R and other analytical interfaces Mixed workload with varying QoS Support low latency, interactive and batch Limited QoS separation required
  • 16. 16© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Components of a Business Data Lake • Action – Redis / Pivotal RabbitMQ – Pivotal GemFire – Pivotal CF • Unified Data Management – Pivotal Data Dispatch • Unified Operations – Pivotal Command Center • Storage – Structured – Unstructured • Ingestion – Pivotal GemFire XD – Spring XD – Pivotal HD • Distillation – Pivotal Data Dispatch – ETL • Processing – Pivotal HD – HAWQ – Pivotal GemFire XD
  • 17. 17© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Components of a Business Data Lake • Action – Redis / Pivotal RabbitMQ – Pivotal GemFire – Pivotal CF • Unified Data Management – Pivotal Data Dispatch • Unified Operations – Pivotal Command Center • Storage – Structured – Unstructured • Ingestion – Pivotal GemFire XD – Spring XD – Pivotal HD • Distillation – Pivotal Data Dispatch – ETL • Processing – Pivotal HD – HAWQ – Pivotal GemFire XD
  • 18. 18© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Components of a Business Data Lake • Action – Redis / Pivotal RabbitMQ – Pivotal GemFire – Pivotal CF • Unified Data Management – Pivotal Data Dispatch • Unified Operations – Pivotal Command Center • Storage – Structured – Unstructured • Ingestion – Pivotal GemFire XD – Spring XD – Pivotal HD • Distillation – Pivotal Data Dispatch – ETL • Processing – Pivotal HD – HAWQ – Pivotal GemFire XD
  • 19. 19© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Components of a Business Data Lake • Action – Redis / Pivotal RabbitMQ – Pivotal GemFire – Pivotal CF • Unified Data Management – Pivotal Data Dispatch • Unified Operations – Pivotal Command Center • Storage – Structured – Unstructured • Ingestion – Pivotal GemFire XD – Spring XD – Pivotal HD • Distillation – Pivotal Data Dispatch – ETL • Processing – Pivotal HD – HAWQ – Pivotal GemFire XD
  • 20. 20© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Components of a Business Data Lake • Action – Redis / Pivotal RabbitMQ – Pivotal GemFire – Pivotal CF • Unified Data Management – Pivotal Data Dispatch • Unified Operations – Pivotal Command Center • Storage – Structured – Unstructured • Ingestion – Pivotal GemFire XD – Spring XD – Pivotal HD • Distillation – Pivotal Data Dispatch – ETL • Processing – Pivotal HD – HAWQ – Pivotal GemFire XD
  • 21. 21© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Components of a Business Data Lake • Action – Redis / Pivotal RabbitMQ – Pivotal GemFire – Pivotal CF • Unified Data Management – Pivotal Data Dispatch • Unified Operations – Pivotal Command Center • Storage – Structured – Unstructured • Ingestion – Pivotal GemFire XD – Spring XD – Pivotal HD • Distillation – Pivotal Data Dispatch – ETL • Processing – Pivotal HD – HAWQ – Pivotal GemFire XD
  • 22. 22© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Components of a Business Data Lake • Action – Redis / Pivotal RabbitMQ – Pivotal GemFire – Pivotal CF • Unified Data Management – Pivotal Data Dispatch • Unified Operations – Pivotal Command Center • Storage – Structured – Unstructured • Ingestion – Pivotal GemFire XD – Spring XD – Pivotal HD • Distillation – Pivotal Data Dispatch – ETL • Processing – Pivotal HD – HAWQ – Pivotal GemFire XD
  • 23. 23© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Components of a Business Data Lake • Action – Redis / Pivotal RabbitMQ – Pivotal GemFire – Pivotal CF • Unified Data Management – Pivotal Data Dispatch • Unified Operations – Pivotal Command Center • Storage – Structured – Unstructured • Ingestion – Pivotal GemFire XD – Spring XD – Pivotal HD • Distillation – Pivotal Data Dispatch – ETL • Processing – Pivotal HD – HAWQ – Pivotal GemFire XD
  • 24. 29© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Business Data Lake Terminology • Streaming • Micro Batch • Batch • Mega Batch • Real Time Response • Interactive Response • Near Real-time Response
  • 25. 30© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Business Data Lake Terminology • Streaming • Micro Batch • Batch • Mega Batch • Real Time Response • Interactive Response • Near Real-time Response
  • 26. 31© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Business Data Lake Terminology • Streaming • Micro Batch • Batch • Mega Batch • Real Time Response • Interactive Response • Near Real-time Response
  • 27. 32© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Business Data Lake Terminology • Streaming • Micro Batch • Batch • Mega Batch • Real Time Response • Interactive Response • Near Real-time Response
  • 28. 33© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Business Data Lake Terminology • Streaming • Micro Batch • Batch • Mega Batch • Real Time Response • Interactive Response • Near Real-time Response
  • 29. 34© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Business Data Lake Terminology • Streaming • Micro Batch • Batch • Mega Batch • Real Time Response • Interactive Response • Near Real-time Response
  • 30. 35© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Business Data Lake Terminology • Streaming • Micro Batch • Batch • Mega Batch • Real Time Response • Interactive Response • Near Real-time Response
  • 31. 36© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Pivotal HD Architecture HDFS HBas e Pig, Hive, Mahout Map Reduce Sqoop Flume Resource Management & Workflow YARN ZooKeeper Apache Pivotal Command Center Configure, Deploy, Monitor, Manage Spring XD Pivotal HD Enterprise Spring Xtension Framework Catalog Services Query Optimizer Dynamic Pipelining ANSI SQL + Analytics HAWQ – Advanced Database Services Distributed In-memory Store Query Transactions Ingestion Processing Hadoop Driver – Parallel with Compaction ANSI SQL + In-Memory Pivotal GemFire XD – Real-Time Database Services MADlib Algorithms Oozie Virtual Extensions GraphLab, Open MPI
  • 32. 37© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Pivotal HD Value • Cost-based Query Optimizer • ANSI SQL Compliant • Linear, incremental scalability on COTS hardware • Deep Analytic OLAP Queries • Petabyte Data Storage & Management • Low latency updates and transactions • Partitioned Events in situ w/ data • Active-active deployment across WAN OLAP OLTP SQL HDFS
  • 33. 38© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Data Lake Interfaces Ingestion Streaming Micro batch Batch Mega batch Data Loader Yes Yes Yes GemFire XD Yes PDD Spring XD Yes Yes Yes Yes Sqoop Yes Yes Distcp Yes Yes Flume Yes Yes Yes HDFS put Yes Yes Talend Yes Yes Informatica Yes Yes Interface Real time Interactive Batch GemFire XD (SQL) Yes Yes HAWQ (SQL) Yes Yes Yes Hive (HiveQL) Yes HBase (NoSQL) Yes Yes MapReduce Yes Pig Yes Impala (SQL) Yes Yes BI Tools GemFire XD HAWQ Hive MicroStrategy Yes Yes BusinessObjects Yes Yes Spotfire Yes Yes Tableau Yes Yes Microsoft Excel Yes Yes Datameer Yes Yes Karmasphere Yes Yes Pivotal Data Dispatch Legend: Pivotal Apache Partner Competition Monitoring Data Management Configuration Install Pivotal command center Pivotal command center Data access Ingestion Analytics+ Analytics
  • 34. 39© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Data Ingestion Event processing Eventcollection File s Event s Event s File s Streaming Mega batch Pivotal GemFire XD Spring XD Micro batch N/A Data Loader Spring XD High throughput Low throughput Batc h Real time Pivotal GemFire XD Data Loader Spring XD Out of the box support for HTTP, Tail, Mail, Twitter, Pivotal GemFire, TCP, JMS, Pivotal RabbitMQ, Time, MQTT, … Move massive amounts of data at wire speed with throttling capabilities. SQL Insert data into a Pivotal GemFire XD and API to send data to Pivotal GemFire XD. Pivotal GemFire XD Spring XD Data Loader
  • 35. 40© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Data Access SQL Query for interactive data access. Connectivity with industry standard BI tools. HiveQL and MapReduce for batch data access. HBase for real-time looking and simple data queries. SQL queries, NoSQL and alerting APIs for real-time data. Data persisted on HDFS immediately available for interactive queries. Pivotal GemFire XD HAWQ Hive HBase MapReduce Analytic s Looku p Batc h Real time Interactive Query HAWQ Hive MapReduce Pivotal GemFire XD HBaseMapReduce Pig Data distillation MapReduce Pig Use connectors, programs, models to convert to structured data Event access methods Eventstorage Unstructure d Structured interfaces Unstructure d Structured SQL HiveQL Hbase APIs MapReduce Pig
  • 36. 41© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Data Distillation SQL Query for interactive data access. Connectivity with industry standard BI tools. HiveQL and MapReduce for batch data access. HBase for real-time looking and simple data queries. SQL queries, NoSQL and alerting APIs for real-time data. Data persisted on HDFS immediately available for interactive queries. Pivotal GemFire XD HAWQ Hive HBase MapReduce Analytic s Looku p Batc h Real time Interactive Query HAWQ Hive MapReduce Pivotal GemFire XD HBaseMapReduce Pig Connectors from Hadoop Pivotal Greenplum Database Pivotal GemFire/SQL Fire Processing platform Datastorage Native Hadoo p Native HDF S HAWQ Pivotal GemFire XD PXF connectors
  • 37. 42© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. The Scenario Yesterday… Application Type Database Hadoop Distributed File System Parallel Query Engine In-Memory Data Grid for Hadoop In-Memory Data Grid with SQL Layer In-Memory Data Grid Pricing Metric Pivotal Component Data storage: tiered terabytes Nodes Nodes TBD CPUs and Add Ons with restrictions CPUs and Add Ons with restrictions Other add-on products: Pivotal Data Dispatch, Alpine Chorus 1 3 4 2 5 6 Greenplum DB Pivotal HD HAWQ GemFire XD SQLFire GemFire * GemFire XD will be included upon GA-Est. Q2-2014
  • 38. 43© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. The Scenario Yesterday… Application Type Greenplum DBDatabase Pivotal HDHadoop Distributed File System HAWQParallel Query Engine In-Memory Data Grid for Hadoop SQLFireIn-Memory Data Grid with SQL Layer GemFireIn-Memory Data Grid Pricing Metric: Pivotal Component SKU 1 3 4 2 5 6 Unit of Measure Price GemFire XD* * GemFire XD will be included upon GA. Est Q2-2014
  • 39. 44© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. World’s Leading Experts Pivotal Labs – Pivotal Data Labs On Demand Services Pivotal Data Dispatch BATCH BATCH INTERACTIVE INTERACTIVEHAWQGreenplum DB Unlimited Pivotal HD REAL-TIME REAL-TIMEGemFire XDGemFire | SQLFire
  • 40. 45© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Customer Centric Model UNLIMITED PIVOTAL HD INCLUDED Software Only Core Based Subscription Based Flexible Licensing Customer Incentives
  • 41. 46© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Store Everything
  • 42. 47© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. How Does it Work in Practice? • Obsessively collect data • Keep it forever • Put the data in one place Store Everything
  • 43. 48© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Analyze Anything
  • 44. 49© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. How Does it Work in Practice? • Cleanse, organize, and manage your data lake • Make the right tools available • Use the resources wisely to compute, analyze, and understand data • Obsessively collect data • Keep it forever • Put the data in one place Analyze Anything Store Everything
  • 45. 50© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Build the Right Thing
  • 46. 51© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. How Does it Work in Practice? • Use insights to iteratively improve your product Build the Right Thing • Cleanse, organize, and manage your data lake • Make the right tools available • Use the resources wisely to compute, analyze, and understand data • Obsessively collect data • Keep it forever • Put the data in one place Analyze Anything Store Everything
  • 47. 52© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.  Store Everything  Analyze Anything  Build the Right Thing
  • 48. 53© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Measure the Value http://www.gopivotal.com/big- data/pivotal-big-data-suite/value- tool
  • 49. 54© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Compare the Status Quo http://www.gopivotal.com/big- data/pivotal-big-data-suite/value- tool
  • 50. 55© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. Forecast the Growth http://www.gopivotal.com/big- data/pivotal-big-data-suite/value- tool
  • 51. 56© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. http://www.gopivotal.com/big- data/pivotal-big-data-suite/value- tool
  • 52. 57© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. http://www.gopivotal.com/big- data/pivotal-big-data-suite/value- tool