SlideShare une entreprise Scribd logo
1  sur  5
Télécharger pour lire hors ligne
pivotal.io
PIVOTAL HANDOUT
Pivotal Big Data Suite
Product Suite
COMPLETE PLATFORM FOR DATA-DRIVEN ENTERPRISES
Many industry stalwarts have found their traditional business models under threat by a
new generation of fast growing competitors that leverage big data and analytics. These
new companies are transforming and redefining markets by creating innovative customer
experiences with intelligent, customer-centered applications.
Powering these applications are significant advances made in data processing and analytics
with technologies such as scale-out processing, machine learning, and in-memory
computation. These advances leverage hardware trends such as cloud computing,
convergence of storage and compute resources, and rapidly increasing RAM per system.
Collectively known as big data and advanced analytics, these technologies are developed
within open source communities.
Pivotal Software is a leading contributor to many big data and analytics open source
software projects, and is dedicated to driving innovation in the open source ecosystem.
To help companies adopt big data and analytics and create data-driven business models,
Pivotal has rolled these open source technologies into a comprehensive platform called
Pivotal Big Data Suite, as depicted in Figure 1. Big Data Suite allows companies to
modernize their data infrastructure, discover more insights with advanced analytics, and
build analytic applications at scale.
KEY ADVANTAGES
•	 Quickly deploy and manage an
analytics-optimized business data lake
based on Hadoop
•	 Discover more insights using advanced
analytics with SQL on Hadoop or an
analytics data warehouse
•	 Innovate at scale with smart,
predictive applications backed by
distributed in-memory data stores
FEATURES OF BIG DATA SUITE
•	 Comprehensive offering covering
data processing & storage, advanced
analytics, in-memory data processing
& messaging
•	 Works with Pivotal Cloud Foundry –
deploy with Ops Manager, consume
as services within Pivotal Cloud
Foundry apps
•	 Compatible with Open Data
Platform (ODP) core based
distributions of Hadoop
•	 Based on open source
•	 Processing core-based subscription
license for 1 to 3 years
•	 Flexible licensing – reallocate licensed
core capacity between components
depending on need
•	 Multiple deployment options:
commodity hardware, appliance,
virtualized, cloud and hybrid cloud
Overview
pivotal.io
PIVOTAL HANDOUT
MODERNIZE DATA INFRASTRUCTURE
Store and Process Any Size and Type of Data
A first step for many companies in becoming a data-driven enterprise is to deploy a
modern data infrastructure for storage and data processing based on Hadoop. Pivotal
Big Data Suite helps companies with this transformation at the data processing layer by
including Spring XD, Pivotal HD, and Cloud Foundry Operations Manager.
In an agile infrastructure, data scientists and architects need a rapid, scalable way to
develop specific data flows for ingestion and processing. Spring XD helps customers
quickly create data pipelines to orchestrate the flow of data from any source, between
processing steps, and into any final repository.
Massive data volumes and enterprise IT transformation will require that future data
storage will be based on HDFS. Pivotal HD is a distribution of Hadoop based on Open
Data Platform (ODP) core that is targeted for analytical use cases. Pivotal HD provides a
scale-out flexible data management framework that can handle any data type. Pivotal HD
can work with any big data ecosystem applications or tools that support ODP-based
Hadoop distributions.
PIVOTAL BIG DATA SUITE
COMPONENTS OF BIG DATA SUITE
•	 Pivotal HD - ODP core-based Hadoop
distribution targeting SQL and
advanced analytics
•	 Pivotal Greenplum Database®
-
Leading analytical massively-parallel
processing data warehouse
•	 Pivotal HAWQ®
- Highly scalable ANSI-
compliant SQL on Hadoop analytic
query engine
•	 Pivotal GemFire®
- High-performing
distributed in-memory NoSQL
database
•	 Spring XD - Distributed data pipeline
data ingestion, stream processing
and orchestration
•	 Redis - Leading scalable key-value
store and data structure server
•	 RabbitMQ™
- Leading scalable open
source reliable message queue for
applications
•	 Pivotal Big Data Suite on Pivotal
Cloud Foundry - Big Data Suite
components exposed as data
services in Pivotal Cloud Foundry
•	 Pivotal Cloud Foundry Ops Manager -
deployment and management of
Cloud Foundry PaaS
Figure 1. Pivotal Big Data Suite is the advanced analytics and in-memory processing stack for
data-driven enterprises.
pivotal.io
PIVOTAL HANDOUT
In-memory computing, where entire data sets reside in memory, are future state of
the art for analytics and processing. Pivotal HD includes the powerful Spark stack for
in-memory distributed data processing.
IT infrastructures are migrating to open cloud platforms. To help customers make this
transition, an instance of Pivotal Cloud Foundry Ops Manager is provided to automate
deployment of Big Data Suite components and help Cloud Foundry applications leverage
Big Data Suite capabilities as services. This delivers a complete agile data stack, in a single
subscription offering.
Modernizing data infrastructure allows customers to implement a business data lake.
Data from any source can be ingested in any format, whether as batch files or at
real-time streaming velocity. Now customers have additional flexibility for performing
large scale ETL such as processing a data stream before storage. It becomes practical to
run SQL queries on very large data sets at interactive speed.
DISCOVER MORE INSIGHTS WITH ADVANCED ANALYTICS
Massively Parallel Processing on Large Data Sets
A key capability of data-driven enterprises is their ability to leverage data science and
advanced analytics. For advanced analytics, Pivotal Big Data Suite includes two massively
scalable SQL engines: HAWQ and Pivotal Greenplum Database. HAWQ is the most
advanced SQL on Hadoop engine in the industry. It provides interactive and complex query
processing on very large data leveraging compute resources directly in Hadoop nodes.
Pivotal Greenplum Database is the leading analytical data warehouse with a shared-nothing
scale-out architecture, fast data loading, and enterprise-grade reliability, administration,
and advanced security capabilities. Both HAWQ and Greenplum Database share the cost-
based Pivotal Query Optimizer technology which dramatically speeds up execution of
complex joins. Both engines provide massively parallel execution of powerful open source
data science libraries such as MADlib.
HAWQ will run in any ODP-based distribution of Hadoop and tightly integrates with
management tools within Hadoop such as Ambari, YARN, and HCatalog. Pivotal Greenplum
database provides import and export integration with most leading Hadoop distributions.
By deploying an advanced analytics platform, customers can apply data science to discover
new insights for solving business problems. Data scientists can run complex queries at
breakthrough speed on petabyte-scale data sets, and access powerful predictive analytics
and machine learning capabilities based on SQL.
PIVOTAL BIG DATA SUITE
pivotal.io
PIVOTAL HANDOUT
BUILD ANALYTIC APPLICATIONS AT SCALE
Scale-Out Apps with Elastic, Distributed In-memory Data Stores
Data-driven enterprises are able to take insights they glean from their data and
operationalize them through massively scaled analytic-driven applications.
Pivotal Big Data Suite provides key building blocks for rapid development and deployment
of high scale data-centric applications. These include Big Data Suite on Pivotal Cloud
Foundry, Pivotal GemFire, Redis, and RabbitMQ.
Big data application development teams can radically speed time to market by leveraging
Pivotal Cloud Foundry as their development and deployment environment. All components
of Big Data Suite can be accessed as services within Pivotal Cloud Foundry, and Big
Data Suite includes an instance of Pivotal Cloud Foundry Ops Manager to automate
this deployment.
Pivotal GemFire is a distributed, in-memory NoSQL database. This enables enterprises
to build scaled-out, highly available transactional systems with sub-second latency
requirements. GemFire-powered applications can process many simultaneous operations
and maintain sub-second response time at linear scale. Examples of such applications
include large scale ticketing or financial trading applications.
The large volumes of historical data typically generated by these kinds of applications
can be archived into traditional RDBMS or pipelined to the analytical components
within Big Data Suite using Spring XD.
Big Data Suite also provides support for Redis and RabbitMQ either as services within
Pivotal Cloud Foundry, or as part of a stand alone application stack.
With Pivotal Big Data Suite, data-driven companies can rapidly turn their insights into
action and deploy high scale analytic applications.. Such applications can support
mobile customer experiences, mass market transactions, and global Internet of
Things networks leading to new revenue opportunities and competitive advantages.
PIVOTAL BIG DATA SUITE
PLEASE KEEP CONFIDENTIAL
pivotal.io
PIVOTAL HANDOUT
SUMMARY
Big Data Suite is Agile, Cloud-Ready, and Open
AGILE: With Pivotal Big Data Suite, companies can become agile with their data
modernizing their data infrastructure, gaining insights with fast advanced analytic queries,
and quickly making it operational with rapid app development. Big Data Suite is a flexible
core-based subscription that can be allocated and reallocated between all components.
CLOUD-READY: Big Data Suite components can be deployed on commodity
hardware, pre-certified appliances, virtualized and private cloud instances, hybrid cloud
configurations and in public clouds. In virtualized and cloud environments, vCPUs count
the same as physical CPU cores for the subscription license.
OPEN: Pivotal Big Data Suite is based on open source software, including the Open Data
Platform core. This enables Big Data Suite to be leveraged with other ODP-compatible
Hadoop distributions, and allows customers to co-innovate on advancing this technology
through software foundations and open source communities.
Companies seeking to transform into data-driven enterprises have all the tools they need
with Pivotal Big Data Suite. Find out more at Pivotal.io/BigData.
Pivotal®
Big Data Suite, Pivotal Cloud Foundry®
, Pivotal Greenplum®
DataBase, Pivotal®
HD, HAWQ®
. Pivotal GemFire®
, Pivotal GemFire®
and Pivotal RabbitMQ®
are trademarks and/or registered trademark of Pivotal Software, Inc. in the United States and other Countries. All
other trademarks used herein are the property of their respective owners. © Copyright 2015 Pivotal Software, Inc. All rights reserved.
Published in the USA. PVTL-DS-03/15
Pivotal offers a modern approach to technology that organizations need to thrive in a new era of business innovation. Our
solutions intersect cloud, big data and agile development, creating a framework that increases data leverage, accelerates
application delivery, and decreases costs, while providing enterprises the speed and scale they need to compete.
Pivotal 3495 Deer Creek Road Palo Alto, CA 94304 pivotal.io

Contenu connexe

Tendances

A-B-C Strategies for File and Content Brochure
A-B-C Strategies for File and Content BrochureA-B-C Strategies for File and Content Brochure
A-B-C Strategies for File and Content BrochureHitachi Vantara
 
The Future of Data Warehousing: ETL Will Never be the Same
The Future of Data Warehousing: ETL Will Never be the SameThe Future of Data Warehousing: ETL Will Never be the Same
The Future of Data Warehousing: ETL Will Never be the SameCloudera, Inc.
 
Designing Fast Data Architecture for Big Data using Logical Data Warehouse a...
Designing Fast Data Architecture for Big Data  using Logical Data Warehouse a...Designing Fast Data Architecture for Big Data  using Logical Data Warehouse a...
Designing Fast Data Architecture for Big Data using Logical Data Warehouse a...Denodo
 
Making Enterprise Big Data Small with Ease
Making Enterprise Big Data Small with EaseMaking Enterprise Big Data Small with Ease
Making Enterprise Big Data Small with EaseHortonworks
 
Oil & Gas Big Data use cases
Oil & Gas Big Data use casesOil & Gas Big Data use cases
Oil & Gas Big Data use caseselephantscale
 
Fast Data Strategy Houston Roadshow Presentation
Fast Data Strategy Houston Roadshow PresentationFast Data Strategy Houston Roadshow Presentation
Fast Data Strategy Houston Roadshow PresentationDenodo
 
Consumption based analytics enabled by Data Virtualization
Consumption based analytics enabled by Data VirtualizationConsumption based analytics enabled by Data Virtualization
Consumption based analytics enabled by Data VirtualizationDenodo
 
Hitachi Unified Storage 100 Family: Unify Without Compromise -- Datasheet
Hitachi Unified Storage 100 Family: Unify Without Compromise -- DatasheetHitachi Unified Storage 100 Family: Unify Without Compromise -- Datasheet
Hitachi Unified Storage 100 Family: Unify Without Compromise -- DatasheetHitachi Vantara
 
Data warehouse-optimization-with-hadoop-informatica-cloudera
Data warehouse-optimization-with-hadoop-informatica-clouderaData warehouse-optimization-with-hadoop-informatica-cloudera
Data warehouse-optimization-with-hadoop-informatica-clouderaJyrki Määttä
 
Data Services and the Modern Data Ecosystem (ASEAN)
Data Services and the Modern Data Ecosystem (ASEAN)Data Services and the Modern Data Ecosystem (ASEAN)
Data Services and the Modern Data Ecosystem (ASEAN)Denodo
 
Hadoop and Your Data Warehouse
Hadoop and Your Data WarehouseHadoop and Your Data Warehouse
Hadoop and Your Data WarehouseCaserta
 
Hitachi Cloud Solutions Profile
Hitachi Cloud Solutions Profile Hitachi Cloud Solutions Profile
Hitachi Cloud Solutions Profile Hitachi Vantara
 
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationAccelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationDenodo
 
Preparing for next-generation cloud: Lessons learned and insights shared
Preparing for next-generation cloud: Lessons learned and insights sharedPreparing for next-generation cloud: Lessons learned and insights shared
Preparing for next-generation cloud: Lessons learned and insights sharedThe Economist Media Businesses
 
Extending Data Lake using the Lambda Architecture June 2015
Extending Data Lake using the Lambda Architecture June 2015Extending Data Lake using the Lambda Architecture June 2015
Extending Data Lake using the Lambda Architecture June 2015DataWorks Summit
 
Pervasive analytics through data & analytic centricity
Pervasive analytics through data & analytic centricityPervasive analytics through data & analytic centricity
Pervasive analytics through data & analytic centricityCloudera, Inc.
 
Delivering Self-Service Analytics using Big Data and Data Virtualization on t...
Delivering Self-Service Analytics using Big Data and Data Virtualization on t...Delivering Self-Service Analytics using Big Data and Data Virtualization on t...
Delivering Self-Service Analytics using Big Data and Data Virtualization on t...Denodo
 
Data Integration with MapR | Diyotta India
Data Integration with MapR | Diyotta IndiaData Integration with MapR | Diyotta India
Data Integration with MapR | Diyotta Indiadiyotta
 
Data Science Operationalization: The Journey of Enterprise AI
Data Science Operationalization: The Journey of Enterprise AIData Science Operationalization: The Journey of Enterprise AI
Data Science Operationalization: The Journey of Enterprise AIDenodo
 

Tendances (20)

Capgemini Insights and Data
Capgemini Insights and Data Capgemini Insights and Data
Capgemini Insights and Data
 
A-B-C Strategies for File and Content Brochure
A-B-C Strategies for File and Content BrochureA-B-C Strategies for File and Content Brochure
A-B-C Strategies for File and Content Brochure
 
The Future of Data Warehousing: ETL Will Never be the Same
The Future of Data Warehousing: ETL Will Never be the SameThe Future of Data Warehousing: ETL Will Never be the Same
The Future of Data Warehousing: ETL Will Never be the Same
 
Designing Fast Data Architecture for Big Data using Logical Data Warehouse a...
Designing Fast Data Architecture for Big Data  using Logical Data Warehouse a...Designing Fast Data Architecture for Big Data  using Logical Data Warehouse a...
Designing Fast Data Architecture for Big Data using Logical Data Warehouse a...
 
Making Enterprise Big Data Small with Ease
Making Enterprise Big Data Small with EaseMaking Enterprise Big Data Small with Ease
Making Enterprise Big Data Small with Ease
 
Oil & Gas Big Data use cases
Oil & Gas Big Data use casesOil & Gas Big Data use cases
Oil & Gas Big Data use cases
 
Fast Data Strategy Houston Roadshow Presentation
Fast Data Strategy Houston Roadshow PresentationFast Data Strategy Houston Roadshow Presentation
Fast Data Strategy Houston Roadshow Presentation
 
Consumption based analytics enabled by Data Virtualization
Consumption based analytics enabled by Data VirtualizationConsumption based analytics enabled by Data Virtualization
Consumption based analytics enabled by Data Virtualization
 
Hitachi Unified Storage 100 Family: Unify Without Compromise -- Datasheet
Hitachi Unified Storage 100 Family: Unify Without Compromise -- DatasheetHitachi Unified Storage 100 Family: Unify Without Compromise -- Datasheet
Hitachi Unified Storage 100 Family: Unify Without Compromise -- Datasheet
 
Data warehouse-optimization-with-hadoop-informatica-cloudera
Data warehouse-optimization-with-hadoop-informatica-clouderaData warehouse-optimization-with-hadoop-informatica-cloudera
Data warehouse-optimization-with-hadoop-informatica-cloudera
 
Data Services and the Modern Data Ecosystem (ASEAN)
Data Services and the Modern Data Ecosystem (ASEAN)Data Services and the Modern Data Ecosystem (ASEAN)
Data Services and the Modern Data Ecosystem (ASEAN)
 
Hadoop and Your Data Warehouse
Hadoop and Your Data WarehouseHadoop and Your Data Warehouse
Hadoop and Your Data Warehouse
 
Hitachi Cloud Solutions Profile
Hitachi Cloud Solutions Profile Hitachi Cloud Solutions Profile
Hitachi Cloud Solutions Profile
 
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationAccelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and Visualization
 
Preparing for next-generation cloud: Lessons learned and insights shared
Preparing for next-generation cloud: Lessons learned and insights sharedPreparing for next-generation cloud: Lessons learned and insights shared
Preparing for next-generation cloud: Lessons learned and insights shared
 
Extending Data Lake using the Lambda Architecture June 2015
Extending Data Lake using the Lambda Architecture June 2015Extending Data Lake using the Lambda Architecture June 2015
Extending Data Lake using the Lambda Architecture June 2015
 
Pervasive analytics through data & analytic centricity
Pervasive analytics through data & analytic centricityPervasive analytics through data & analytic centricity
Pervasive analytics through data & analytic centricity
 
Delivering Self-Service Analytics using Big Data and Data Virtualization on t...
Delivering Self-Service Analytics using Big Data and Data Virtualization on t...Delivering Self-Service Analytics using Big Data and Data Virtualization on t...
Delivering Self-Service Analytics using Big Data and Data Virtualization on t...
 
Data Integration with MapR | Diyotta India
Data Integration with MapR | Diyotta IndiaData Integration with MapR | Diyotta India
Data Integration with MapR | Diyotta India
 
Data Science Operationalization: The Journey of Enterprise AI
Data Science Operationalization: The Journey of Enterprise AIData Science Operationalization: The Journey of Enterprise AI
Data Science Operationalization: The Journey of Enterprise AI
 

Similaire à ds_Pivotal_Big_Data_Suite_Product_Suite

Big Data Companies and Apache Software
Big Data Companies and Apache SoftwareBig Data Companies and Apache Software
Big Data Companies and Apache SoftwareBob Marcus
 
Hd insight overview
Hd insight overviewHd insight overview
Hd insight overviewvhrocca
 
Traditional data word
Traditional data wordTraditional data word
Traditional data wordorcoxsm
 
Open Source DWBI-A Primer
Open Source DWBI-A PrimerOpen Source DWBI-A Primer
Open Source DWBI-A Primerpartha69
 
Capgemini Data Warehouse Optimization Using Hadoop
Capgemini Data Warehouse Optimization Using HadoopCapgemini Data Warehouse Optimization Using Hadoop
Capgemini Data Warehouse Optimization Using HadoopAppfluent Technology
 
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRoberts
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRobertsWP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRoberts
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRobertsJane Roberts
 
Best Bigquery ETL Tool
Best Bigquery ETL ToolBest Bigquery ETL Tool
Best Bigquery ETL ToolLyftron Data
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An IntroductionDenodo
 
Get Started Quickly with IBM's Hadoop as a Service
Get Started Quickly with IBM's Hadoop as a ServiceGet Started Quickly with IBM's Hadoop as a Service
Get Started Quickly with IBM's Hadoop as a ServiceIBM Cloud Data Services
 
Building a Big Data Analytics Platform- Impetus White Paper
Building a Big Data Analytics Platform- Impetus White PaperBuilding a Big Data Analytics Platform- Impetus White Paper
Building a Big Data Analytics Platform- Impetus White PaperImpetus Technologies
 
EMC Pivotal overview deck
EMC Pivotal overview deckEMC Pivotal overview deck
EMC Pivotal overview deckmister_moun
 
Accelerate and Scale Big Data Analytics and Machine Learning Pipelines with D...
Accelerate and Scale Big Data Analytics and Machine Learning Pipelines with D...Accelerate and Scale Big Data Analytics and Machine Learning Pipelines with D...
Accelerate and Scale Big Data Analytics and Machine Learning Pipelines with D...Alluxio, Inc.
 
Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...
Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...
Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...Denodo
 
Big Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential ToolsBig Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential ToolsFredReynolds2
 
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization Denodo
 
Bridging the Last Mile: Getting Data to the People Who Need It
Bridging the Last Mile: Getting Data to the People Who Need ItBridging the Last Mile: Getting Data to the People Who Need It
Bridging the Last Mile: Getting Data to the People Who Need ItDenodo
 
Big data an elephant business opportunities
Big data an elephant   business opportunitiesBig data an elephant   business opportunities
Big data an elephant business opportunitiesBigdata Meetup Kochi
 
Cisco Big Data Warehouse Expansion Featuring MapR Distribution
Cisco Big Data Warehouse Expansion Featuring MapR DistributionCisco Big Data Warehouse Expansion Featuring MapR Distribution
Cisco Big Data Warehouse Expansion Featuring MapR DistributionAppfluent Technology
 

Similaire à ds_Pivotal_Big_Data_Suite_Product_Suite (20)

Big Data Companies and Apache Software
Big Data Companies and Apache SoftwareBig Data Companies and Apache Software
Big Data Companies and Apache Software
 
Hd insight overview
Hd insight overviewHd insight overview
Hd insight overview
 
Traditional data word
Traditional data wordTraditional data word
Traditional data word
 
Open Source DWBI-A Primer
Open Source DWBI-A PrimerOpen Source DWBI-A Primer
Open Source DWBI-A Primer
 
Capgemini Data Warehouse Optimization Using Hadoop
Capgemini Data Warehouse Optimization Using HadoopCapgemini Data Warehouse Optimization Using Hadoop
Capgemini Data Warehouse Optimization Using Hadoop
 
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRoberts
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRobertsWP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRoberts
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRoberts
 
Best Bigquery ETL Tool
Best Bigquery ETL ToolBest Bigquery ETL Tool
Best Bigquery ETL Tool
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An Introduction
 
Get Started Quickly with IBM's Hadoop as a Service
Get Started Quickly with IBM's Hadoop as a ServiceGet Started Quickly with IBM's Hadoop as a Service
Get Started Quickly with IBM's Hadoop as a Service
 
Hadoop in the Cloud
Hadoop in the CloudHadoop in the Cloud
Hadoop in the Cloud
 
Building a Big Data Analytics Platform- Impetus White Paper
Building a Big Data Analytics Platform- Impetus White PaperBuilding a Big Data Analytics Platform- Impetus White Paper
Building a Big Data Analytics Platform- Impetus White Paper
 
EMC Pivotal overview deck
EMC Pivotal overview deckEMC Pivotal overview deck
EMC Pivotal overview deck
 
Accelerate and Scale Big Data Analytics and Machine Learning Pipelines with D...
Accelerate and Scale Big Data Analytics and Machine Learning Pipelines with D...Accelerate and Scale Big Data Analytics and Machine Learning Pipelines with D...
Accelerate and Scale Big Data Analytics and Machine Learning Pipelines with D...
 
Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...
Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...
Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...
 
Big Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential ToolsBig Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential Tools
 
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
 
Bridging the Last Mile: Getting Data to the People Who Need It
Bridging the Last Mile: Getting Data to the People Who Need ItBridging the Last Mile: Getting Data to the People Who Need It
Bridging the Last Mile: Getting Data to the People Who Need It
 
Big data an elephant business opportunities
Big data an elephant   business opportunitiesBig data an elephant   business opportunities
Big data an elephant business opportunities
 
Cisco Big Data Warehouse Expansion Featuring MapR Distribution
Cisco Big Data Warehouse Expansion Featuring MapR DistributionCisco Big Data Warehouse Expansion Featuring MapR Distribution
Cisco Big Data Warehouse Expansion Featuring MapR Distribution
 
4AA6-4492ENW
4AA6-4492ENW4AA6-4492ENW
4AA6-4492ENW
 

ds_Pivotal_Big_Data_Suite_Product_Suite

  • 1. pivotal.io PIVOTAL HANDOUT Pivotal Big Data Suite Product Suite COMPLETE PLATFORM FOR DATA-DRIVEN ENTERPRISES Many industry stalwarts have found their traditional business models under threat by a new generation of fast growing competitors that leverage big data and analytics. These new companies are transforming and redefining markets by creating innovative customer experiences with intelligent, customer-centered applications. Powering these applications are significant advances made in data processing and analytics with technologies such as scale-out processing, machine learning, and in-memory computation. These advances leverage hardware trends such as cloud computing, convergence of storage and compute resources, and rapidly increasing RAM per system. Collectively known as big data and advanced analytics, these technologies are developed within open source communities. Pivotal Software is a leading contributor to many big data and analytics open source software projects, and is dedicated to driving innovation in the open source ecosystem. To help companies adopt big data and analytics and create data-driven business models, Pivotal has rolled these open source technologies into a comprehensive platform called Pivotal Big Data Suite, as depicted in Figure 1. Big Data Suite allows companies to modernize their data infrastructure, discover more insights with advanced analytics, and build analytic applications at scale. KEY ADVANTAGES • Quickly deploy and manage an analytics-optimized business data lake based on Hadoop • Discover more insights using advanced analytics with SQL on Hadoop or an analytics data warehouse • Innovate at scale with smart, predictive applications backed by distributed in-memory data stores FEATURES OF BIG DATA SUITE • Comprehensive offering covering data processing & storage, advanced analytics, in-memory data processing & messaging • Works with Pivotal Cloud Foundry – deploy with Ops Manager, consume as services within Pivotal Cloud Foundry apps • Compatible with Open Data Platform (ODP) core based distributions of Hadoop • Based on open source • Processing core-based subscription license for 1 to 3 years • Flexible licensing – reallocate licensed core capacity between components depending on need • Multiple deployment options: commodity hardware, appliance, virtualized, cloud and hybrid cloud Overview
  • 2. pivotal.io PIVOTAL HANDOUT MODERNIZE DATA INFRASTRUCTURE Store and Process Any Size and Type of Data A first step for many companies in becoming a data-driven enterprise is to deploy a modern data infrastructure for storage and data processing based on Hadoop. Pivotal Big Data Suite helps companies with this transformation at the data processing layer by including Spring XD, Pivotal HD, and Cloud Foundry Operations Manager. In an agile infrastructure, data scientists and architects need a rapid, scalable way to develop specific data flows for ingestion and processing. Spring XD helps customers quickly create data pipelines to orchestrate the flow of data from any source, between processing steps, and into any final repository. Massive data volumes and enterprise IT transformation will require that future data storage will be based on HDFS. Pivotal HD is a distribution of Hadoop based on Open Data Platform (ODP) core that is targeted for analytical use cases. Pivotal HD provides a scale-out flexible data management framework that can handle any data type. Pivotal HD can work with any big data ecosystem applications or tools that support ODP-based Hadoop distributions. PIVOTAL BIG DATA SUITE COMPONENTS OF BIG DATA SUITE • Pivotal HD - ODP core-based Hadoop distribution targeting SQL and advanced analytics • Pivotal Greenplum Database® - Leading analytical massively-parallel processing data warehouse • Pivotal HAWQ® - Highly scalable ANSI- compliant SQL on Hadoop analytic query engine • Pivotal GemFire® - High-performing distributed in-memory NoSQL database • Spring XD - Distributed data pipeline data ingestion, stream processing and orchestration • Redis - Leading scalable key-value store and data structure server • RabbitMQ™ - Leading scalable open source reliable message queue for applications • Pivotal Big Data Suite on Pivotal Cloud Foundry - Big Data Suite components exposed as data services in Pivotal Cloud Foundry • Pivotal Cloud Foundry Ops Manager - deployment and management of Cloud Foundry PaaS Figure 1. Pivotal Big Data Suite is the advanced analytics and in-memory processing stack for data-driven enterprises.
  • 3. pivotal.io PIVOTAL HANDOUT In-memory computing, where entire data sets reside in memory, are future state of the art for analytics and processing. Pivotal HD includes the powerful Spark stack for in-memory distributed data processing. IT infrastructures are migrating to open cloud platforms. To help customers make this transition, an instance of Pivotal Cloud Foundry Ops Manager is provided to automate deployment of Big Data Suite components and help Cloud Foundry applications leverage Big Data Suite capabilities as services. This delivers a complete agile data stack, in a single subscription offering. Modernizing data infrastructure allows customers to implement a business data lake. Data from any source can be ingested in any format, whether as batch files or at real-time streaming velocity. Now customers have additional flexibility for performing large scale ETL such as processing a data stream before storage. It becomes practical to run SQL queries on very large data sets at interactive speed. DISCOVER MORE INSIGHTS WITH ADVANCED ANALYTICS Massively Parallel Processing on Large Data Sets A key capability of data-driven enterprises is their ability to leverage data science and advanced analytics. For advanced analytics, Pivotal Big Data Suite includes two massively scalable SQL engines: HAWQ and Pivotal Greenplum Database. HAWQ is the most advanced SQL on Hadoop engine in the industry. It provides interactive and complex query processing on very large data leveraging compute resources directly in Hadoop nodes. Pivotal Greenplum Database is the leading analytical data warehouse with a shared-nothing scale-out architecture, fast data loading, and enterprise-grade reliability, administration, and advanced security capabilities. Both HAWQ and Greenplum Database share the cost- based Pivotal Query Optimizer technology which dramatically speeds up execution of complex joins. Both engines provide massively parallel execution of powerful open source data science libraries such as MADlib. HAWQ will run in any ODP-based distribution of Hadoop and tightly integrates with management tools within Hadoop such as Ambari, YARN, and HCatalog. Pivotal Greenplum database provides import and export integration with most leading Hadoop distributions. By deploying an advanced analytics platform, customers can apply data science to discover new insights for solving business problems. Data scientists can run complex queries at breakthrough speed on petabyte-scale data sets, and access powerful predictive analytics and machine learning capabilities based on SQL. PIVOTAL BIG DATA SUITE
  • 4. pivotal.io PIVOTAL HANDOUT BUILD ANALYTIC APPLICATIONS AT SCALE Scale-Out Apps with Elastic, Distributed In-memory Data Stores Data-driven enterprises are able to take insights they glean from their data and operationalize them through massively scaled analytic-driven applications. Pivotal Big Data Suite provides key building blocks for rapid development and deployment of high scale data-centric applications. These include Big Data Suite on Pivotal Cloud Foundry, Pivotal GemFire, Redis, and RabbitMQ. Big data application development teams can radically speed time to market by leveraging Pivotal Cloud Foundry as their development and deployment environment. All components of Big Data Suite can be accessed as services within Pivotal Cloud Foundry, and Big Data Suite includes an instance of Pivotal Cloud Foundry Ops Manager to automate this deployment. Pivotal GemFire is a distributed, in-memory NoSQL database. This enables enterprises to build scaled-out, highly available transactional systems with sub-second latency requirements. GemFire-powered applications can process many simultaneous operations and maintain sub-second response time at linear scale. Examples of such applications include large scale ticketing or financial trading applications. The large volumes of historical data typically generated by these kinds of applications can be archived into traditional RDBMS or pipelined to the analytical components within Big Data Suite using Spring XD. Big Data Suite also provides support for Redis and RabbitMQ either as services within Pivotal Cloud Foundry, or as part of a stand alone application stack. With Pivotal Big Data Suite, data-driven companies can rapidly turn their insights into action and deploy high scale analytic applications.. Such applications can support mobile customer experiences, mass market transactions, and global Internet of Things networks leading to new revenue opportunities and competitive advantages. PIVOTAL BIG DATA SUITE
  • 5. PLEASE KEEP CONFIDENTIAL pivotal.io PIVOTAL HANDOUT SUMMARY Big Data Suite is Agile, Cloud-Ready, and Open AGILE: With Pivotal Big Data Suite, companies can become agile with their data modernizing their data infrastructure, gaining insights with fast advanced analytic queries, and quickly making it operational with rapid app development. Big Data Suite is a flexible core-based subscription that can be allocated and reallocated between all components. CLOUD-READY: Big Data Suite components can be deployed on commodity hardware, pre-certified appliances, virtualized and private cloud instances, hybrid cloud configurations and in public clouds. In virtualized and cloud environments, vCPUs count the same as physical CPU cores for the subscription license. OPEN: Pivotal Big Data Suite is based on open source software, including the Open Data Platform core. This enables Big Data Suite to be leveraged with other ODP-compatible Hadoop distributions, and allows customers to co-innovate on advancing this technology through software foundations and open source communities. Companies seeking to transform into data-driven enterprises have all the tools they need with Pivotal Big Data Suite. Find out more at Pivotal.io/BigData. Pivotal® Big Data Suite, Pivotal Cloud Foundry® , Pivotal Greenplum® DataBase, Pivotal® HD, HAWQ® . Pivotal GemFire® , Pivotal GemFire® and Pivotal RabbitMQ® are trademarks and/or registered trademark of Pivotal Software, Inc. in the United States and other Countries. All other trademarks used herein are the property of their respective owners. © Copyright 2015 Pivotal Software, Inc. All rights reserved. Published in the USA. PVTL-DS-03/15 Pivotal offers a modern approach to technology that organizations need to thrive in a new era of business innovation. Our solutions intersect cloud, big data and agile development, creating a framework that increases data leverage, accelerates application delivery, and decreases costs, while providing enterprises the speed and scale they need to compete. Pivotal 3495 Deer Creek Road Palo Alto, CA 94304 pivotal.io