Hadoop Reporting and Analysis - Jaspersoft

Hadoop Reporting & Analysis
What Architecture is Best for Me?

©2013 Jaspersoft Corporation.
Presenters
Jim Walker
Director Product Marketing, Hortonworks
Twenty years of experience building products and bringing
them to market. His expertise includes data loss
prevention, master data management and now big data.
Ben Connors
Worldwide Head of Alliances, Jaspersoft
Prior to Jaspersoft, Ben was at HP, Oracle, Viador, and
other BI companies. He has over 20 years of experience
in databases and business intelligence.
Matt Dahlman
Technical Director of Alliances, Jaspersoft
Prior to Jaspersoft, Matt was with Oracle, Netonomy, and
Sybase. He brings over 15 years of database and business
intelligence experience to his role.

Agenda
• Hadoop in the Modern Data Architecture
• Hadoop Usage Patterns
• Jaspersoft
  • Company
  • BI Suite
• Jaspersoft/Hortonworks Integration
• Demo
• The Future of Interactive Hadoop
• Q&A

© Hortonworks Inc. 2013
A Brief History of Apache Hadoop
Focus on INNOVATION
2005: Yahoo! creates team under E14 to work on Hadoop
Focus on OPERATIONS
2008: Yahoo! team extends focus to operations to support multiple projects & growing clusters
STABILITY
2011: Hortonworks created to focus on “Enterprise Hadoop”. Starts with 24 key Hadoop engineers from Yahoo!
[Timeline figure, 2004 to 2013. Milestone labels: Apache Project Established; Yahoo! begins to operate at scale; Enterprise Hadoop; Hortonworks Data Platform]

Existing Data Architecture
[Architecture diagram. Labels by layer:]
APPLICATIONS: Business Analytics, Custom Applications, Enterprise Applications
DATA SYSTEMS: TRADITIONAL REPOS (RDBMS, EDW, MPP)
DATA SOURCES: Traditional Sources (RDBMS, OLTP, OLAP); OLTP, POS SYSTEMS
OPERATIONAL TOOLS: MANAGE & MONITOR
DEV & DATA TOOLS: BUILD & TEST

An Emerging Data Architecture
[Architecture diagram. Labels by layer:]
APPLICATIONS: Business Analytics, Custom Applications, Enterprise Applications
DATA SYSTEMS: TRADITIONAL REPOS (RDBMS, EDW, MPP); HORTONWORKS DATA PLATFORM
DATA SOURCES: Traditional Sources (RDBMS, OLTP, OLAP); New Sources (web logs, email, sensor data, social media); MOBILE DATA; OLTP, POS SYSTEMS
OPERATIONAL TOOLS: MANAGE & MONITOR
DEV & DATA TOOLS: BUILD & TEST

Interoperating With Your Tools
[Architecture diagram. Labels:]
apps
DATA SYSTEMS: TRADITIONAL REPOS; HORTONWORKS DATA PLATFORM
DATA SOURCES: Traditional Sources (RDBMS, OLTP, OLAP); New Sources; MOBILE DATA; OLTP, POS SYSTEMS
OPERATIONAL TOOLS: MANAGE & MONITOR
DEV & DATA TOOLS: BUILD & TEST

HDP: Enterprise Hadoop Distribution
Hortonworks Data Platform (HDP)
Enterprise Hadoop
• The ONLY 100% open source and complete distribution
• Enterprise grade, proven and tested at scale
• Ecosystem endorsed to ensure interoperability
[Diagram: HORTONWORKS DATA PLATFORM (HDP). Labels:]
HADOOP CORE: Distributed Storage & Processing
PLATFORM SERVICES: Enterprise Readiness: HA, DR, Snapshots, Security, …
DATA SERVICES: Store, Process and Access Data
OPERATIONAL SERVICES: Manage & Operate at Scale
Components: HDFS, YARN (in 2.0), WEBHDFS, MAP REDUCE, HCATALOG, HIVE, PIG, HBASE, SQOOP, FLUME, OOZIE, AMBARI
Deployment: OS, Cloud, VM, Appliance

Operational Data Refinery
Refine | Explore | Enrich
Collect data and apply a known algorithm to it in trusted operational process
1 Capture: Capture all data
2 Process: Parse, cleanse, apply structure & transform
3 Exchange: Push to existing data warehouse for use with existing analytic tools
[Diagram labels:]
DATA SOURCES: Traditional Sources (RDBMS, OLTP, OLAP); New Sources
DATA SYSTEMS: TRADITIONAL REPOS (RDBMS, EDW, MPP); HORTONWORKS DATA PLATFORM
APPLICATIONS: Business Analytics, Custom Applications, Enterprise Applications
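
The capture, process and exchange steps on the Operational Data Refinery slide can be illustrated with a small, single-process Python sketch. Everything here is an assumption made for illustration: the log format, the field names and the three function names are hypothetical, and a real refinery would run these steps on the Hadoop cluster (for example with Pig or Hive) rather than in one script.

```python
import csv
import io

# Hypothetical raw web-log lines standing in for "capture all data".
RAW_LINES = [
    "2013-05-01 10:00:01 GET /home 200",
    "garbage line",
    "2013-05-01 10:00:05 GET /cart 500",
    "  2013-05-01 10:00:09 POST /checkout 200  ",
]

FIELDS = ["timestamp", "method", "path", "status"]


def capture(lines):
    """Step 1 (Capture): keep every raw record, untouched."""
    return list(lines)


def process(raw):
    """Step 2 (Process): parse, cleanse, apply structure and transform."""
    rows = []
    for line in raw:
        parts = line.split()
        if len(parts) != 5:
            # Cleanse: drop records that do not match the assumed format.
            continue
        date, time, method, path, status = parts
        rows.append({
            "timestamp": f"{date}T{time}",  # transform: single timestamp field
            "method": method,
            "path": path,
            "status": int(status),          # structure: typed column
        })
    return rows


def exchange(rows):
    """Step 3 (Exchange): emit CSV text that a warehouse loader could ingest."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=FIELDS, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


refined = process(capture(RAW_LINES))
warehouse_csv = exchange(refined)
print(warehouse_csv)
```

The sketch keeps the three steps as separate functions so that the exchange target (here CSV text) could be swapped for a different hand-off without touching the parsing logic.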

Application Enrichment
Refine | Explore | Enrich
Collect data, analyze and present salient results for online apps
1 Capture: Capture all data
2 Process
3 Exchange: Incorporate data directly into applications
[Diagram labels:]
DATA SOURCES: Traditional Sources (RDBMS, OLTP, OLAP); New Sources
DATA SYSTEMS: TRADITIONAL REPOS (RDBMS, EDW, MPP); NOSQL; HORTONWORKS DATA PLATFORM
APPLICATIONS: Custom Applications, Enterprise Applications

Big Data Exploration & Visualization
Refine | Explore | Enrich
Collect data and perform iterative investigation for value
1 Capture: Capture all data
2 Process
3 Exchange: Explore and visualize with analytics tools supporting Hadoop
[Diagram labels:]
DATA SOURCES: Traditional Sources (RDBMS, OLTP, OLAP); New Sources
DATA SYSTEMS: TRADITIONAL REPOS (RDBMS, EDW, MPP); HORTONWORKS DATA PLATFORM
APPLICATIONS: Business Analytics

Competing on Time and Information
“The New Factors of Production: Time and Information”
Brian Gentile, Jaspersoft
But business users
don’t have access to
timely, actionable data
Why?
Most don’t spend their
day inside a BI tool
…nor do they want to!

We Need “Intelligence Inside”
We want information to FIND US, not the other way round
“We need Intelligence Inside
the applications and business
processes we use every day.”
• Pipeline dashboard inside SaaS CRM app
• Performance report inside partner portal
• Salary data visualizations inside HR intranet
• Portfolio analytics inside client website
• Tickets crosstab inside custom helpdesk app
• Interactive charts inside native mobile app

Jaspersoft: The Intelligence Inside
Self-Service BI + Embeddable + Affordable
“We empower millions of people every day to make decisions
faster by delivering timely, actionable data to them inside their apps
and business process through an embeddable, cost-effective
reporting and analytics platform.”

The Intelligence Inside Business
Example Customers
• Commercial Apps
• Customer Portals
• Cloud Apps
• Internal Apps
• Big Data Analytics

The Intelligence Inside the New IT Stack
• Inaugural BI service:
  • On VMware Cloud Foundry
  • On Red Hat OpenShift
• Jaspersoft certified for Amazon Redshift and RDS
• Connects directly (no ETL) to non-SQL stores like MongoDB and HBase
“Our mission is to become the de facto reporting and analytic
service in the New IT Stack, enabling BI Builders to build the
Intelligence Inside internal and commercial apps on the leading
Cloud platforms, powered by the new Big Data stores.”

Jaspersoft: High Growth and Momentum
World’s Most Widely Deployed BI
• Commercial Open Source BI Suite
• Nearly 200 people in US, EMEA, APAC
• 16,000,000 downloads
• 325,000 community members
• 130,000 embedded applications
• 15,000 paying customers
• 1,800 subscription customers
Broad Recognition, Strong Partnerships
50%+ ACV Growth Every Year
Magic Quadrants

Design Any Report . . .

… Dashboard

… or Analytic View

… using Any Data Type
Relational | Big Data | Files
Examples shown: POJO files, Redshift, BigQuery

… bringing Intelligence to Any App

… with a World-Class BI Platform
Reporting, Dashboards, Visualization, OLAP Analysis
Columnar-Based In-Memory Engine
Data Connectivity to Any Data
100% Web Standards: CSS, JS, JSP, Java
Extensive APIs: HTTP, SOAP, REST
HTML5 Browser, Native Mobile Apps
Business Metadata Layer
Data Integration | Data Virtualization | Direct
RDBMS | Hadoop | Other Data

Three Approaches to Big Data Analysis

Approach 1: Data Exploration
Use Case: For data analysts and data scientists who want to discover real-time patterns as they emerge from their Big Data content
Latency: Low
Big Data: HBase, NoSQL, Analytic DBMS
Connectivity: Native
Architecture: BI Platform with In-Memory Engine, Native connection; Multi-Dimensional Analysis

Approach 2: Operational Reporting
Use Case: For executives and operational managers who want summarized, pre-built daily reports on Big Data content
Latency: Medium
Big Data: Hive, NoSQL, Analytic DBMS
Connectivity: Native, SQL
Architecture: BI Platform, Native or SQL connection; Reports & Dashboards

Approach 3: Analytics
Use Case: For data analysts and operational managers who want to analyze historical trends based upon pre-defined questions in their Big Data content
Latency: High
Big Data: Hadoop, NoSQL, Analytic DBMS
Connectivity: ETL
Architecture: BI Platform with OLAP Engine, Data Mart loaded via ETL; Multi-Dimensional Analysis
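
To make the comparison of the three approaches easier to apply, the rows can be encoded as a small lookup structure. The sketch below is only illustrative: the `Approach` class and both helper functions are hypothetical names, and the assumption that a team tolerating a given latency can also use any lower-latency approach is ours, not a claim from the slides.

```python
from dataclasses import dataclass

# Latency levels in the order used on the slide, lowest first.
LATENCY_ORDER = ("Low", "Medium", "High")


@dataclass(frozen=True)
class Approach:
    name: str
    latency: str
    big_data: tuple
    connectivity: str


# Values copied from the "Three Approaches to Big Data Analysis" slide.
APPROACHES = (
    Approach("Data Exploration", "Low",
             ("HBase", "NoSQL", "Analytic DBMS"), "Native"),
    Approach("Operational Reporting", "Medium",
             ("Hive", "NoSQL", "Analytic DBMS"), "Native, SQL"),
    Approach("Analytics", "High",
             ("Hadoop", "NoSQL", "Analytic DBMS"), "ETL"),
)


def approaches_within(max_latency):
    """Names of approaches whose latency does not exceed max_latency."""
    limit = LATENCY_ORDER.index(max_latency)
    return [a.name for a in APPROACHES
            if LATENCY_ORDER.index(a.latency) <= limit]


def approaches_for_store(store):
    """Names of approaches that list the given Big Data store."""
    return [a.name for a in APPROACHES if store in a.big_data]


print(approaches_within("Medium"))
print(approaches_for_store("Hive"))
```

A lookup like this is only a starting point for discussion; the use-case descriptions on the slide remain the better guide to which approach fits a given audience.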

Jaspersoft’s Hadoop Difference
• Advanced Hadoop integration
• Only BI provider that can support 3 approaches to Hadoop analytics
• Live Exploration, Batch Analysis, Batch Reporting
• Direct, native connectors to Hive and HBase
• Broad partnerships
• Deep knowledge and ecosystem

Jaspersoft 5 Demo
“We've taken the
desktop power of data
visualization tools,
built it to scale on the
HTML5 web, and
made it embeddable
within any app, device
or portal”

Hortonworks Snapshot
• We distribute the only 100%
Open Source Enterprise
Hadoop Distribution:
Hortonworks Data
Platform
• We engineer, test & certify
HDP for enterprise usage
• We employ the core
architects, builders and
operators of Apache Hadoop
• We drive innovation within
Apache Software
Foundation projects
• We are uniquely positioned
to deliver the highest quality
of Hadoop support
• We enable the ecosystem to
work better with Hadoop
Develop Distribute Support
We develop, distribute and support
the ONLY 100% open source
Enterprise Hadoop distribution
Endorsed by Strategic Partners
Headquarters: Palo Alto, CA
Employees: 180+ and growing
Investors: Benchmark, Index, Yahoo

Hortonworks Approach
Identify and introduce enterprise
requirements into the public domain
Work with the community to advance and
incubate open source projects
Apply Enterprise Rigor to provide the most
stable and reliable distribution
Community Driven Enterprise Apache Hadoop

The Intelligence Inside
Thank You
www.jaspersoft.com
BigData@jaspersoft.com
