SlideShare une entreprise Scribd logo
1  sur  25
Télécharger pour lire hors ligne
Verizon & Big Data:
Getting More from CDR Data
© 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555
The program will begin shortly.
Please listen to the webinar with your computer speakers turned on.
John Michael Brack, Manager - Northeast Region, Pentaho
Bo Borland, Head of System Engineers, Pentaho
Agenda
•  Project Profile
•  Pentaho Overview
•  Big Wireless Analytics
•  Call Detail Records (CDR) dashboard and analysis
•  Retail Sales Reporting, IR, Mobile and Analysis
•  DW Optimization
•  Rescuing CDR data from tape archive with Hadoop
•  Pentaho for Hadoop (ingestion, map reduce, orchestration, analysis)
2© 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555
Verizon Project Profile
Audience:
Business, Executives, Management, IT, Architects
Company:
Verizon Communications Inc. was founded in 1983 and is based in New York, New York and
has over 180,000 associates and 115 B in revenue. Verizon Communications Inc., through its
subsidiaries, provides communications, information and entertainment products and services to
consumers, businesses, and governmental agencies worldwide. Its Verizon Wireless segment
offers access to various wireless voice and data services comprising Internet access through
smart phones and basic phones, and notebook computers and tablets; messaging services,
which enable customers to send and receive text, picture, and video messages; and consumer-
focused and business-focused multimedia applications. .
Goals:
•  Self Service Business Analytics
•  Better TCO
•  Data Integration
•  Self Service Powerful Reporting
•  Usability and connectivity
•  Self Service Dashboards
•  Time to Market
•  Access to Big Data and Analytical DB
3© 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555
A modern, unified embeddable platform built for the future of analytics,
including big data and cloud-ready analytics
4© 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555
CENTRAL ADMINISTRATION, AUDITING & MONITORING
DELIVER
When & Where
Users Need It
STREAMLINE
Information Delivery
VISUALIZE
& Report Information
In Any Style
ACCESS
All Enterprise
Data Sources
ISV & Packaged
Applications
SaaS / Cloud
Applications
EMBEDDED
Web
Mobile
Print
E-Mail
STANDALONE
‣  Advanced &
Predictive Analytics
DATA MINING
‣  Interactive
‣  Operational
‣  Enterprise
REPORTING
‣  Ad hoc Exploration
‣  Multi-Dimensional
ANALYSIS
‣  Interactive Metrics
‣  Rich Visualizations
DASHBOARDS
ERP / CRM /
Enterprise Apps
(e.g. SAP, Oracle)
Hadoop &
NoSQL Data
Unstructured &
semi-structured
(XML, Excel, Files, etc.)
Relational
Data Sources
Cloud
(e.g. Salesforce,
Amazon, Dell)
‣  Direct
Access
‣  Data Integration
‣  Hadoop
Clustering
‣  Graphical
ETL Designer
‣  Enterprise
Scalability
INTEGRATE,
CLEANSE,
& ENRICH DATA
‣  In Memory
Caching
‣  High
Performance
‣  Relational
OLAP Cubes
METADATA
LAYER
Over 1500 Customers Across All Industries
5© 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555
Skype – Sales & financial reporting with plans to analyze
user stats to improve performance monitoring.
Comcast – Data integration for “single version of the truth”
and new BI initiative to enable self service reporting/analytics
for business analysts.
Nuance – Use Pentaho to join and transform data from mobile
device usage logs to perform complex data analysis.
Bell Canada – Optimize project and consulting resources by
being able to view all project activity across multiple
teams (and acquisitions –all different data sources).
Pentaho for Hadoop
Large Financial Institution
6© 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555
6
© 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555
Why Pentaho
•  Ability to load disparate data sources into
HDFS and Hbase
•  Ability to load post-processed data into
DB2
•  Ability to interface with caching and
message queue technologies via customer-
specific Java libraries
Business Challenge
To gain competitive advantage through intraday balance reporting for commercial customers.
Pentaho Benefits
•  Lowers technical barriers by providing an easy to use ETL
environment for designing MapReduce jobs without having
to write code
•  Provides a graphical orchestration environment for
Hadoop, HBase and DB2 data integration workloads
•  Processes Client, Account, Reference, Transaction and
Balance information at the lowest level of granularity
possible
Forrester Enterprise Hadoop Solutions Wave
Highest-Scored Analytics Vendor
7© 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555
Pentaho BIG Data & Data Integration
Walkthrough
Bo Borland
Head of System Engineers
8© 2012, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555
Pentaho Platform Design Drivers
1.  Big data is changing the world
2.  Open systems are more innovative
3.  Subscriptions models reduce cost and risk
4.  Simplicity empowers the masses
5.  Pluggable java architectures enables flexibility and
competitive advantage
6.  Enterprise-wide integration reduce cost and complexity
7.  Predictive technologies are next big thing in analytics
9
Big Wireless
10© 2012, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555
3
Calling
Plans
•  Nationwide
•  PAYG
•  Prepaid 50
2
Business
Units
•  B2B
•  B2C
7
Retail
Stores
7
Product
Lines
3
Websites
Big Wireless– Wireless Carrier
•  San Francisco
•  Boston
•  NYC
•  Paris
•  Tokyo
•  Sydney
•  London
•  Smartphones
•  Home Phones
•  Wifi Devices
•  Modems
•  Notebooks
•  Tablets
•  Accessories
•  Ecommerce Site
•  Reseller Portal
•  Manufacturer Portal
Store Managers
Executives & Product
Managers
Operations and Store
Employees
Marketing & Customer
Support
B2B Sales Organization
Databases
Call Detail
Records
Retail Sales
Website
Clickstream
Website
User Registration
2013 Performance Goals
12
Increase subscription revenue
Improve store profitability
Eliminate inventory stock outs
Leverage big data to maximize profits
Profile and target profitable customers
Improve supply chain visibility for partners
2013 Performance Goals
13
Goals Objectives Enablers
Increase
subscription
revenue
Analyze call data to upsell PAYG
customers to subscriptions
Improve store
profitability
Hold store managers accountable by
pushing store income statements to
email
Eliminate inventory
stock outs
Empower store employees with iPads
and real-time inventory reports
Profile and target
profitable
customers
Profile mobile plan customers with
high average call duration
Leverage big data
to maximize profits
Analyze e-commerce clickstream data
in MongoDB to profile purchasing
users and predict users propensity to
purchase.
Improve supply
chain visibility for
partners
Give phone manufacturers and
resellers web access to secure sales
reports
3
Calling
Plans
•  Nationwide
•  PAYG
•  Prepaid 50
2
Business
Units
•  B2B
•  B2C
7
Retail
Stores
7
Product
Lines
3
Websites
Enterprise-Wide Analytics
10
Resellers
10
Phone
Manf
Red River Mobile
•  San Francisco
•  Boston
•  NYC
•  Paris
•  Tokyo
•  Sydney
•  London
•  Smartphones
•  Home Phones
•  Wifi Devices
•  Modems
•  Notebooks
•  Tablets
•  Accessories
•  Ecommerce Site
•  Reseller Portal
•  Manufacturer Portal
EXTERNAL INTERNAL
IFrame Integration
Custom Widget
Embedding
Big Data
15© 2012, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555
Mobile Network Provider
Call Detail Records (CDR)
•  Mobile networks generate vast amounts of daily call data
•  CDR tracks every voice, SMS, or location service
•  2 years of detailed CDR records in DW
•  Archived to tape after 2 years
Data Sources Data Warehouse Architecture
Data Warehouse
(Master & Transactional Data)
ERP
CRM
CDR
Analytic
Data Mart(s)
Analytic
Data Mart(s)
Analytic
Data Mart(s)
Tape
Archive
Current Data Warehouse Architecture
Data Sources Data Warehouse Architecture
Data Warehouse
(Master & Transactional Data)
ERP
CRM
CDR
Analytic
Data Mart(s)
Analytic
Data Mart(s)
Analytic
Data Mart(s)
Tape
Archive
With Current EDW Architecture With Hadoop
EDW stores only 2 years of data à Hadoop active archive for all history
Infrastructure at capacity à Frees EDW capacity for high value data
Expensive to scale à Lowers cost and inexpensive to scale
ETL process complex and slow à Streamlined ingestion of raw data
Only analyze 2 years of data à Analyze 10 years of data
Data Warehouse Optimization
Data Sources Big Data Architecture
Data Warehouse
(Master & Transactional Data)
ERP
CRM
CDR
Analytic
Data Mart(s)
Analytic
Data Mart(s)
Analytic
Data Mart(s)
Logs
Logs
Other Data
Raw Data
Parsed Data
Analytic Datasets
Master Data
Tape
Archive
ORCHESTRATE
ERP DW
Processing
CRM
Pig, Oozie, Flume, Hive,
Hbase, Sqoop
Raw Data
Parsed Data
Analytic Datasets
Pentaho Analytics for Hadoop
© 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555 19
Master Data
Analysis &
Reporting
A
N
A
L
Y
Z
E
Unstructured
Data
Structured
Data
I
N
G
E
S
T
Ingestion
VISUAL MAP REDUCE
Data Integration Analytics
Raw Data
Ingest Raw and Master Data
© 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555 20
Master Data
Unstructured
Data
Structured
Data
I
N
G
E
S
T
Ingestion
Processing
Raw Data
Parsed Data
Analytic Datasets
Visual Map Reduce
© 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555 21
Master Data
VISUAL MAP REDUCE
1.  Map Reduce Input – calling data
2.  Calculate Month, Day, Day of Week
3.  Extract 3 digit area code
4.  Lookup geo master data in HDFS
5.  Filter for weekend and US only calls
6.  Create “Value” field for Key-Value Pair
7.  Create “Key “ field for Key-Value Pair
8.  Map Reduce Output – Key-Value Pair
Java
Programing
Data Agnostic & Data Orchestration
© 2012, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555 22
Pentaho Data Integration
Parsed Data
Analytic Datasets
Hadoop Data Analysis
© 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555 23
Analysis
A
N
A
L
Y
Z
E
•  Analyze 10 years of call data by
geography, time zone for US only calls
made on the weekend.
–  Understand annual growth rates
–  Which geographies are driving highest
call volume growth rates?
ORCHESTRATE
Raw Data
Parsed Data
Analytic Datasets
Pentaho Big Data Demonstration
© 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555 24
Master Data
A
N
A
L
Y
Z
E
I
N
G
E
S
T
VISUAL MAP REDUCE
Ingest CDR data
into Hadoop1
Execute Map Reduce
to enrich CDR data2
Create and load a Hive
table with the map
reduce results3
Analyze 10 years
of call data 4
© 2012, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555
25
Thank You
Join the conversation. You can find us on:
blog.pentaho.com
@Pentaho
Facebook.com/Pentaho
Pentaho Business Analytics

Contenu connexe

Tendances

Big Data for Product Managers
Big Data for Product ManagersBig Data for Product Managers
Big Data for Product ManagersPentaho
 
Pentaho Business Analytics for ISVs and SaaS providers in healthcare
Pentaho Business Analytics for ISVs and SaaS providers in healthcarePentaho Business Analytics for ISVs and SaaS providers in healthcare
Pentaho Business Analytics for ISVs and SaaS providers in healthcarePentaho
 
Big Data Predictions for 2015
Big Data Predictions for 2015 Big Data Predictions for 2015
Big Data Predictions for 2015 Pentaho
 
30 for 30: Quick Start Your Pentaho Evaluation
30 for 30: Quick Start Your Pentaho Evaluation30 for 30: Quick Start Your Pentaho Evaluation
30 for 30: Quick Start Your Pentaho EvaluationPentaho
 
MongoDB IoT City Tour EINDHOVEN: Analysing the Internet of Things: Davy Nys, ...
MongoDB IoT City Tour EINDHOVEN: Analysing the Internet of Things: Davy Nys, ...MongoDB IoT City Tour EINDHOVEN: Analysing the Internet of Things: Davy Nys, ...
MongoDB IoT City Tour EINDHOVEN: Analysing the Internet of Things: Davy Nys, ...MongoDB
 
Pentaho Analytics for MongoDB - presentation from MongoDB World 2014
Pentaho Analytics for MongoDB - presentation from MongoDB World 2014Pentaho Analytics for MongoDB - presentation from MongoDB World 2014
Pentaho Analytics for MongoDB - presentation from MongoDB World 2014Pentaho
 
MongoDB IoT City Tour LONDON: Analysing the Internet of Things: Davy Nys, Pen...
MongoDB IoT City Tour LONDON: Analysing the Internet of Things: Davy Nys, Pen...MongoDB IoT City Tour LONDON: Analysing the Internet of Things: Davy Nys, Pen...
MongoDB IoT City Tour LONDON: Analysing the Internet of Things: Davy Nys, Pen...MongoDB
 
Up Your Analytics Game with Pentaho and Vertica
Up Your Analytics Game with Pentaho and Vertica Up Your Analytics Game with Pentaho and Vertica
Up Your Analytics Game with Pentaho and Vertica Pentaho
 
Open Analytics 2014 - Pedro Alves - Innovation though Open Source
Open Analytics 2014 - Pedro Alves - Innovation though Open SourceOpen Analytics 2014 - Pedro Alves - Innovation though Open Source
Open Analytics 2014 - Pedro Alves - Innovation though Open SourceOpenAnalytics Spain
 
Competitive edgewithmongod bandpentaho_2014sep_v3[1]
Competitive edgewithmongod bandpentaho_2014sep_v3[1]Competitive edgewithmongod bandpentaho_2014sep_v3[1]
Competitive edgewithmongod bandpentaho_2014sep_v3[1]Pentaho
 
Bay Area Hadoop User Group
Bay Area Hadoop User GroupBay Area Hadoop User Group
Bay Area Hadoop User GroupPentaho
 
BI congres 2016-2: Diving into weblog data with SAS on Hadoop - Lisa Truyers...
BI congres 2016-2: Diving into weblog data with SAS on Hadoop -  Lisa Truyers...BI congres 2016-2: Diving into weblog data with SAS on Hadoop -  Lisa Truyers...
BI congres 2016-2: Diving into weblog data with SAS on Hadoop - Lisa Truyers...BICC Thomas More
 
Data Integration and Advanced Analytics for MongoDB: Blend, Enrich and Analyz...
Data Integration and Advanced Analytics for MongoDB: Blend, Enrich and Analyz...Data Integration and Advanced Analytics for MongoDB: Blend, Enrich and Analyz...
Data Integration and Advanced Analytics for MongoDB: Blend, Enrich and Analyz...MongoDB
 
Pentaho big data camp - 5 min
Pentaho   big data camp - 5 minPentaho   big data camp - 5 min
Pentaho big data camp - 5 minianfyfe
 
MongoDB IoT City Tour STUTTGART: Analysing the Internet of Things. By, Pentaho
MongoDB IoT City Tour STUTTGART: Analysing the Internet of Things. By, PentahoMongoDB IoT City Tour STUTTGART: Analysing the Internet of Things. By, Pentaho
MongoDB IoT City Tour STUTTGART: Analysing the Internet of Things. By, PentahoMongoDB
 
Webinar | Using Hadoop Analytics to Gain a Big Data Advantage
Webinar | Using Hadoop Analytics to Gain a Big Data AdvantageWebinar | Using Hadoop Analytics to Gain a Big Data Advantage
Webinar | Using Hadoop Analytics to Gain a Big Data AdvantageCloudera, Inc.
 
Pentaho - Jake Cornelius - Hadoop World 2010
Pentaho - Jake Cornelius - Hadoop World 2010Pentaho - Jake Cornelius - Hadoop World 2010
Pentaho - Jake Cornelius - Hadoop World 2010Cloudera, Inc.
 
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata Hortonworks
 
Data Aggregation, Curation and analytics for security and situational awareness
Data Aggregation, Curation and analytics for security and situational awarenessData Aggregation, Curation and analytics for security and situational awareness
Data Aggregation, Curation and analytics for security and situational awarenessDataWorks Summit/Hadoop Summit
 

Tendances (20)

Big Data for Product Managers
Big Data for Product ManagersBig Data for Product Managers
Big Data for Product Managers
 
Pentaho Business Analytics for ISVs and SaaS providers in healthcare
Pentaho Business Analytics for ISVs and SaaS providers in healthcarePentaho Business Analytics for ISVs and SaaS providers in healthcare
Pentaho Business Analytics for ISVs and SaaS providers in healthcare
 
Big Data Predictions for 2015
Big Data Predictions for 2015 Big Data Predictions for 2015
Big Data Predictions for 2015
 
30 for 30: Quick Start Your Pentaho Evaluation
30 for 30: Quick Start Your Pentaho Evaluation30 for 30: Quick Start Your Pentaho Evaluation
30 for 30: Quick Start Your Pentaho Evaluation
 
MongoDB IoT City Tour EINDHOVEN: Analysing the Internet of Things: Davy Nys, ...
MongoDB IoT City Tour EINDHOVEN: Analysing the Internet of Things: Davy Nys, ...MongoDB IoT City Tour EINDHOVEN: Analysing the Internet of Things: Davy Nys, ...
MongoDB IoT City Tour EINDHOVEN: Analysing the Internet of Things: Davy Nys, ...
 
Pentaho Analytics for MongoDB - presentation from MongoDB World 2014
Pentaho Analytics for MongoDB - presentation from MongoDB World 2014Pentaho Analytics for MongoDB - presentation from MongoDB World 2014
Pentaho Analytics for MongoDB - presentation from MongoDB World 2014
 
MongoDB IoT City Tour LONDON: Analysing the Internet of Things: Davy Nys, Pen...
MongoDB IoT City Tour LONDON: Analysing the Internet of Things: Davy Nys, Pen...MongoDB IoT City Tour LONDON: Analysing the Internet of Things: Davy Nys, Pen...
MongoDB IoT City Tour LONDON: Analysing the Internet of Things: Davy Nys, Pen...
 
Up Your Analytics Game with Pentaho and Vertica
Up Your Analytics Game with Pentaho and Vertica Up Your Analytics Game with Pentaho and Vertica
Up Your Analytics Game with Pentaho and Vertica
 
Open Analytics 2014 - Pedro Alves - Innovation though Open Source
Open Analytics 2014 - Pedro Alves - Innovation though Open SourceOpen Analytics 2014 - Pedro Alves - Innovation though Open Source
Open Analytics 2014 - Pedro Alves - Innovation though Open Source
 
Competitive edgewithmongod bandpentaho_2014sep_v3[1]
Competitive edgewithmongod bandpentaho_2014sep_v3[1]Competitive edgewithmongod bandpentaho_2014sep_v3[1]
Competitive edgewithmongod bandpentaho_2014sep_v3[1]
 
Bay Area Hadoop User Group
Bay Area Hadoop User GroupBay Area Hadoop User Group
Bay Area Hadoop User Group
 
BI congres 2016-2: Diving into weblog data with SAS on Hadoop - Lisa Truyers...
BI congres 2016-2: Diving into weblog data with SAS on Hadoop -  Lisa Truyers...BI congres 2016-2: Diving into weblog data with SAS on Hadoop -  Lisa Truyers...
BI congres 2016-2: Diving into weblog data with SAS on Hadoop - Lisa Truyers...
 
Data Integration and Advanced Analytics for MongoDB: Blend, Enrich and Analyz...
Data Integration and Advanced Analytics for MongoDB: Blend, Enrich and Analyz...Data Integration and Advanced Analytics for MongoDB: Blend, Enrich and Analyz...
Data Integration and Advanced Analytics for MongoDB: Blend, Enrich and Analyz...
 
Pentaho big data camp - 5 min
Pentaho   big data camp - 5 minPentaho   big data camp - 5 min
Pentaho big data camp - 5 min
 
MongoDB IoT City Tour STUTTGART: Analysing the Internet of Things. By, Pentaho
MongoDB IoT City Tour STUTTGART: Analysing the Internet of Things. By, PentahoMongoDB IoT City Tour STUTTGART: Analysing the Internet of Things. By, Pentaho
MongoDB IoT City Tour STUTTGART: Analysing the Internet of Things. By, Pentaho
 
Webinar | Using Hadoop Analytics to Gain a Big Data Advantage
Webinar | Using Hadoop Analytics to Gain a Big Data AdvantageWebinar | Using Hadoop Analytics to Gain a Big Data Advantage
Webinar | Using Hadoop Analytics to Gain a Big Data Advantage
 
Pentaho - Jake Cornelius - Hadoop World 2010
Pentaho - Jake Cornelius - Hadoop World 2010Pentaho - Jake Cornelius - Hadoop World 2010
Pentaho - Jake Cornelius - Hadoop World 2010
 
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
 
Ask bigger questions
Ask bigger questionsAsk bigger questions
Ask bigger questions
 
Data Aggregation, Curation and analytics for security and situational awareness
Data Aggregation, Curation and analytics for security and situational awarenessData Aggregation, Curation and analytics for security and situational awareness
Data Aggregation, Curation and analytics for security and situational awareness
 

Similaire à Exclusive Verizon Employee Webinar: Getting More From Your CDR Data

Pentaho Analytics on MongoDB
Pentaho Analytics on MongoDBPentaho Analytics on MongoDB
Pentaho Analytics on MongoDBMark Kromer
 
IoT Crash Course Hadoop Summit SJ
IoT Crash Course Hadoop Summit SJIoT Crash Course Hadoop Summit SJ
IoT Crash Course Hadoop Summit SJDaniel Madrigal
 
Hortonworks and HP Vertica Webinar
Hortonworks and HP Vertica WebinarHortonworks and HP Vertica Webinar
Hortonworks and HP Vertica WebinarHortonworks
 
Track B-1 建構新世代的智慧數據平台
Track B-1 建構新世代的智慧數據平台Track B-1 建構新世代的智慧數據平台
Track B-1 建構新世代的智慧數據平台Etu Solution
 
Apache Hadoop and its role in Big Data architecture - Himanshu Bari
Apache Hadoop and its role in Big Data architecture - Himanshu BariApache Hadoop and its role in Big Data architecture - Himanshu Bari
Apache Hadoop and its role in Big Data architecture - Himanshu Barijaxconf
 
Big data tim
Big data timBig data tim
Big data timT Weir
 
Verizon Centralizes Data into a Data Lake in Real Time for Analytics
Verizon Centralizes Data into a Data Lake in Real Time for AnalyticsVerizon Centralizes Data into a Data Lake in Real Time for Analytics
Verizon Centralizes Data into a Data Lake in Real Time for AnalyticsDataWorks Summit
 
Oracle Big Data Appliance and Big Data SQL for advanced analytics
Oracle Big Data Appliance and Big Data SQL for advanced analyticsOracle Big Data Appliance and Big Data SQL for advanced analytics
Oracle Big Data Appliance and Big Data SQL for advanced analyticsjdijcks
 
How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...
How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...
How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...Denodo
 
Big Data Expo 2015 - Pentaho The Future of Analytics
Big Data Expo 2015 - Pentaho The Future of AnalyticsBig Data Expo 2015 - Pentaho The Future of Analytics
Big Data Expo 2015 - Pentaho The Future of AnalyticsBigDataExpo
 
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoptionHortonworks
 
Hadoop Reporting and Analysis - Jaspersoft
Hadoop Reporting and Analysis - JaspersoftHadoop Reporting and Analysis - Jaspersoft
Hadoop Reporting and Analysis - JaspersoftHortonworks
 
Evolution of Big Data at Intel - Crawl, Walk and Run Approach
Evolution of Big Data at Intel - Crawl, Walk and Run ApproachEvolution of Big Data at Intel - Crawl, Walk and Run Approach
Evolution of Big Data at Intel - Crawl, Walk and Run ApproachDataWorks Summit
 
Advanced Reporting and ETL for MongoDB: Easily Build a 360-Degree View of You...
Advanced Reporting and ETL for MongoDB: Easily Build a 360-Degree View of You...Advanced Reporting and ETL for MongoDB: Easily Build a 360-Degree View of You...
Advanced Reporting and ETL for MongoDB: Easily Build a 360-Degree View of You...MongoDB
 
How Experian increased insights with Hadoop
How Experian increased insights with HadoopHow Experian increased insights with Hadoop
How Experian increased insights with HadoopPrecisely
 
Big data an elephant business opportunities
Big data an elephant   business opportunitiesBig data an elephant   business opportunities
Big data an elephant business opportunitiesBigdata Meetup Kochi
 
BI congres 2014-5: from BI to big data - Jan Aertsen - Pentaho
BI congres 2014-5: from BI to big data - Jan Aertsen - PentahoBI congres 2014-5: from BI to big data - Jan Aertsen - Pentaho
BI congres 2014-5: from BI to big data - Jan Aertsen - PentahoBICC Thomas More
 
Getting to What Matters: Accelerating Your Path Through the Big Data Lifecycl...
Getting to What Matters: Accelerating Your Path Through the Big Data Lifecycl...Getting to What Matters: Accelerating Your Path Through the Big Data Lifecycl...
Getting to What Matters: Accelerating Your Path Through the Big Data Lifecycl...Hortonworks
 

Similaire à Exclusive Verizon Employee Webinar: Getting More From Your CDR Data (20)

Pentaho Analytics on MongoDB
Pentaho Analytics on MongoDBPentaho Analytics on MongoDB
Pentaho Analytics on MongoDB
 
Solving Big Data Problems using Hortonworks
Solving Big Data Problems using Hortonworks Solving Big Data Problems using Hortonworks
Solving Big Data Problems using Hortonworks
 
IoT Crash Course Hadoop Summit SJ
IoT Crash Course Hadoop Summit SJIoT Crash Course Hadoop Summit SJ
IoT Crash Course Hadoop Summit SJ
 
Hortonworks and HP Vertica Webinar
Hortonworks and HP Vertica WebinarHortonworks and HP Vertica Webinar
Hortonworks and HP Vertica Webinar
 
Track B-1 建構新世代的智慧數據平台
Track B-1 建構新世代的智慧數據平台Track B-1 建構新世代的智慧數據平台
Track B-1 建構新世代的智慧數據平台
 
Apache Hadoop and its role in Big Data architecture - Himanshu Bari
Apache Hadoop and its role in Big Data architecture - Himanshu BariApache Hadoop and its role in Big Data architecture - Himanshu Bari
Apache Hadoop and its role in Big Data architecture - Himanshu Bari
 
Big data tim
Big data timBig data tim
Big data tim
 
Verizon Centralizes Data into a Data Lake in Real Time for Analytics
Verizon Centralizes Data into a Data Lake in Real Time for AnalyticsVerizon Centralizes Data into a Data Lake in Real Time for Analytics
Verizon Centralizes Data into a Data Lake in Real Time for Analytics
 
Oracle Big Data Appliance and Big Data SQL for advanced analytics
Oracle Big Data Appliance and Big Data SQL for advanced analyticsOracle Big Data Appliance and Big Data SQL for advanced analytics
Oracle Big Data Appliance and Big Data SQL for advanced analytics
 
How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...
How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...
How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...
 
Big Data Expo 2015 - Pentaho The Future of Analytics
Big Data Expo 2015 - Pentaho The Future of AnalyticsBig Data Expo 2015 - Pentaho The Future of Analytics
Big Data Expo 2015 - Pentaho The Future of Analytics
 
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
 
Hadoop Reporting and Analysis - Jaspersoft
Hadoop Reporting and Analysis - JaspersoftHadoop Reporting and Analysis - Jaspersoft
Hadoop Reporting and Analysis - Jaspersoft
 
Evolution of Big Data at Intel - Crawl, Walk and Run Approach
Evolution of Big Data at Intel - Crawl, Walk and Run ApproachEvolution of Big Data at Intel - Crawl, Walk and Run Approach
Evolution of Big Data at Intel - Crawl, Walk and Run Approach
 
Advanced Reporting and ETL for MongoDB: Easily Build a 360-Degree View of You...
Advanced Reporting and ETL for MongoDB: Easily Build a 360-Degree View of You...Advanced Reporting and ETL for MongoDB: Easily Build a 360-Degree View of You...
Advanced Reporting and ETL for MongoDB: Easily Build a 360-Degree View of You...
 
How Experian increased insights with Hadoop
How Experian increased insights with HadoopHow Experian increased insights with Hadoop
How Experian increased insights with Hadoop
 
Big data an elephant business opportunities
Big data an elephant   business opportunitiesBig data an elephant   business opportunities
Big data an elephant business opportunities
 
BI congres 2014-5: from BI to big data - Jan Aertsen - Pentaho
BI congres 2014-5: from BI to big data - Jan Aertsen - PentahoBI congres 2014-5: from BI to big data - Jan Aertsen - Pentaho
BI congres 2014-5: from BI to big data - Jan Aertsen - Pentaho
 
Getting to What Matters: Accelerating Your Path Through the Big Data Lifecycl...
Getting to What Matters: Accelerating Your Path Through the Big Data Lifecycl...Getting to What Matters: Accelerating Your Path Through the Big Data Lifecycl...
Getting to What Matters: Accelerating Your Path Through the Big Data Lifecycl...
 
Filling the Data Lake
Filling the Data LakeFilling the Data Lake
Filling the Data Lake
 

Plus de Pentaho

Data Mashups for Analytics
Data Mashups for AnalyticsData Mashups for Analytics
Data Mashups for AnalyticsPentaho
 
Filling the Data Lake - Strata + HadoopWorld San Jose 2016 Preview Presentation
Filling the Data Lake - Strata + HadoopWorld San Jose 2016 Preview PresentationFilling the Data Lake - Strata + HadoopWorld San Jose 2016 Preview Presentation
Filling the Data Lake - Strata + HadoopWorld San Jose 2016 Preview PresentationPentaho
 
The Next Big Thing in Big Data
The Next Big Thing in Big DataThe Next Big Thing in Big Data
The Next Big Thing in Big DataPentaho
 
Data Is Your Next Product Opportunity
Data Is Your Next Product Opportunity Data Is Your Next Product Opportunity
Data Is Your Next Product Opportunity Pentaho
 
Improving the Business of Healthcare through Better Analytics
Improving the Business of Healthcare through Better Analytics Improving the Business of Healthcare through Better Analytics
Improving the Business of Healthcare through Better Analytics Pentaho
 
Predictive Analytics with Pentaho Data Mining - Análisis Predictivo con Penta...
Predictive Analytics with Pentaho Data Mining - Análisis Predictivo con Penta...Predictive Analytics with Pentaho Data Mining - Análisis Predictivo con Penta...
Predictive Analytics with Pentaho Data Mining - Análisis Predictivo con Penta...Pentaho
 
Pentaho Healthcare Solutions
Pentaho Healthcare SolutionsPentaho Healthcare Solutions
Pentaho Healthcare SolutionsPentaho
 

Plus de Pentaho (7)

Data Mashups for Analytics
Data Mashups for AnalyticsData Mashups for Analytics
Data Mashups for Analytics
 
Filling the Data Lake - Strata + HadoopWorld San Jose 2016 Preview Presentation
Filling the Data Lake - Strata + HadoopWorld San Jose 2016 Preview PresentationFilling the Data Lake - Strata + HadoopWorld San Jose 2016 Preview Presentation
Filling the Data Lake - Strata + HadoopWorld San Jose 2016 Preview Presentation
 
The Next Big Thing in Big Data
The Next Big Thing in Big DataThe Next Big Thing in Big Data
The Next Big Thing in Big Data
 
Data Is Your Next Product Opportunity
Data Is Your Next Product Opportunity Data Is Your Next Product Opportunity
Data Is Your Next Product Opportunity
 
Improving the Business of Healthcare through Better Analytics
Improving the Business of Healthcare through Better Analytics Improving the Business of Healthcare through Better Analytics
Improving the Business of Healthcare through Better Analytics
 
Predictive Analytics with Pentaho Data Mining - Análisis Predictivo con Penta...
Predictive Analytics with Pentaho Data Mining - Análisis Predictivo con Penta...Predictive Analytics with Pentaho Data Mining - Análisis Predictivo con Penta...
Predictive Analytics with Pentaho Data Mining - Análisis Predictivo con Penta...
 
Pentaho Healthcare Solutions
Pentaho Healthcare SolutionsPentaho Healthcare Solutions
Pentaho Healthcare Solutions
 

Dernier

What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 

Dernier (20)

What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 

Exclusive Verizon Employee Webinar: Getting More From Your CDR Data

  • 1. Verizon & Big Data: Getting More from CDR Data © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555 The program will begin shortly. Please listen to the webinar with your computer speakers turned on. John Michael Brack, Manager - Northeast Region, Pentaho Bo Borland, Head of System Engineers, Pentaho
  • 2. Agenda •  Project Profile •  Pentaho Overview •  Big Wireless Analytics •  Call Detail Records (CDR) dashboard and analysis •  Retail Sales Reporting, IR, Mobile and Analysis •  DW Optimization •  Rescuing CDR data from tape archive with Hadoop •  Pentaho for Hadoop (ingestion, map reduce, orchestration, analysis) 2© 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555
  • 3. Verizon Project Profile Audience: Business, Executives, Management, IT, Architects Company: Verizon Communications Inc. was founded in 1983 and is based in New York, New York and has over 180,000 associates and 115 B in revenue. Verizon Communications Inc., through its subsidiaries, provides communications, information and entertainment products and services to consumers, businesses, and governmental agencies worldwide. Its Verizon Wireless segment offers access to various wireless voice and data services comprising Internet access through smart phones and basic phones, and notebook computers and tablets; messaging services, which enable customers to send and receive text, picture, and video messages; and consumer- focused and business-focused multimedia applications. . Goals: •  Self Service Business Analytics •  Better TCO •  Data Integration •  Self Service Powerful Reporting •  Usability and connectivity •  Self Service Dashboards •  Time to Market •  Access to Big Data and Analytical DB 3© 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555
  • 4. A modern, unified embeddable platform built for the future of analytics, including big data and cloud-ready analytics 4© 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555 CENTRAL ADMINISTRATION, AUDITING & MONITORING DELIVER When & Where Users Need It STREAMLINE Information Delivery VISUALIZE & Report Information In Any Style ACCESS All Enterprise Data Sources ISV & Packaged Applications SaaS / Cloud Applications EMBEDDED Web Mobile Print E-Mail STANDALONE ‣  Advanced & Predictive Analytics DATA MINING ‣  Interactive ‣  Operational ‣  Enterprise REPORTING ‣  Ad hoc Exploration ‣  Multi-Dimensional ANALYSIS ‣  Interactive Metrics ‣  Rich Visualizations DASHBOARDS ERP / CRM / Enterprise Apps (e.g. SAP, Oracle) Hadoop & NoSQL Data Unstructured & semi-structured (XML, Excel, Files, etc.) Relational Data Sources Cloud (e.g. Salesforce, Amazon, Dell) ‣  Direct Access ‣  Data Integration ‣  Hadoop Clustering ‣  Graphical ETL Designer ‣  Enterprise Scalability INTEGRATE, CLEANSE, & ENRICH DATA ‣  In Memory Caching ‣  High Performance ‣  Relational OLAP Cubes METADATA LAYER
  • 5. Over 1500 Customers Across All Industries 5© 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555 Skype – Sales & financial reporting with plans to analyze user stats to improve performance monitoring. Comcast – Data integration for “single version of the truth” and new BI initiative to enable self service reporting/analytics for business analysts. Nuance – Use Pentaho to join and transform data from mobile device usage logs to perform complex data analysis. Bell Canada – Optimize project and consulting resources by being able to view all project activity across multiple teams (and acquisitions –all different data sources).
  • 6. Pentaho for Hadoop Large Financial Institution 6© 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555 6 © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555 Why Pentaho •  Ability to load disparate data sources into HDFS and Hbase •  Ability to load post-processed data into DB2 •  Ability to interface with caching and message queue technologies via customer- specific Java libraries Business Challenge To gain competitive advantage through intraday balance reporting for commercial customers. Pentaho Benefits •  Lowers technical barriers by providing an easy to use ETL environment for designing MapReduce jobs without having to write code •  Provides a graphical orchestration environment for Hadoop, HBase and DB2 data integration workloads •  Processes Client, Account, Reference, Transaction and Balance information at the lowest level of granularity possible
  • 7. Forrester Enterprise Hadoop Solutions Wave Highest-Scored Analytics Vendor 7© 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555
  • 8. Pentaho BIG Data & Data Integration Walkthrough Bo Borland Head of System Engineers 8© 2012, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555
  • 9. Pentaho Platform Design Drivers 1.  Big data is changing the world 2.  Open systems are more innovative 3.  Subscriptions models reduce cost and risk 4.  Simplicity empowers the masses 5.  Pluggable java architectures enables flexibility and competitive advantage 6.  Enterprise-wide integration reduce cost and complexity 7.  Predictive technologies are next big thing in analytics 9
  • 10. Big Wireless 10© 2012, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555
  • 11. 3 Calling Plans •  Nationwide •  PAYG •  Prepaid 50 2 Business Units •  B2B •  B2C 7 Retail Stores 7 Product Lines 3 Websites Big Wireless– Wireless Carrier •  San Francisco •  Boston •  NYC •  Paris •  Tokyo •  Sydney •  London •  Smartphones •  Home Phones •  Wifi Devices •  Modems •  Notebooks •  Tablets •  Accessories •  Ecommerce Site •  Reseller Portal •  Manufacturer Portal Store Managers Executives & Product Managers Operations and Store Employees Marketing & Customer Support B2B Sales Organization Databases Call Detail Records Retail Sales Website Clickstream Website User Registration
  • 12. 2013 Performance Goals 12 Increase subscription revenue Improve store profitability Eliminate inventory stock outs Leverage big data to maximize profits Profile and target profitable customers Improve supply chain visibility for partners
  • 13. 2013 Performance Goals 13 Goals Objectives Enablers Increase subscription revenue Analyze call data to upsell PAYG customers to subscriptions Improve store profitability Hold store managers accountable by pushing store income statements to email Eliminate inventory stock outs Empower store employees with iPads and real-time inventory reports Profile and target profitable customers Profile mobile plan customers with high average call duration Leverage big data to maximize profits Analyze e-commerce clickstream data in MongoDB to profile purchasing users and predict users propensity to purchase. Improve supply chain visibility for partners Give phone manufacturers and resellers web access to secure sales reports
  • 14. 3 Calling Plans •  Nationwide •  PAYG •  Prepaid 50 2 Business Units •  B2B •  B2C 7 Retail Stores 7 Product Lines 3 Websites Enterprise-Wide Analytics 10 Resellers 10 Phone Manf Red River Mobile •  San Francisco •  Boston •  NYC •  Paris •  Tokyo •  Sydney •  London •  Smartphones •  Home Phones •  Wifi Devices •  Modems •  Notebooks •  Tablets •  Accessories •  Ecommerce Site •  Reseller Portal •  Manufacturer Portal EXTERNAL INTERNAL IFrame Integration Custom Widget Embedding
  • 15. Big Data 15© 2012, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555
  • 16. Mobile Network Provider Call Detail Records (CDR) •  Mobile networks generate vast amounts of daily call data •  CDR tracks every voice, SMS, or location service •  2 years of detailed CDR records in DW •  Archived to tape after 2 years Data Sources Data Warehouse Architecture Data Warehouse (Master & Transactional Data) ERP CRM CDR Analytic Data Mart(s) Analytic Data Mart(s) Analytic Data Mart(s) Tape Archive
  • 17. Current Data Warehouse Architecture Data Sources Data Warehouse Architecture Data Warehouse (Master & Transactional Data) ERP CRM CDR Analytic Data Mart(s) Analytic Data Mart(s) Analytic Data Mart(s) Tape Archive With Current EDW Architecture With Hadoop EDW stores only 2 years of data à Hadoop active archive for all history Infrastructure at capacity à Frees EDW capacity for high value data Expensive to scale à Lowers cost and inexpensive to scale ETL process complex and slow à Streamlined ingestion of raw data Only analyze 2 years of data à Analyze 10 years of data
  • 18. Data Warehouse Optimization Data Sources Big Data Architecture Data Warehouse (Master & Transactional Data) ERP CRM CDR Analytic Data Mart(s) Analytic Data Mart(s) Analytic Data Mart(s) Logs Logs Other Data Raw Data Parsed Data Analytic Datasets Master Data Tape Archive
  • 19. ORCHESTRATE ERP DW Processing CRM Pig, Oozie, Flume, Hive, Hbase, Sqoop Raw Data Parsed Data Analytic Datasets Pentaho Analytics for Hadoop © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555 19 Master Data Analysis & Reporting A N A L Y Z E Unstructured Data Structured Data I N G E S T Ingestion VISUAL MAP REDUCE Data Integration Analytics
  • 20. Raw Data Ingest Raw and Master Data © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555 20 Master Data Unstructured Data Structured Data I N G E S T Ingestion
  • 21. Processing Raw Data Parsed Data Analytic Datasets Visual Map Reduce © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555 21 Master Data VISUAL MAP REDUCE 1.  Map Reduce Input – calling data 2.  Calculate Month, Day, Day of Week 3.  Extract 3 digit area code 4.  Lookup geo master data in HDFS 5.  Filter for weekend and US only calls 6.  Create “Value” field for Key-Value Pair 7.  Create “Key “ field for Key-Value Pair 8.  Map Reduce Output – Key-Value Pair Java Programing
  • 22. Data Agnostic & Data Orchestration © 2012, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555 22 Pentaho Data Integration
  • 23. Parsed Data Analytic Datasets Hadoop Data Analysis © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555 23 Analysis A N A L Y Z E •  Analyze 10 years of call data by geography, time zone for US only calls made on the weekend. –  Understand annual growth rates –  Which geographies are driving highest call volume growth rates?
  • 24. ORCHESTRATE Raw Data Parsed Data Analytic Datasets Pentaho Big Data Demonstration © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555 24 Master Data A N A L Y Z E I N G E S T VISUAL MAP REDUCE Ingest CDR data into Hadoop1 Execute Map Reduce to enrich CDR data2 Create and load a Hive table with the map reduce results3 Analyze 10 years of call data 4
  • 25. © 2012, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555 25 Thank You Join the conversation. You can find us on: blog.pentaho.com @Pentaho Facebook.com/Pentaho Pentaho Business Analytics