SlideShare une entreprise Scribd logo
1  sur  36
Enriching Your
Data Warehouse
With Hadoop
Presenting with
Peter Yu, Senior Director
iTech Asia-Pacific & Japan
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.2
Agenda
 Opportunity
 Challenges
 Strategy
 Examples
 Best Practices
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.3
Connecting With Your Customer
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.4
Big Data Improves Operational Performance
Source: Economist Intelligence Unit, .”The Deciding Factor: Big Data and Decision Making“
Big data benefits
seen growing
substantially
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.5
Use
Data
12%
Executives who feel they
understand the impact data
will have on their organizations
Produce
Data
The Problem
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.6
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.7
DON’T BELIEVE
EVERYTHING
YOU READ
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.8
Electricity: AC or DC?
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.9
RDBMSRDBMS
Today
Discovery & Analytics
Business Intelligence
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.10
RDBMSRDBMS
Today
Discovery & Analytics
Business Intelligence
External
ETL Cluster
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.11
RDBMSRDBMS
What about Archived Data?
Discovery & Analytics
Business Intelligence
Archive
External
ETL Cluster
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.12
RDBMSRDBMS
What about New Data?
Discovery & Analytics
Business Intelligence
?
Archive
External
ETL Cluster
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.13
RDBMSRDBMS
Expand Your Data Warehouse
Discovery & Analytics
Business Intelligence
?
External
ETL Cluster
Archive
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.14
RDBMS
Hadoop
RDBMS
Hadoop
Integrate Hadoop and RDBMS
Discovery & Analytics
Business Intelligence
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.15
“[Facebook] started in the Hadoop world. We are now
bringing in relational to enhance that. We're kind of going [in]
the other direction ... We've been there, and [we] realized
that using the wrong technology for certain kinds of
problems can be difficult.”
Ken Rubin
Director of Analytics
Facebook
http://tdwi.org/Articles/2013/05/06/Facebooks-Relational-Platform.aspx?Page=1
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.16
Get Fast Answers
to New Questions
Create a Data
Reservoir
Predict More,
More Accurately
Accelerate
Data-Driven Action
Key Hadoop Use Cases
Complementing An Existing Data Warehouse
ETL
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.17
• Transactional data
• Customer information
• Web log and session data
• Machine/Sensor data
• Historical data
Data Reservoir
Keep All Potentially Valuable Data in One Place
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.18
RDBMSRDBMS
Today
Discovery & Analytics
Business Intelligence
?
Archive
External
ETL Cluster
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.19
RDBMS
Hadoop
RDBMS
Hadoop
Create an Active Archive with Hadoop
Discovery & Analytics
Business Intelligence
Σ
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.20
 80 years of historical data in Hadoop
 Structured and unstructured data includes
customer data, economic trends, telematic
sensors, weather, public data
 Integrated with mainframes and EDWs
 Before Hadoop, could analyze only one
state, took 24 hours
 With Cloudera, can analyze risk across all
50 states, in 16 hours (500x improvement)
 First 3 use cases: Data hub, ETL offload,
advanced analytics
Comprehensive risk analysis
Customer Example: Insurer Real-Time Data Hub
Cloudera
Hadoop
EDW and
Mainframe
Customer
Data
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.21
 Optimize offers
 Increase revenue
 Lower costs (offer bytes?)
 Reduce complexity
 Faster time to value
Maximize offer effectiveness
Customer Example: Travel Industry
Big Data
Appliance
Legacy
Data
Warehouse
Customer
Data
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.22
 Fast access to 85% more data
 Lowered costs
 Simplified architecture
 Faster time to value
Compliance, cost reduction
Customer Example: Regional Bank
Big Data
Appliance
Oracle
Exadata
Mainframe,
RDBMS
Oracle Data Integrator
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.23
• Batch window constraints
• Adding value vs. adding cost
• Analysis vs. Transformation
• Analysis vs. Data movement and replication
• Uncertain value of new data sources
ETL Challenges Today
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.24
Discovery & Analytics
Business Intelligence
RDBMSRDBMS
Typical ETL Today
External
ETL Cluster
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.25
RDBMS
Hadoop
RDBMS
Hadoop
ETL Offload with Hadoop
Discovery & Analytics
Business Intelligence
Σ
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.26
 Exponential growth in data,
generated by new consumer devices
 ETL and storage constraints limited
analytics to 1% sample
 Now combined Oracle Exadata and
Cloudera Hadoop delivers analytics
on 100% of data (half a PB per day!)
 Query times reduced dramatically
(i.e. from 4 days to 53 minutes)
 90% reduction of ETL code base
From 1% sampling to 100% analysis
Customer Example: Communications Services
Archive Storage
Data Warehouse
Complex
Correlation
Alerting
Filter
&
Split
Event
Monitoring
Streaming ETL
Streaming ETL
Teleco
m
Services
Before
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.27
 Exponential growth in data,
generated by new consumer devices
 ETL and storage constraints limited
analytics to 1% sample
 Now combined Oracle Exadata and
Cloudera Hadoop delivers analytics
on 100% of data (half a PB per day!)
 Query times reduced dramatically
(i.e. from 4 days to 53 minutes)
 90% reduction of ETL code base
From 1% sampling to 100% analysis
Customer Example: Communications Services
Archive Storage
Data Warehouse
Complex
Correlation
Alerting
Filter
&
Split
Event
Monitoring
Streaming ETL
Streaming ETL
Teleco
m
Services
Before
Data
Warehouse
Alerting
Filter
&
Split
Event
Monitoring
Hadoop
Archive Storage
ETL
Correlation
Stage 1 DWH
Teleco
m
Services
After
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.28
Identifying Potential Opportunities
Do you have a
problem with ETL
performance?
Do you have
potentially valuable
data that you aren’t
using, but might
deliver new insight?
Should you focus
on analyzing
structured data,
unstructured data,
or a combination?
Are Big Data
solutions already
being built as silos?
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.29
Challenges in the Way
& Oracle Strategies
CHALLENGES ORACLE STRATEGY
• Fragmented Solutions • Specialized but integrated data stores and tools
• Difficulty of Self-Service BI • Flexible, guided, automated BI & data discovery
• Data Not Current • Solutions for Just-in-Time well defined data
• Time to ROI / Development Time • Horizontal & industry pre-built solutions, engineered systems
• Growing Diversity of Data & Users • Enterprise solutions for 1000s diverse users, petabytes data
• Manageability, Security, Cost • Centrally managed with advanced security / governance
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.30
Big Data Readiness
The R&D Prototype Stage
 Skills needed
– Distributed data deployment (e.g. Hadoop)
– Python or Java programming with MapReduce
– Statistical analysis (e.g. R)
– Data integration
– Ability to formulate business hypotheses
– Ability to convey business value of Big Data
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.31
Are You Ready for Big Data?
Contact Your Account Team
 Have you delivered mature analytics solutions for
structured data?
 Can Big Data make a difference to the business?
 Have you built a Big Data prototype, built skills, and
proved value?
 Do you have an enterprise integration & deployment
strategy for Big Data?
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.32 Copyright © 2013, Oracle and/or its affiliates. All rights reserved.32
Continuous
Innovation
Big Data at Work
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.33
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.34
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.35
RDBMS
Hadoop
RDBMS
Hadoop
Master diagram
Discovery & Analytics
Business IntelligenceExternal
ETL Cluster
Data
Mart
Data
Mart
Archive
ΣΣEvent
Processing
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.36
Discovery & Analytics
Business Intelligence
RDBMSRDBMS
Master diagram without Hadoop
External
ETL Cluster
Data
Mart
Data
Mart
Archive
ΣEvent
Processing

Contenu connexe

Tendances

Tendances (20)

MapR Enterprise Data Hub Webinar w/ Mike Ferguson
MapR Enterprise Data Hub Webinar w/ Mike FergusonMapR Enterprise Data Hub Webinar w/ Mike Ferguson
MapR Enterprise Data Hub Webinar w/ Mike Ferguson
 
Oracle big data discovery 994294
Oracle big data discovery   994294Oracle big data discovery   994294
Oracle big data discovery 994294
 
The Emerging Data Lake IT Strategy
The Emerging Data Lake IT StrategyThe Emerging Data Lake IT Strategy
The Emerging Data Lake IT Strategy
 
The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...
The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...
The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...
 
Hadoop 2015: what we larned -Think Big, A Teradata Company
Hadoop 2015: what we larned -Think Big, A Teradata CompanyHadoop 2015: what we larned -Think Big, A Teradata Company
Hadoop 2015: what we larned -Think Big, A Teradata Company
 
Big Data Day LA 2015 - Data Lake - Re Birth of Enterprise Data Thinking by Ra...
Big Data Day LA 2015 - Data Lake - Re Birth of Enterprise Data Thinking by Ra...Big Data Day LA 2015 - Data Lake - Re Birth of Enterprise Data Thinking by Ra...
Big Data Day LA 2015 - Data Lake - Re Birth of Enterprise Data Thinking by Ra...
 
Big Data Solutions Executive Overview
Big Data Solutions Executive OverviewBig Data Solutions Executive Overview
Big Data Solutions Executive Overview
 
Open Source in the Energy Industry - Creating a New Operational Model for Dat...
Open Source in the Energy Industry - Creating a New Operational Model for Dat...Open Source in the Energy Industry - Creating a New Operational Model for Dat...
Open Source in the Energy Industry - Creating a New Operational Model for Dat...
 
The Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubThe Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data Hub
 
Emergence of MongoDB as an Enterprise Data Hub
Emergence of MongoDB as an Enterprise Data HubEmergence of MongoDB as an Enterprise Data Hub
Emergence of MongoDB as an Enterprise Data Hub
 
Exploratory Analysis in the Data Lab - Team-Sport or for Nerds only?
Exploratory Analysis in the Data Lab - Team-Sport or for Nerds only?Exploratory Analysis in the Data Lab - Team-Sport or for Nerds only?
Exploratory Analysis in the Data Lab - Team-Sport or for Nerds only?
 
Developing a Strategy for Data Lake Governance
Developing a Strategy for Data Lake GovernanceDeveloping a Strategy for Data Lake Governance
Developing a Strategy for Data Lake Governance
 
Enterprise Data Hub: The Next Big Thing in Big Data
Enterprise Data Hub: The Next Big Thing in Big DataEnterprise Data Hub: The Next Big Thing in Big Data
Enterprise Data Hub: The Next Big Thing in Big Data
 
Smarter Management for Your Data Growth
Smarter Management for Your Data GrowthSmarter Management for Your Data Growth
Smarter Management for Your Data Growth
 
The Data Lake - Balancing Data Governance and Innovation
The Data Lake - Balancing Data Governance and Innovation The Data Lake - Balancing Data Governance and Innovation
The Data Lake - Balancing Data Governance and Innovation
 
Becoming Data-Driven Through Cultural Change
Becoming Data-Driven Through Cultural ChangeBecoming Data-Driven Through Cultural Change
Becoming Data-Driven Through Cultural Change
 
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
 
Stream based Data Integration
Stream based Data IntegrationStream based Data Integration
Stream based Data Integration
 
Why Data Lake should be the foundation of Enterprise Data Architecture
Why Data Lake should be the foundation of Enterprise Data ArchitectureWhy Data Lake should be the foundation of Enterprise Data Architecture
Why Data Lake should be the foundation of Enterprise Data Architecture
 
IDC Retail Insights - What's Possible with a Modern Data Architecture?
IDC Retail Insights - What's Possible with a Modern Data Architecture?IDC Retail Insights - What's Possible with a Modern Data Architecture?
IDC Retail Insights - What's Possible with a Modern Data Architecture?
 

Similaire à 6 enriching your data warehouse with big data and hadoop

A6 harnessing the power of big data and business analytics to transform bus...
A6   harnessing the power of big data and business analytics to transform bus...A6   harnessing the power of big data and business analytics to transform bus...
A6 harnessing the power of big data and business analytics to transform bus...
Dr. Wilfred Lin (Ph.D.)
 
A7 getting value from big data how to get there quickly and leverage your c...
A7   getting value from big data how to get there quickly and leverage your c...A7   getting value from big data how to get there quickly and leverage your c...
A7 getting value from big data how to get there quickly and leverage your c...
Dr. Wilfred Lin (Ph.D.)
 
Extreme Analytics - What's New With Oracle Exalytics X3-4 & T5-8?
Extreme Analytics - What's New With Oracle Exalytics X3-4 & T5-8?Extreme Analytics - What's New With Oracle Exalytics X3-4 & T5-8?
Extreme Analytics - What's New With Oracle Exalytics X3-4 & T5-8?
KPI Partners
 
13 2792 big-data_keynote_presentation_finalpass_05_d_v02
13 2792 big-data_keynote_presentation_finalpass_05_d_v0213 2792 big-data_keynote_presentation_finalpass_05_d_v02
13 2792 big-data_keynote_presentation_finalpass_05_d_v02
Erin Kerrigan
 

Similaire à 6 enriching your data warehouse with big data and hadoop (20)

Big data oracle_introduccion
Big data oracle_introduccionBig data oracle_introduccion
Big data oracle_introduccion
 
Big Data: Myths and Realities
Big Data: Myths and RealitiesBig Data: Myths and Realities
Big Data: Myths and Realities
 
Insights into Real-world Data Management Challenges
Insights into Real-world Data Management ChallengesInsights into Real-world Data Management Challenges
Insights into Real-world Data Management Challenges
 
Expand a Data warehouse with Hadoop and Big Data
Expand a Data warehouse with Hadoop and Big DataExpand a Data warehouse with Hadoop and Big Data
Expand a Data warehouse with Hadoop and Big Data
 
Insights into Real World Data Management Challenges
Insights into Real World Data Management ChallengesInsights into Real World Data Management Challenges
Insights into Real World Data Management Challenges
 
1° Sessione Oracle CRUI: Analytics Data Lab, the power of Big Data Investiga...
1° Sessione Oracle CRUI: Analytics Data Lab,  the power of Big Data Investiga...1° Sessione Oracle CRUI: Analytics Data Lab,  the power of Big Data Investiga...
1° Sessione Oracle CRUI: Analytics Data Lab, the power of Big Data Investiga...
 
Embedded-ml(ai)applications - Bjoern Staender
Embedded-ml(ai)applications - Bjoern StaenderEmbedded-ml(ai)applications - Bjoern Staender
Embedded-ml(ai)applications - Bjoern Staender
 
A6 harnessing the power of big data and business analytics to transform bus...
A6   harnessing the power of big data and business analytics to transform bus...A6   harnessing the power of big data and business analytics to transform bus...
A6 harnessing the power of big data and business analytics to transform bus...
 
Oracle Big Data Appliance and Big Data SQL for advanced analytics
Oracle Big Data Appliance and Big Data SQL for advanced analyticsOracle Big Data Appliance and Big Data SQL for advanced analytics
Oracle Big Data Appliance and Big Data SQL for advanced analytics
 
A7 getting value from big data how to get there quickly and leverage your c...
A7   getting value from big data how to get there quickly and leverage your c...A7   getting value from big data how to get there quickly and leverage your c...
A7 getting value from big data how to get there quickly and leverage your c...
 
Complement Your Existing Data Warehouse with Big Data & Hadoop
Complement Your Existing Data Warehouse with Big Data & HadoopComplement Your Existing Data Warehouse with Big Data & Hadoop
Complement Your Existing Data Warehouse with Big Data & Hadoop
 
Extreme Analytics - What's New With Oracle Exalytics X3-4 & T5-8?
Extreme Analytics - What's New With Oracle Exalytics X3-4 & T5-8?Extreme Analytics - What's New With Oracle Exalytics X3-4 & T5-8?
Extreme Analytics - What's New With Oracle Exalytics X3-4 & T5-8?
 
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BIAugmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
 
Conociendo y entendiendo a tu cliente mediante monitoreo, analíticos y big data
Conociendo y entendiendo a tu cliente mediante monitoreo, analíticos y big dataConociendo y entendiendo a tu cliente mediante monitoreo, analíticos y big data
Conociendo y entendiendo a tu cliente mediante monitoreo, analíticos y big data
 
Oracle Data Science Platform
Oracle Data Science PlatformOracle Data Science Platform
Oracle Data Science Platform
 
13 2792 big-data_keynote_presentation_finalpass_05_d_v02
13 2792 big-data_keynote_presentation_finalpass_05_d_v0213 2792 big-data_keynote_presentation_finalpass_05_d_v02
13 2792 big-data_keynote_presentation_finalpass_05_d_v02
 
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaIs your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
 
The Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of HadoopThe Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of Hadoop
 
2013 05 Oracle big_dataapplianceoverview
2013 05 Oracle big_dataapplianceoverview2013 05 Oracle big_dataapplianceoverview
2013 05 Oracle big_dataapplianceoverview
 
Intro to Big Data and Apache Hadoop by Dr. Amr Awadallah at CLOUD WEEKEND '13...
Intro to Big Data and Apache Hadoop by Dr. Amr Awadallah at CLOUD WEEKEND '13...Intro to Big Data and Apache Hadoop by Dr. Amr Awadallah at CLOUD WEEKEND '13...
Intro to Big Data and Apache Hadoop by Dr. Amr Awadallah at CLOUD WEEKEND '13...
 

Plus de Dr. Wilfred Lin (Ph.D.)

Plus de Dr. Wilfred Lin (Ph.D.) (20)

K2 keynote 2_oracle_saa_s_strategy
K2 keynote 2_oracle_saa_s_strategyK2 keynote 2_oracle_saa_s_strategy
K2 keynote 2_oracle_saa_s_strategy
 
K1 keynote 1_oracle_integrated_cloud_strategy_and_vision_for_journey_to_cloud...
K1 keynote 1_oracle_integrated_cloud_strategy_and_vision_for_journey_to_cloud...K1 keynote 1_oracle_integrated_cloud_strategy_and_vision_for_journey_to_cloud...
K1 keynote 1_oracle_integrated_cloud_strategy_and_vision_for_journey_to_cloud...
 
C7 engineered data_protection_for_oracle_databases
C7 engineered data_protection_for_oracle_databasesC7 engineered data_protection_for_oracle_databases
C7 engineered data_protection_for_oracle_databases
 
C6 oracles storage_strategy_from_databases_to_engineered_systems_to_cloud
C6 oracles storage_strategy_from_databases_to_engineered_systems_to_cloudC6 oracles storage_strategy_from_databases_to_engineered_systems_to_cloud
C6 oracles storage_strategy_from_databases_to_engineered_systems_to_cloud
 
C5 journey to_the_cloud_with_oracle_sparc
C5 journey to_the_cloud_with_oracle_sparcC5 journey to_the_cloud_with_oracle_sparc
C5 journey to_the_cloud_with_oracle_sparc
 
C4 optimizing your_application_infrastructure
C4 optimizing your_application_infrastructureC4 optimizing your_application_infrastructure
C4 optimizing your_application_infrastructure
 
C3 bringing the_power_of_the_public_cloud_to_your_secure_data_center
C3 bringing the_power_of_the_public_cloud_to_your_secure_data_centerC3 bringing the_power_of_the_public_cloud_to_your_secure_data_center
C3 bringing the_power_of_the_public_cloud_to_your_secure_data_center
 
C2 five journeys_to_the_cloud
C2 five journeys_to_the_cloudC2 five journeys_to_the_cloud
C2 five journeys_to_the_cloud
 
C1 keynote creating_your_enterprise_cloud_strategy
C1 keynote creating_your_enterprise_cloud_strategyC1 keynote creating_your_enterprise_cloud_strategy
C1 keynote creating_your_enterprise_cloud_strategy
 
B7 api management_enabling_digital_transformation
B7 api management_enabling_digital_transformationB7 api management_enabling_digital_transformation
B7 api management_enabling_digital_transformation
 
B6 improve operational_efficiency_through_process_and_document_collaboration
B6 improve operational_efficiency_through_process_and_document_collaborationB6 improve operational_efficiency_through_process_and_document_collaboration
B6 improve operational_efficiency_through_process_and_document_collaboration
 
B5 modernise your_cloud_to_on_premises_integration
B5 modernise your_cloud_to_on_premises_integrationB5 modernise your_cloud_to_on_premises_integration
B5 modernise your_cloud_to_on_premises_integration
 
B4 making dev_ops_really_work
B4 making dev_ops_really_workB4 making dev_ops_really_work
B4 making dev_ops_really_work
 
B3 getting started_with_cloud_native_development
B3 getting started_with_cloud_native_developmentB3 getting started_with_cloud_native_development
B3 getting started_with_cloud_native_development
 
B2 oracle mobile_any_app_to_any_service_lets_go
B2 oracle mobile_any_app_to_any_service_lets_goB2 oracle mobile_any_app_to_any_service_lets_go
B2 oracle mobile_any_app_to_any_service_lets_go
 
B1 keynote reimagine_application_development_and_delivery_with_oracle_platform
B1 keynote reimagine_application_development_and_delivery_with_oracle_platformB1 keynote reimagine_application_development_and_delivery_with_oracle_platform
B1 keynote reimagine_application_development_and_delivery_with_oracle_platform
 
A7 storytelling with_oracle_analytics_cloud
A7 storytelling with_oracle_analytics_cloudA7 storytelling with_oracle_analytics_cloud
A7 storytelling with_oracle_analytics_cloud
 
A6 big data_in_the_cloud
A6 big data_in_the_cloudA6 big data_in_the_cloud
A6 big data_in_the_cloud
 
A5 cloud security_now_a_reason_to_move_to_the_cloud
A5 cloud security_now_a_reason_to_move_to_the_cloudA5 cloud security_now_a_reason_to_move_to_the_cloud
A5 cloud security_now_a_reason_to_move_to_the_cloud
 
A4 drive dev_ops_agility_and_operational_efficiency
A4 drive dev_ops_agility_and_operational_efficiencyA4 drive dev_ops_agility_and_operational_efficiency
A4 drive dev_ops_agility_and_operational_efficiency
 

Dernier

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 

Dernier (20)

Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 

6 enriching your data warehouse with big data and hadoop

  • 1. Enriching Your Data Warehouse With Hadoop Presenting with Peter Yu, Senior Director iTech Asia-Pacific & Japan
  • 2. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.2 Agenda  Opportunity  Challenges  Strategy  Examples  Best Practices
  • 3. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.3 Connecting With Your Customer
  • 4. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.4 Big Data Improves Operational Performance Source: Economist Intelligence Unit, .”The Deciding Factor: Big Data and Decision Making“ Big data benefits seen growing substantially
  • 5. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.5 Use Data 12% Executives who feel they understand the impact data will have on their organizations Produce Data The Problem
  • 6. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.6
  • 7. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.7 DON’T BELIEVE EVERYTHING YOU READ
  • 8. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.8 Electricity: AC or DC?
  • 9. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.9 RDBMSRDBMS Today Discovery & Analytics Business Intelligence
  • 10. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.10 RDBMSRDBMS Today Discovery & Analytics Business Intelligence External ETL Cluster
  • 11. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.11 RDBMSRDBMS What about Archived Data? Discovery & Analytics Business Intelligence Archive External ETL Cluster
  • 12. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.12 RDBMSRDBMS What about New Data? Discovery & Analytics Business Intelligence ? Archive External ETL Cluster
  • 13. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.13 RDBMSRDBMS Expand Your Data Warehouse Discovery & Analytics Business Intelligence ? External ETL Cluster Archive
  • 14. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.14 RDBMS Hadoop RDBMS Hadoop Integrate Hadoop and RDBMS Discovery & Analytics Business Intelligence
  • 15. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.15 “[Facebook] started in the Hadoop world. We are now bringing in relational to enhance that. We're kind of going [in] the other direction ... We've been there, and [we] realized that using the wrong technology for certain kinds of problems can be difficult.” Ken Rubin Director of Analytics Facebook http://tdwi.org/Articles/2013/05/06/Facebooks-Relational-Platform.aspx?Page=1
  • 16. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.16 Get Fast Answers to New Questions Create a Data Reservoir Predict More, More Accurately Accelerate Data-Driven Action Key Hadoop Use Cases Complementing An Existing Data Warehouse ETL
  • 17. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.17 • Transactional data • Customer information • Web log and session data • Machine/Sensor data • Historical data Data Reservoir Keep All Potentially Valuable Data in One Place
  • 18. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.18 RDBMSRDBMS Today Discovery & Analytics Business Intelligence ? Archive External ETL Cluster
  • 19. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.19 RDBMS Hadoop RDBMS Hadoop Create an Active Archive with Hadoop Discovery & Analytics Business Intelligence Σ
  • 20. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.20  80 years of historical data in Hadoop  Structured and unstructured data includes customer data, economic trends, telematic sensors, weather, public data  Integrated with mainframes and EDWs  Before Hadoop, could analyze only one state, took 24 hours  With Cloudera, can analyze risk across all 50 states, in 16 hours (500x improvement)  First 3 use cases: Data hub, ETL offload, advanced analytics Comprehensive risk analysis Customer Example: Insurer Real-Time Data Hub Cloudera Hadoop EDW and Mainframe Customer Data
  • 21. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.21  Optimize offers  Increase revenue  Lower costs (offer bytes?)  Reduce complexity  Faster time to value Maximize offer effectiveness Customer Example: Travel Industry Big Data Appliance Legacy Data Warehouse Customer Data
  • 22. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.22  Fast access to 85% more data  Lowered costs  Simplified architecture  Faster time to value Compliance, cost reduction Customer Example: Regional Bank Big Data Appliance Oracle Exadata Mainframe, RDBMS Oracle Data Integrator
  • 23. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.23 • Batch window constraints • Adding value vs. adding cost • Analysis vs. Transformation • Analysis vs. Data movement and replication • Uncertain value of new data sources ETL Challenges Today
  • 24. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.24 Discovery & Analytics Business Intelligence RDBMSRDBMS Typical ETL Today External ETL Cluster
  • 25. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.25 RDBMS Hadoop RDBMS Hadoop ETL Offload with Hadoop Discovery & Analytics Business Intelligence Σ
  • 26. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.26  Exponential growth in data, generated by new consumer devices  ETL and storage constraints limited analytics to 1% sample  Now combined Oracle Exadata and Cloudera Hadoop delivers analytics on 100% of data (half a PB per day!)  Query times reduced dramatically (i.e. from 4 days to 53 minutes)  90% reduction of ETL code base From 1% sampling to 100% analysis Customer Example: Communications Services Archive Storage Data Warehouse Complex Correlation Alerting Filter & Split Event Monitoring Streaming ETL Streaming ETL Teleco m Services Before
  • 27. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.27  Exponential growth in data, generated by new consumer devices  ETL and storage constraints limited analytics to 1% sample  Now combined Oracle Exadata and Cloudera Hadoop delivers analytics on 100% of data (half a PB per day!)  Query times reduced dramatically (i.e. from 4 days to 53 minutes)  90% reduction of ETL code base From 1% sampling to 100% analysis Customer Example: Communications Services Archive Storage Data Warehouse Complex Correlation Alerting Filter & Split Event Monitoring Streaming ETL Streaming ETL Teleco m Services Before Data Warehouse Alerting Filter & Split Event Monitoring Hadoop Archive Storage ETL Correlation Stage 1 DWH Teleco m Services After
  • 28. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.28 Identifying Potential Opportunities Do you have a problem with ETL performance? Do you have potentially valuable data that you aren’t using, but might deliver new insight? Should you focus on analyzing structured data, unstructured data, or a combination? Are Big Data solutions already being built as silos?
  • 29. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.29 Challenges in the Way & Oracle Strategies CHALLENGES ORACLE STRATEGY • Fragmented Solutions • Specialized but integrated data stores and tools • Difficulty of Self-Service BI • Flexible, guided, automated BI & data discovery • Data Not Current • Solutions for Just-in-Time well defined data • Time to ROI / Development Time • Horizontal & industry pre-built solutions, engineered systems • Growing Diversity of Data & Users • Enterprise solutions for 1000s diverse users, petabytes data • Manageability, Security, Cost • Centrally managed with advanced security / governance
  • 30. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.30 Big Data Readiness The R&D Prototype Stage  Skills needed – Distributed data deployment (e.g. Hadoop) – Python or Java programming with MapReduce – Statistical analysis (e.g. R) – Data integration – Ability to formulate business hypotheses – Ability to convey business value of Big Data
  • 31. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.31 Are You Ready for Big Data? Contact Your Account Team  Have you delivered mature analytics solutions for structured data?  Can Big Data make a difference to the business?  Have you built a Big Data prototype, built skills, and proved value?  Do you have an enterprise integration & deployment strategy for Big Data?
  • 32. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.32 Copyright © 2013, Oracle and/or its affiliates. All rights reserved.32 Continuous Innovation Big Data at Work
  • 33. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.33
  • 34. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.34
  • 35. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.35 RDBMS Hadoop RDBMS Hadoop Master diagram Discovery & Analytics Business IntelligenceExternal ETL Cluster Data Mart Data Mart Archive ΣΣEvent Processing
  • 36. Copyright © 2013, Oracle and/or its affiliates. All rights reserved.36 Discovery & Analytics Business Intelligence RDBMSRDBMS Master diagram without Hadoop External ETL Cluster Data Mart Data Mart Archive ΣEvent Processing