Mark Rittman, Oracle ACE Director
ODTUG KScope’18, Orlando June 2018
From BI Developer to Data Engineer with Oracle Analytics Cloud Data Lake Edition
• Oracle ACE Director, Independent Analyst
• Past ODTUG Exec Board Member + Oracle Scene Editor
• Author of two books on Oracle BI
• Co-founder & CTO of Rittman Mead
• 15+ Years in Oracle BI, DW, ETL + now Big Data
• Host of the Drill to Detail Podcast (www.drilltodetail.com)
• Based in Brighton & work in London, UK
About the Presenter
Data Lakes are the new Data Warehouse
•Data now landed in Hadoop clusters, NoSQL databases and Cloud Storage
•Flexible data storage platform with cheap storage, flexible schema support + compute
•Solves the problem of how to store new types of data, with flexibility on when to process it
•Typically used by data scientists as source for new models or insights of interest
•Data Warehouses still have their place
•But very few new ones are being built
•Nobody leaves college dreaming of being an ETL developer
•Except Michael Rainey
Meet the New Data Warehouse : The “Data Lake”
From “What is a Data Lake”, https://aws.amazon.com/big-data/datalakes-and-analytics/what-is-a-data-lake/
Data Lakes Need Data Engineers
What is Data Engineering?
•When “Big Data” first became popular, all users were termed “data scientists”
•Over time, this evolved into two distinct roles:
• Data Scientists, who focus on new insights + models, working from laptops using R + sampled data
• Data Engineers, who make at-scale data consumable in some form, either directly or by data scientists

•Data Engineers
•Can code, run clusters
•Create data pipelines & prepare data
•Train and build predefined ML models
•Knowledge of the math of ML limited
•They may be DBAs, BI developers
•Experience with DevOps, cloud



and….
What is Data Engineering?
•Oracle’s Cloud Analytics platform, built on Oracle BI EE and Oracle DV technology
•Available as customer-managed and Oracle-managed (Autonomous Analytics Cloud)
•Available as three packaging options
•Oracle Analytics Cloud Standard (aka Oracle DV in Oracle Cloud)
•Oracle Analytics Cloud Enterprise (aka OBIEE12c in Oracle Cloud)
•Oracle Analytics Cloud Data Lake (aka …?)
Oracle Analytics Cloud Data Lake Edition
Oracle Analytics Cloud Data Lake Edition
Oracle Analytics Cloud
Data Sources: social, sensors, enterprise, personal, SaaS, mobile
Developers, Executives, Data Stewards, Analysts, Data Engineers
Data Catalog: One place to collect, search, explore & curate all data
Data Preparation: Prepare enriched, sharable, & reliable datasets across all data
Data Analysis: Understand & act using smarts: search, visualization, & storytelling
Oracle Database Services, Oracle Big Data Cloud, Oracle Storage Cloud
•All functionality in OAC Standard Edition plus
•Integration with Oracle Big Data Cloud
•Additional data flow/data prep operators
•ML model build and train capability
•Text analytics and NLP processing
•Data flow execution in Apache Spark (*)
•Replicate from Cloud and On-Premise Apps
•Oracle Service Cloud – Taleo, Fusion Apps
•Incremental Ingest from DBs, Cloud + files
•Continuous Ingest from GoldenGate
OAC Data Lake Edition: Key Features
Integrates with Oracle Big Data Cloud and Event Hub
Long-Term Replacement for Big Data Discovery
•Visual Face of Data in Hadoop
•Data Preparation and Enrichment
•Spark Data Transformations
•Standalone technology + processes
•Visual Face of Data in Cloud
•Data Preparation and Enrichment
•Spark Data Transformations
•Oracle Analytics Cloud
•Explore, catalog and discover data in Oracle Big Data Cloud, Oracle Database
•Enrich and transform raw data into valuable information and insights
•Analyze at-scale data using Data Visualization
•Combine data from SaaS, social and real-time
•Create predictive and classification models
•Analyze the sentiment in social media feeds
•Data engineering without the hand-coding
OAC Data Lake Edition Use-Cases
Example Scenario
Scenario : Ingest and Analyze Real-Time Feeds
[Diagram, built up in stages: IoT events and social media data, both collected via Fluentd, cross the firewall into Event Hub Cloud (REST Proxy and Kafka Connect) for INGEST, land in Oracle Big Data Cloud, and are then handled by Oracle Analytics Cloud Data Lake Edition for TRANSFORM and ANALYZE.]
Scenario : Ingest and Analyze Real-Time Feeds
[Diagram, built up in stages: Oracle Cloud Infrastructure (Region), Availability Domain 1, VCN, ID & Access Management, Auditing, Object Storage.]
Cloud Infrastructure
Oracle Cloud Platform-as-a-Service Stack
Oracle Big Data Cloud, Ambari and Hive ThriftServer
Oracle Event Hub Cloud Service - Dedicated
OAC Data Lake Edition
•Catalog of all data assets
•Projects
•Connection to Hive Thrift Server
•IoT and Social Media Data Sets
•Data Flows and Sequences
•Managed data lake store
•Control the lifecycle of your data lake assets
•Security
•Scheduling
Managing and Cataloging the Cloud Data Lake
Data Preparation Features from OAC Standard Edition
1. Split timestamp field that’s not in valid format
2. Choose “space” character as delimiter
3. Convert the first split column into a date datatype
4. Choose the correct date format for this field’s values
5. Repeat for the TIME split column, concatenate with ’T’ in-between and finally convert resulting field into TIMESTAMP
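
The steps above are performed in the data preparation UI. As a rough code equivalent, the sketch below does the same split, convert and concatenate sequence in plain Python. The sample value and the day/month/year date format are assumptions, since the slide does not show the raw field, and `to_timestamp` is a hypothetical helper name.

```python
from datetime import datetime


def to_timestamp(raw: str, date_format: str = "%d/%m/%Y") -> datetime:
    """Turn a 'DATE TIME' string into a timestamp via the five UI steps."""
    # Steps 1-2: split the raw field on the space delimiter.
    date_part, time_part = raw.split(" ", 1)
    # Steps 3-4: convert the first split column to a date using an
    # explicit format (assumed here to be day/month/year).
    date_value = datetime.strptime(date_part, date_format).date()
    # Step 5: concatenate date and time with 'T' in between, then
    # convert the resulting ISO 8601 text into a timestamp.
    iso_text = f"{date_value.isoformat()}T{time_part}"
    return datetime.fromisoformat(iso_text)


if __name__ == "__main__":
    # Hypothetical raw value; the real feed format is not shown in the deck.
    print(to_timestamp("12/06/2018 14:30:05").isoformat())
```

Passing a different `date_format` covers step 4, where the correct format has to be chosen for the field's actual values.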
Data Flows are sequences of data transformations executed on the BI Server; Spark execution is on the roadmap for OAC DL
Create Essbase Cube
Time Series Forecast
Sentiment Analysis
Predictive / ML Model Train and Build
Run custom R and Python scripts
Extended Data Flow Capability for Data Lake Edition
Data Flows are based on the technology previously announced as “Dataflow ML”, now delivered as part of Oracle Analytics Cloud
Example : Enrich With Sentiment, Then Visualize
1. Add Sentiment Analysis step to data flow, persist final enriched dataset back to Hive table
2. Add a calculation to convert sentiment description values to positive/negative cumulative score
3. Analyze Results in Data Visualization UI
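
The calculation in step 2 can be sketched outside the tool as a label-to-number mapping followed by a running total. The label strings ("Positive", "Neutral", "Negative") and the +1 / 0 / -1 weights below are assumptions for illustration; the deck does not list the values the sentiment step actually emits.

```python
from itertools import accumulate

# Assumed mapping from sentiment description to a numeric score.
SENTIMENT_SCORES = {"Positive": 1, "Neutral": 0, "Negative": -1}


def cumulative_sentiment(labels):
    """Convert sentiment labels to scores and return the running total."""
    # Unknown labels are treated as neutral rather than raising an error.
    scores = [SENTIMENT_SCORES.get(label, 0) for label in labels]
    return list(accumulate(scores))


if __name__ == "__main__":
    # Hypothetical sequence of per-message sentiment labels, in time order.
    feed = ["Positive", "Negative", "Negative", "Neutral", "Positive"]
    print(cumulative_sentiment(feed))
```

A running total like this is what makes the trend visible once the enriched dataset is charted over time.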
Using Explain Feature to Automate Deriving Context
1. Right-click on attribute column to “explain” the drivers of its values
2. ML algorithm explains basic facts, drivers, anomalies and identifies segments of interest
Display Selected Attribute Explanations on Dashboard
Transform, Aggregate and Join Datasets
Multi-step dataset joins
Aggregate Datasets
Binning and Grouping
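
As a minimal illustration of what binning followed by aggregation does to a dataset, the sketch below buckets rows by a numeric column and averages a measure per bucket. The column meanings, bin thresholds and sample rows are all hypothetical and are not taken from the slides.

```python
from collections import defaultdict


def bin_distance(distance_km):
    """Assign a distance to a named bin (thresholds are assumptions)."""
    if distance_km < 20:
        return "short"
    if distance_km < 50:
        return "medium"
    return "long"


def average_by_bin(rides):
    """Group (distance_km, measure) rows by bin and average the measure."""
    totals = defaultdict(lambda: [0.0, 0])  # bin label -> [sum, count]
    for distance_km, measure in rides:
        bucket = totals[bin_distance(distance_km)]
        bucket[0] += measure
        bucket[1] += 1
    return {label: total / count for label, (total, count) in totals.items()}


if __name__ == "__main__":
    # Hypothetical rows: (distance in km, measure to aggregate).
    print(average_by_bin([(10, 2), (15, 4), (30, 6), (60, 10)]))
```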
Predictive Modeling and Forecasting
1. Select Prediction Model best suited to predicting Kudos from Strava bike rides
2. Select column whose values are to be predicted, and model parameter values
3. Train model and then test against remaining dataset
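
The train-then-test pattern in step 3 can be sketched in a few lines of plain Python. This is not the model OAC builds: it swaps in a one-variable least-squares line as a stand-in, and the 70/30 split, the distance-to-kudos relationship and the sample data are all assumptions made for illustration.

```python
import random


def train_test_split(rows, train_fraction=0.7, seed=42):
    """Shuffle the rows and hold back the remainder for testing."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


def fit_line(rows):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(rows)
    mean_x = sum(x for x, _ in rows) / n
    mean_y = sum(y for _, y in rows) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in rows)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in rows)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def mean_absolute_error(rows, slope, intercept):
    """Average absolute gap between predicted and actual values."""
    return sum(abs(y - (slope * x + intercept)) for x, y in rows) / len(rows)


if __name__ == "__main__":
    # Hypothetical rides: (distance in km, kudos received).
    rides = [(d, 2 * d + 1) for d in range(1, 11)]
    train, test = train_test_split(rides)
    slope, intercept = fit_line(train)
    print(slope, intercept, mean_absolute_error(test, slope, intercept))
```

Scoring the model only on rows it was not trained on is the point of the "remaining dataset" in step 3: it gives an error estimate that is not flattered by memorised training data.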
Analyzing Data At-Scale Hosted on Big Data Cloud
•Data Flow feature enables multi-step transform of ingested data
•Sentiment Analyze operator useful for social/text data enrichment
•Enables BI developers to train and build predictive models
•ML-driven Explain feature automates understanding of context
•Basic data engineering for BI developers
•More data lake features expected in v5, v6
OAC Data Lake: What Works, What’s Coming?
Integration of features from Oracle Big Data Preparation Cloud Service
Enhanced Summary view highlights data shape and data quality
Coming soon to London, Autumn/Fall 2018
https://mjr-analytics.com

Contenu connexe

Tendances

JSBI Presentation Big Data Hyperion OBIEE Integration16 2
JSBI Presentation Big Data Hyperion OBIEE Integration16 2JSBI Presentation Big Data Hyperion OBIEE Integration16 2
JSBI Presentation Big Data Hyperion OBIEE Integration16 2
Jeff Shauer
 
Webinar future dataintegration-datamesh-and-goldengatekafka
Webinar future dataintegration-datamesh-and-goldengatekafkaWebinar future dataintegration-datamesh-and-goldengatekafka
Webinar future dataintegration-datamesh-and-goldengatekafka
Jeffrey T. Pollock
 

Tendances (20)

Moving OBIEE to Oracle Analytics Cloud
Moving OBIEE to Oracle Analytics CloudMoving OBIEE to Oracle Analytics Cloud
Moving OBIEE to Oracle Analytics Cloud
 
JSBI Presentation Big Data Hyperion OBIEE Integration16 2
JSBI Presentation Big Data Hyperion OBIEE Integration16 2JSBI Presentation Big Data Hyperion OBIEE Integration16 2
JSBI Presentation Big Data Hyperion OBIEE Integration16 2
 
Oracle Analytics Live Webinar August 2021
Oracle Analytics Live Webinar August 2021Oracle Analytics Live Webinar August 2021
Oracle Analytics Live Webinar August 2021
 
On Demand BI
On Demand BIOn Demand BI
On Demand BI
 
Informix Corporate Image and Brand (1990)
Informix Corporate Image and Brand (1990)Informix Corporate Image and Brand (1990)
Informix Corporate Image and Brand (1990)
 
Oracle analytics Live - January 2021
Oracle analytics Live - January 2021Oracle analytics Live - January 2021
Oracle analytics Live - January 2021
 
Webinar future dataintegration-datamesh-and-goldengatekafka
Webinar future dataintegration-datamesh-and-goldengatekafkaWebinar future dataintegration-datamesh-and-goldengatekafka
Webinar future dataintegration-datamesh-and-goldengatekafka
 
Oracle Data Integration CON9737 at OpenWorld
Oracle Data Integration CON9737 at OpenWorldOracle Data Integration CON9737 at OpenWorld
Oracle Data Integration CON9737 at OpenWorld
 
Enterprise data science - What it takes to build?
Enterprise data science - What it takes to build?Enterprise data science - What it takes to build?
Enterprise data science - What it takes to build?
 
2010.03.16 Pollock.Edw2010.Modern D Ifor Warehousing
2010.03.16 Pollock.Edw2010.Modern D Ifor Warehousing2010.03.16 Pollock.Edw2010.Modern D Ifor Warehousing
2010.03.16 Pollock.Edw2010.Modern D Ifor Warehousing
 
Why our customers choose teradata.
Why our customers choose teradata.Why our customers choose teradata.
Why our customers choose teradata.
 
Architecting an Open Data Lake for the Enterprise
Architecting an Open Data Lake for the EnterpriseArchitecting an Open Data Lake for the Enterprise
Architecting an Open Data Lake for the Enterprise
 
The Impact of SMACT on the Data Management Stack
The Impact of SMACT on the Data Management StackThe Impact of SMACT on the Data Management Stack
The Impact of SMACT on the Data Management Stack
 
Analyst Toolbox August 2017
Analyst Toolbox August 2017Analyst Toolbox August 2017
Analyst Toolbox August 2017
 
Oracle Big Data Appliance and Big Data SQL for advanced analytics
Oracle Big Data Appliance and Big Data SQL for advanced analyticsOracle Big Data Appliance and Big Data SQL for advanced analytics
Oracle Big Data Appliance and Big Data SQL for advanced analytics
 
Big Data and BI Tools - BI Reporting for Bay Area Startups User Group
Big Data and BI Tools - BI Reporting for Bay Area Startups User GroupBig Data and BI Tools - BI Reporting for Bay Area Startups User Group
Big Data and BI Tools - BI Reporting for Bay Area Startups User Group
 
Operational Analytics Using Spark and NoSQL Data Stores
Operational Analytics Using Spark and NoSQL Data StoresOperational Analytics Using Spark and NoSQL Data Stores
Operational Analytics Using Spark and NoSQL Data Stores
 
The structured streaming upgrade to Apache Spark and how enterprises can bene...
The structured streaming upgrade to Apache Spark and how enterprises can bene...The structured streaming upgrade to Apache Spark and how enterprises can bene...
The structured streaming upgrade to Apache Spark and how enterprises can bene...
 
B5 modernise your_cloud_to_on_premises_integration
B5 modernise your_cloud_to_on_premises_integrationB5 modernise your_cloud_to_on_premises_integration
B5 modernise your_cloud_to_on_premises_integration
 
Oracle Gold Partner - MindStream Analytics
Oracle Gold Partner - MindStream AnalyticsOracle Gold Partner - MindStream Analytics
Oracle Gold Partner - MindStream Analytics
 

Similaire à From BI Developer to Data Engineer with Oracle Analytics Cloud Data Lake Edition

Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to Mesh
Jeffrey T. Pollock
 
Monitoring docker, k8s and your applications with the elastic stack
Monitoring docker, k8s and your applications with the elastic stackMonitoring docker, k8s and your applications with the elastic stack
Monitoring docker, k8s and your applications with the elastic stack
SmartWave
 
2018 02-08-what's-new-in-apache-spark-2.3
2018 02-08-what's-new-in-apache-spark-2.3 2018 02-08-what's-new-in-apache-spark-2.3
2018 02-08-what's-new-in-apache-spark-2.3
Chester Chen
 

Similaire à From BI Developer to Data Engineer with Oracle Analytics Cloud Data Lake Edition (20)

AMIS Oracle OpenWorld en Code One Review 2018 - Pillar 2: Custom Application ...
AMIS Oracle OpenWorld en Code One Review 2018 - Pillar 2: Custom Application ...AMIS Oracle OpenWorld en Code One Review 2018 - Pillar 2: Custom Application ...
AMIS Oracle OpenWorld en Code One Review 2018 - Pillar 2: Custom Application ...
 
AMIS Oracle OpenWorld & CodeOne Review - Pillar 2 - Custom Application Develo...
AMIS Oracle OpenWorld & CodeOne Review - Pillar 2 - Custom Application Develo...AMIS Oracle OpenWorld & CodeOne Review - Pillar 2 - Custom Application Develo...
AMIS Oracle OpenWorld & CodeOne Review - Pillar 2 - Custom Application Develo...
 
Unlocking the Value of Your Data Lake
Unlocking the Value of Your Data LakeUnlocking the Value of Your Data Lake
Unlocking the Value of Your Data Lake
 
Flink in Zalando's world of Microservices
Flink in Zalando's world of Microservices   Flink in Zalando's world of Microservices
Flink in Zalando's world of Microservices
 
Flink in Zalando's World of Microservices
Flink in Zalando's World of Microservices  Flink in Zalando's World of Microservices
Flink in Zalando's World of Microservices
 
Time's Up! Getting Value from Big Data Now
Time's Up! Getting Value from Big Data NowTime's Up! Getting Value from Big Data Now
Time's Up! Getting Value from Big Data Now
 
OTN EMEA TOUR 2016 - OBIEE12c New Features for End-Users, Developers and Sys...
OTN EMEA TOUR 2016  - OBIEE12c New Features for End-Users, Developers and Sys...OTN EMEA TOUR 2016  - OBIEE12c New Features for End-Users, Developers and Sys...
OTN EMEA TOUR 2016 - OBIEE12c New Features for End-Users, Developers and Sys...
 
Solving Enterprise Data Challenges with Apache Arrow
Solving Enterprise Data Challenges with Apache ArrowSolving Enterprise Data Challenges with Apache Arrow
Solving Enterprise Data Challenges with Apache Arrow
 
IBM Cloud Day January 2021 - A well architected data lake
IBM Cloud Day January 2021 - A well architected data lakeIBM Cloud Day January 2021 - A well architected data lake
IBM Cloud Day January 2021 - A well architected data lake
 
Oracle Stream Analytics - Simplifying Stream Processing
Oracle Stream Analytics - Simplifying Stream ProcessingOracle Stream Analytics - Simplifying Stream Processing
Oracle Stream Analytics - Simplifying Stream Processing
 
Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to Mesh
 
Monitoring docker, k8s and your applications with the elastic stack
Monitoring docker, k8s and your applications with the elastic stackMonitoring docker, k8s and your applications with the elastic stack
Monitoring docker, k8s and your applications with the elastic stack
 
Streaming Visualization
Streaming VisualizationStreaming Visualization
Streaming Visualization
 
The Scout24 Data Platform (A Technical Deep Dive)
The Scout24 Data Platform (A Technical Deep Dive)The Scout24 Data Platform (A Technical Deep Dive)
The Scout24 Data Platform (A Technical Deep Dive)
 
ODSC East 2020 Accelerate ML Lifecycle with Kubernetes and Containerized Da...
ODSC East 2020   Accelerate ML Lifecycle with Kubernetes and Containerized Da...ODSC East 2020   Accelerate ML Lifecycle with Kubernetes and Containerized Da...
ODSC East 2020 Accelerate ML Lifecycle with Kubernetes and Containerized Da...
 
Serverless SQL
Serverless SQLServerless SQL
Serverless SQL
 
What's New in Upcoming Apache Spark 2.3
What's New in Upcoming Apache Spark 2.3What's New in Upcoming Apache Spark 2.3
What's New in Upcoming Apache Spark 2.3
 
2018 02-08-what's-new-in-apache-spark-2.3
2018 02-08-what's-new-in-apache-spark-2.3 2018 02-08-what's-new-in-apache-spark-2.3
2018 02-08-what's-new-in-apache-spark-2.3
 
Getting Into the Business Intelligence Game: Migrating OBIA to the Cloud
Getting Into the Business Intelligence Game: Migrating OBIA to the CloudGetting Into the Business Intelligence Game: Migrating OBIA to the Cloud
Getting Into the Business Intelligence Game: Migrating OBIA to the Cloud
 
Apache Arrow: Open Source Standard Becomes an Enterprise Necessity
Apache Arrow: Open Source Standard Becomes an Enterprise NecessityApache Arrow: Open Source Standard Becomes an Enterprise Necessity
Apache Arrow: Open Source Standard Becomes an Enterprise Necessity
 

Plus de Rittman Analytics

Plus de Rittman Analytics (18)

From Zero to One with Rittman Analytics
From Zero to One with Rittman AnalyticsFrom Zero to One with Rittman Analytics
From Zero to One with Rittman Analytics
 
Where Digital Analytics is taking BI and Big Data
Where Digital Analytics is taking BI and Big DataWhere Digital Analytics is taking BI and Big Data
Where Digital Analytics is taking BI and Big Data
 
User Engagement Analysis using the new Looker System Activity Model
User Engagement Analysis using the new Looker System Activity ModelUser Engagement Analysis using the new Looker System Activity Model
User Engagement Analysis using the new Looker System Activity Model
 
Planning a Strategy for Autonomous Analytics and Data Warehousing
Planning a Strategy for Autonomous Analytics and Data WarehousingPlanning a Strategy for Autonomous Analytics and Data Warehousing
Planning a Strategy for Autonomous Analytics and Data Warehousing
 
Where Digital Analytics is taking BI and Big Data
Where Digital Analytics is taking BI and Big DataWhere Digital Analytics is taking BI and Big Data
Where Digital Analytics is taking BI and Big Data
 
Data Warehouse Like a Tech Startup with Oracle Autonomous Data Warehouse
Data Warehouse Like a Tech Startup with Oracle Autonomous Data WarehouseData Warehouse Like a Tech Startup with Oracle Autonomous Data Warehouse
Data Warehouse Like a Tech Startup with Oracle Autonomous Data Warehouse
 
From BI Developer to Data Engineer with Oracle Analytics Cloud, Data Lake
From BI Developer to Data Engineer with Oracle Analytics Cloud, Data LakeFrom BI Developer to Data Engineer with Oracle Analytics Cloud, Data Lake
From BI Developer to Data Engineer with Oracle Analytics Cloud, Data Lake
 
Using Google Cloud Dataprep to Wrangle Strava, Fitbit and Google Locations Data
Using Google Cloud Dataprep to Wrangle Strava, Fitbit and Google Locations DataUsing Google Cloud Dataprep to Wrangle Strava, Fitbit and Google Locations Data
Using Google Cloud Dataprep to Wrangle Strava, Fitbit and Google Locations Data
 
Using Google Cloud Dataprep to Wrangle Strava, Fitbit and Google Locations Data
Using Google Cloud Dataprep to Wrangle Strava, Fitbit and Google Locations DataUsing Google Cloud Dataprep to Wrangle Strava, Fitbit and Google Locations Data
Using Google Cloud Dataprep to Wrangle Strava, Fitbit and Google Locations Data
 
Using Data & Analytics To Find Out How Much Daily Mail Readers Hate Me (and W...
Using Data & Analytics To Find Out How Much Daily Mail Readers Hate Me (and W...Using Data & Analytics To Find Out How Much Daily Mail Readers Hate Me (and W...
Using Data & Analytics To Find Out How Much Daily Mail Readers Hate Me (and W...
 
Analytics, BigQuery, Looker and How I Became an Internet Meme for 48 Hours
Analytics, BigQuery, Looker and How I Became an Internet Meme for 48 HoursAnalytics, BigQuery, Looker and How I Became an Internet Meme for 48 Hours
Analytics, BigQuery, Looker and How I Became an Internet Meme for 48 Hours
 
Analytics is Taking over the World (Again) - UKOUG Tech'17
Analytics is Taking over the World (Again) - UKOUG Tech'17Analytics is Taking over the World (Again) - UKOUG Tech'17
Analytics is Taking over the World (Again) - UKOUG Tech'17
 
Petabytes to Personalization - Data Analytics with Qubit and Looker
Petabytes to Personalization - Data Analytics with Qubit and LookerPetabytes to Personalization - Data Analytics with Qubit and Looker
Petabytes to Personalization - Data Analytics with Qubit and Looker
 
Budapest Data Forum 2017 - BigQuery, Looker And Big Data Analytics At Petabyt...
Budapest Data Forum 2017 - BigQuery, Looker And Big Data Analytics At Petabyt...Budapest Data Forum 2017 - BigQuery, Looker And Big Data Analytics At Petabyt...
Budapest Data Forum 2017 - BigQuery, Looker And Big Data Analytics At Petabyt...
 
Data Integration and Data Warehousing for Cloud, Big Data and IoT: 
What’s Ne...
Data Integration and Data Warehousing for Cloud, Big Data and IoT: 
What’s Ne...Data Integration and Data Warehousing for Cloud, Big Data and IoT: 
What’s Ne...
Data Integration and Data Warehousing for Cloud, Big Data and IoT: 
What’s Ne...
 
New World Hadoop Architectures (& What Problems They Really Solve) for Oracle...
New World Hadoop Architectures (& What Problems They Really Solve) for Oracle...New World Hadoop Architectures (& What Problems They Really Solve) for Oracle...
New World Hadoop Architectures (& What Problems They Really Solve) for Oracle...
 
How a Tweet Went Viral - BIWA Summit 2017
How a Tweet Went Viral - BIWA Summit 2017How a Tweet Went Viral - BIWA Summit 2017
How a Tweet Went Viral - BIWA Summit 2017
 
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
 

Dernier

Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
amitlee9823
 
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men  🔝mahisagar🔝   Esc...➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men  🔝mahisagar🔝   Esc...
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...
amitlee9823
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
only4webmaster01
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
amitlee9823
 
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
amitlee9823
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
JoseMangaJr1
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
amitlee9823
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
amitlee9823
 

Dernier (20)

Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
 
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men  🔝mahisagar🔝   Esc...➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men  🔝mahisagar🔝   Esc...
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...
 
hybrid Seed Production In Chilli & Capsicum.pptx
hybrid Seed Production In Chilli & Capsicum.pptxhybrid Seed Production In Chilli & Capsicum.pptx
hybrid Seed Production In Chilli & Capsicum.pptx
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...

From BI Developer to Data Engineer with Oracle Analytics Cloud Data Lake Edition

  • 1. From BI Developer to Data Engineer with Oracle Analytics Cloud Data Lake Edition • Mark Rittman, Oracle ACE Director • ODTUG KScope’18, Orlando, June 2018
  • 2. About the Presenter • Oracle ACE Director, Independent Analyst • Past ODTUG Exec Board Member + Oracle Scene Editor • Author of two books on Oracle BI • Co-founder & CTO of Rittman Mead • 15+ years in Oracle BI, DW, ETL + now Big Data • Host of the Drill to Detail Podcast (www.drilltodetail.com) • Based in Brighton & work in London, UK
  • 3. Data Lakes are the new Data Warehouse
  • 4. Meet the New Data Warehouse: The “Data Lake” • Data now landed in Hadoop clusters, NoSQL databases and Cloud Storage • Flexible data storage platform with cheap storage, flexible schema support + compute • Solves the problem of how to store new types of data, with flexibility on when to process • Typically used by data scientists as a source for new models or insights of interest • Data Warehouses still have their place • But very few new ones are being built • Nobody leaves college dreaming of being an ETL developer • Except Michael Rainey • From “What is a Data Lake”, https://aws.amazon.com/big-data/datalakes-and-analytics/what-is-a-data-lake/
  • 5. Data Lakes Need Data Engineers
  • 6. What is Data Engineering? • When “Big Data” first became popular, all users were termed “data scientists” • Over time, this evolved into two distinct roles:
 • Data Scientists, who focus on new insights + models, working from laptops using R + sampled data
 • Data Engineers, who make at-scale data consumable in some form, either directly or by data scientists

  • 7. What is Data Engineering? • Data Engineers • Can code, run clusters • Create data pipelines & prepare data • Train and build predefined ML models • Knowledge of the math of ML limited • They may be DBAs, BI developers • Experience with DevOps, cloud and….
  • 8. Oracle Analytics Cloud Data Lake Edition • Oracle’s Cloud Analytics platform, built on Oracle BI EE and Oracle DV technology • Available as customer-managed and Oracle-managed (Autonomous Analytics Cloud) • Available as three packaging options • Oracle Analytics Cloud Standard (aka Oracle DV in Oracle Cloud) • Oracle Analytics Cloud Enterprise (aka OBIEE12c in Oracle Cloud) • Oracle Analytics Cloud Data Lake (aka …?)
  • 9. Oracle Analytics Cloud Data Lake Edition
  • 10. Oracle Analytics Cloud • Data Sources: social, sensors, enterprise, personal, SaaS, mobile • Developers, Executives, Data Stewards, Analysts, Data Engineers • Data Catalog: one place to collect, search, explore & curate all data • Data Preparation: prepare enriched, sharable, & reliable datasets across all data • Data Analysis: understand & act using smarts: search, visualization, & storytelling • Oracle Database Services, Oracle Big Data Cloud, Oracle Storage Cloud
  • 11. OAC Data Lake Edition: Key Features • All functionality in OAC Standard Edition plus • Integration with Oracle Big Data Cloud • Additional data flow/data prep operators • ML model build and train capability • Text analytics and NLP processing • Data flow execution in Apache Spark (*) • Replicate from Cloud and On-Premise Apps • Oracle Service Cloud – Taleo, Fusion Apps • Incremental Ingest from DBs, Cloud + files • Continuous Ingest from GoldenGate
  • 12. Integrates with Oracle Big Data Cloud and Event Hub
  • 13. Long-Term Replacement for Big Data Discovery • Visual Face of Data in Hadoop • Data Preparation and Enrichment • Spark Data Transformations • Standalone technology + processes • Visual Face of Data in Cloud • Data Preparation and Enrichment • Spark Data Transformations • Oracle Analytics Cloud
  • 14. OAC Data Lake Edition Use-Cases • Explore, catalog and discover data in Oracle Big Data Cloud, Oracle Database • Enrich and transform raw data into valuable information and insights • Analyze at-scale data using Data Visualization • Combine data from SaaS, social and real-time • Create predictive and classification models • Analyze the sentiment in social media feeds • Data engineering without the hand-coding
  • 16. Scenario: Ingest and Analyze Real-Time Feeds • IoT events via Fluentd • Social Media data via Fluentd
  • 17. Scenario: Ingest and Analyze Real-Time Feeds • IoT events via Fluentd • Social Media data via Fluentd • Firewall
  • 18. Scenario: Ingest and Analyze Real-Time Feeds • IoT events via Fluentd • Social Media data via Fluentd • Firewall • Event Hub Cloud REST Proxy • Event Hub Cloud Kafka Connect • Event Hub Cloud Kafka Connect
  • 19. Scenario: Ingest and Analyze Real-Time Feeds • IoT events via Fluentd • Social Media data via Fluentd • Firewall • Event Hub Cloud REST Proxy • Event Hub Cloud Kafka Connect • Event Hub Cloud Kafka Connect • INGEST
  • 20. Scenario: Ingest and Analyze Real-Time Feeds • IoT events via Fluentd • Social Media data via Fluentd • Firewall • Event Hub Cloud REST Proxy • Event Hub Cloud Kafka Connect • Event Hub Cloud Kafka Connect • INGEST • Oracle Big Data Cloud
  • 21. Scenario: Ingest and Analyze Real-Time Feeds • IoT events via Fluentd • Social Media data via Fluentd • Firewall • Event Hub Cloud REST Proxy • Event Hub Cloud Kafka Connect • Event Hub Cloud Kafka Connect • INGEST • Oracle Big Data Cloud • Oracle Analytics Cloud Data Lake Edition
  • 22. Scenario: Ingest and Analyze Real-Time Feeds • IoT events via Fluentd • Social Media data via Fluentd • Firewall • Event Hub Cloud REST Proxy • Event Hub Cloud Kafka Connect • Event Hub Cloud Kafka Connect • INGEST • Oracle Big Data Cloud • Oracle Analytics Cloud Data Lake Edition • TRANSFORM
  • 23. Scenario: Ingest and Analyze Real-Time Feeds • IoT events via Fluentd • Social Media data via Fluentd • Firewall • Event Hub Cloud REST Proxy • Event Hub Cloud Kafka Connect • Event Hub Cloud Kafka Connect • INGEST • Oracle Big Data Cloud • Oracle Analytics Cloud Data Lake Edition • TRANSFORM • ANALYZE
  • 24. Scenario: Ingest and Analyze Real-Time Feeds • ID & Access Management • Auditing • Object Storage • VCN
  • 25. Scenario: Ingest and Analyze Real-Time Feeds • ID & Access Management • Auditing • Object Storage • VCN • Availability Domain 1
  • 26. Scenario: Ingest and Analyze Real-Time Feeds • ID & Access Management • Auditing • Object Storage • VCN • Availability Domain 1 • ORACLE CLOUD INFRASTRUCTURE (REGION)
  • 29. Oracle Big Data Cloud, Ambari and Hive ThriftServer
  • 30. Oracle Event Hub Cloud Service - Dedicated
  • 31. OAC Data Lake Edition
  • 32. Managing and Cataloging the Cloud Data Lake • Catalog of all data assets • Projects • Connection to Hive Thrift Server • IoT and Social Media Data Sets • Data Flows and Sequences • Managed data lake store • Control the lifecycle of your data lake assets • Security • Scheduling
  • 33. Data Preparation Features from OAC Standard Edition 1. Split the timestamp field that’s not in a valid format 2. Choose the “space” character as the delimiter 3. Convert the first split column into a date datatype 4. Choose the correct date format for this field’s values 5. Repeat for the TIME split column, concatenate with ’T’ in between and finally convert the resulting field into TIMESTAMP
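The five preparation steps on slide 33 are carried out in the OAC user interface, but the same logic can be sketched in a few lines of Python. This is only an illustration of the split, convert and concatenate sequence: the function name `to_timestamp`, the sample value and the day/month/year date format are assumptions, since the slide does not show the raw field's actual format.

```python
from datetime import datetime


def to_timestamp(raw: str, date_format: str = "%d/%m/%Y") -> datetime:
    """Rebuild a proper timestamp from a 'DATE TIME' string.

    The default date_format is a hypothetical example; use whatever
    format matches the values in the field being prepared.
    """
    # Steps 1-2: split the field on the first space character.
    date_part, time_part = raw.split(" ", 1)
    # Steps 3-4: convert the first split column to a date using the chosen format.
    date_value = datetime.strptime(date_part, date_format).date()
    # Step 5: concatenate date and time with 'T' in between, then convert.
    iso_text = f"{date_value.isoformat()}T{time_part}"
    return datetime.fromisoformat(iso_text)


print(to_timestamp("12/06/2018 14:03:27"))
```

Joining the two parts with 'T' gives an ISO 8601 string, which is why the final conversion needs no format mask of its own.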
  • 34. Extended Data Flow Capability for Data Lake Edition • Data Flows are sequences of data transformations executed on the BI Server; Spark execution is on the roadmap for OAC DL • Create Essbase Cube • Time Series Forecast • Sentiment Analysis • Predictive / ML Model Train and Build • Run custom R and Python scripts • Data Flows are based on the technology previously announced as “Dataflow ML”, now delivered as part of Oracle Analytics Cloud
  • 35. Example: Enrich With Sentiment, Then Visualize 1. Add a Sentiment Analysis step to the data flow and persist the final enriched dataset back to a Hive table 2. Add a calculation to convert sentiment description values to a positive/negative cumulative score 3. Analyze the results in the Data Visualization UI
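The calculation in step 2 of slide 35 turns sentiment descriptions into a running score. A minimal Python sketch of that idea follows; the label names ("positive", "neutral", "negative"), the +1/0/-1 weights and the function name are assumptions made for illustration, as the slide does not list the values the sentiment step emits.

```python
from itertools import accumulate

# Hypothetical mapping from sentiment description to a numeric score.
SENTIMENT_SCORES = {"positive": 1, "neutral": 0, "negative": -1}


def cumulative_sentiment(labels):
    """Return the running total of sentiment scores, in input order.

    Unknown labels are treated as neutral (score 0).
    """
    scores = [SENTIMENT_SCORES.get(label.strip().lower(), 0) for label in labels]
    return list(accumulate(scores))


feed = ["Positive", "Negative", "Negative", "Neutral", "Positive"]
print(cumulative_sentiment(feed))
```

Plotted over time, a running total like this shows whether sentiment is trending up or down, which is harder to see from the individual labels.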
  • 36. Using Explain Feature to Automate Deriving Context 1. Right-click on an attribute column to “explain” the drivers of its values 2. ML algorithm explains basic facts, drivers, anomalies and identifies segments of interest
  • 37. Display Selected Attribute Explanations on Dashboard
  • 38. Transform, Aggregate and Join Datasets • Multi-step dataset joins • Aggregate Datasets • Binning and Grouping
  • 39. Predictive Modeling and Forecasting 1. Select the prediction model best suited to predicting Kudos from Strava bike rides 2. Select the column whose values are to be predicted, and the model parameter values 3. Train the model and then test against the remaining dataset
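Step 3 of slide 39, training on part of the data and testing against the rest, can be sketched in plain Python. This is not the model OAC builds: the example swaps in a one-variable least-squares line, and the ride data (distance in km against kudos) is synthetic and exactly linear, chosen only so the split and evaluation are easy to follow. All function names are hypothetical.

```python
import random


def train_test_split(rows, train_fraction=0.8, seed=42):
    """Shuffle the rows and split them into a training and a test set."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


def fit_line(rows):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(rows)
    mean_x = sum(x for x, _ in rows) / n
    mean_y = sum(y for _, y in rows) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in rows)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in rows)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def mean_absolute_error(model, rows):
    """Average absolute gap between predicted and actual values."""
    slope, intercept = model
    return sum(abs(y - (slope * x + intercept)) for x, y in rows) / len(rows)


# Synthetic data: (distance_km, kudos), generated from kudos = 2 + 0.5 * km.
rides = [(float(km), 2.0 + 0.5 * km) for km in range(10, 60, 5)]
train, test = train_test_split(rides)
model = fit_line(train)
print(model, mean_absolute_error(model, test))
```

Holding back the test rows matters because an error measured on the training rows alone would overstate how well the model predicts rides it has not seen.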
  • 40. Analyzing Data At-Scale Hosted on Big Data Cloud
  • 41. OAC Data Lake: What Works, What’s Coming? • Data Flow feature enables multi-step transform of ingested data • Sentiment Analyze operator useful for social/text data enrichment • Enables BI developers to train and build predictive models • ML-driven Explain feature automates understanding of context • Basic data engineering for BI developers • More data lake features expected in v5, v6 • Integration of features from Oracle Big Data Preparation Cloud Service • Enhanced Summary view highlights data shape and data quality
  • 42. Coming soon to London, Autumn/Fall 2018
 
 https://mjr-analytics.com
  • 43. From BI Developer to Data Engineer with Oracle Analytics Cloud Data Lake Edition • Mark Rittman, Oracle ACE Director • ODTUG KScope’18, Orlando, June 2018