SlideShare une entreprise Scribd logo
1  sur  18
Copyright © 2015, SAS Institute Inc. All rights reserved.
Big Data Analytics with SAS
and Hadoop
Felix Liao
Business Solutions Manager
SAS Australia/New Zealand
Copyr ight © 2014, SAS Institute Inc. All rights reser ved.
Agenda
 5 things you didn’t know about SAS (and Hadoop)
Copyr ight © 2014, SAS Institute Inc. All rights reser ved.
#1 SAS is the largest private software company in the world
 1000+ customer sites in Australia &
New Zealand
 A market leader in the areas of Data
Management, Reporting and Advanced
Analytics
 23% annual re-investment in R&D
Copyr ight © 2014, SAS Institute Inc. All rights reser ved.
#2 SAS has been doing machine learning for 39 years
SAS is the "800-pound gorilla" in the analytics space
- Gartner
Copyr ight © 2014, SAS Institute Inc. All rights reser ved.
Breadth and Depth of Analytical Capabilities
Append
Data
Partition
File
Import
Filter Merge SampleSAMPLE
Association DMDB
MultiPlot
EXPLORE
Graph
Explore
Link Analysis
Path Analysis
SOM/Kohonen
StatExplore
Variable
Clustering
Variable
Selection
Market Basket
Cluster
MODIFY Drop
Rules
Builder
ReplacementPrincipal
Components
Interactive
Binning
Impute
Transform
Variables
Decision
Tree
AutoNeural
Neural
Network
Regression
Partial Least
Squares
Dmine
Regression
MODEL
DM Neural
Ensemble
Rule
Induction
Gradient
Boosting
LARS
MBR
Two Stage
Model Import
Incremental
Response
Survival
Analysis
Credit
Scoring*
TS
Correlation
TS Data
Prep
TS Dimension
Reduction
TS
Decomp.
TS
Similarity
TS Exponential
Smoothing
HP Explore
HP Impute
HP
Regression
HP
Transform
HP Variable
Selection
HP Neural
HP Forest
HP Decision
Tree
HP Data
Partition
HP GLM HP
Cluster
HP Principal
Components
HP SVM
Cutoff Segment ProfileASSESS
Model
Comparison
ScoreDecisions
UTILITY Control
Point
Metadata
SAS Code
Reporter
End Groups Score Code
ExportStart Groups Ext Demo
Input
Data
Open Source
Integration
Register
Metadata
Save
Data
Copyr ight © 2014, SAS Institute Inc. All rights reser ved.
#3 SAS is serious and committed about Hadoop
 Hadoop as catalyst for big data analytics
 Bringing SAS analytics to Hadoop
 Joint R&D effort with leading Hadoop vendors
Copyr ight © 2014, SAS Institute Inc. All rights reser ved.
Open Data Platform Initiative
 SAS is a founding member of the open data platform
(ODP) initiative
 Accelerate innovations around a stable common core
platform
 Maximize big data adoption and productivity
Copyr ight © 2014, SAS Institute Inc. All rights reser ved.
#4 SAS is a certified workload engine on YARN
We are very excited today to announce
the next step in our joint journey
achieved by integrating SAS HPA and
LASR with the YARN resource manager
so it will run as a first class citizen in the
Hadoop cluster, co-existing and sharing
cluster resources with other YARN
enabled workloads running Hadoop and
third-party YARN enabled applications.
Arun C. Murthy
Copyr ight © 2014, SAS Institute Inc. All rights reser ved.
Copyr ight © 2014, SAS Institute Inc. All rights reser ved.
SAS & Hadoop Accelerating the Analytical Life Cycle
Prepare data IN
Hadoop for
analytics
Deploy and manage
model score code IN
Hadoop
Lift data IN to memory
for analytics at scale
Model data at scale in-
memory WITH advanced
modeling tools
Explore data at scale, in-
memory WITH data
visualization
Copyr ight © 2014, SAS Institute Inc. All rights reser ved.
Prepare Hadoop Data: SAS Data Loader for Hadoop
Copyr ight © 2014, SAS Institute Inc. All rights reser ved.
Hadoop Data Discovery: SAS Visual Analytics
Copyr ight © 2014, SAS Institute Inc. All rights reser ved.
Model Development: SAS In-Memory Statistics for Hadoop
Copyr ight © 2014, SAS Institute Inc. All rights reser ved.
#5 SAS is delivering big data analytics today!
Now we can run hundreds and thousands of
models at the product level - at the SKU level
- because you have the big data and
analytics to support those models at that
level.
- Kerem Tomak (VP of Analytics)
We have a lot of data, but now we can start
unleashing the power of that information
- Joanna Gurry (Head of Information)
Copyr ight © 2014, SAS Institute Inc. All rights reser ved.
SAS and Hortonworks - Rogers Media
 40 million records per month in Hortonworks
HDP
 More than 600 relevant web characteristics
 Processing data on 12 million customers
 SAS High Performance Analytics to place
better targeted ads “Several of us from Rogers
in the room looked at each
other, and said ‘That is
really wicked; that’s cool.”
Chris Dingle
Senior Director of Audience Solutions
Rogers Communications
https://www.youtube.com/watch?v=YFtrK02VaM4
Copyr ight © 2014, SAS Institute Inc. All rights reser ved.
Five things you now know about SAS and Hadoop!
 #1 SAS is the largest private software company in the world
 #2 SAS has been doing machine learning for 39 years
 #3 SAS is serious and committed about Hadoop
 #4 SAS is a certified workload engine on YARN
 #5 SAS is delivering big data analytics today
Copyr ight © 2014, SAS Institute Inc. All rights reser ved.
http://www.sas.com/au/sashadoop
Copyr ight © 2012, SAS Institute Inc. All rights reser ved.
felix.liao@sas.com
@felixliao
felixliao Thank You!
http://www.sas.com/au/sashadoop

Contenu connexe

Tendances

Big Data & Hadoop Tutorial
Big Data & Hadoop TutorialBig Data & Hadoop Tutorial
Big Data & Hadoop TutorialEdureka!
 
Hadoop Overview
Hadoop Overview Hadoop Overview
Hadoop Overview EMC
 
SQL-on-Hadoop Tutorial
SQL-on-Hadoop TutorialSQL-on-Hadoop Tutorial
SQL-on-Hadoop TutorialDaniel Abadi
 
Azure_Business_Opportunity
Azure_Business_OpportunityAzure_Business_Opportunity
Azure_Business_OpportunityNojan Emad
 
Hadoop World 2011: Hadoop and RDBMS with Sqoop and Other Tools - Guy Harrison...
Hadoop World 2011: Hadoop and RDBMS with Sqoop and Other Tools - Guy Harrison...Hadoop World 2011: Hadoop and RDBMS with Sqoop and Other Tools - Guy Harrison...
Hadoop World 2011: Hadoop and RDBMS with Sqoop and Other Tools - Guy Harrison...Cloudera, Inc.
 
Scaling up with hadoop and banyan at ITRIX-2015, College of Engineering, Guindy
Scaling up with hadoop and banyan at ITRIX-2015, College of Engineering, GuindyScaling up with hadoop and banyan at ITRIX-2015, College of Engineering, Guindy
Scaling up with hadoop and banyan at ITRIX-2015, College of Engineering, GuindyRohit Kulkarni
 
Introduction To Hadoop Ecosystem
Introduction To Hadoop EcosystemIntroduction To Hadoop Ecosystem
Introduction To Hadoop EcosystemInSemble
 
Hadoop Architecture Options for Existing Enterprise DataWarehouse
Hadoop Architecture Options for Existing Enterprise DataWarehouseHadoop Architecture Options for Existing Enterprise DataWarehouse
Hadoop Architecture Options for Existing Enterprise DataWarehouseAsis Mohanty
 
Big Data and Hadoop Ecosystem
Big Data and Hadoop EcosystemBig Data and Hadoop Ecosystem
Big Data and Hadoop EcosystemRajkumar Singh
 
Building a Big Data platform with the Hadoop ecosystem
Building a Big Data platform with the Hadoop ecosystemBuilding a Big Data platform with the Hadoop ecosystem
Building a Big Data platform with the Hadoop ecosystemGregg Barrett
 
Big data advance topics - part 2.pptx
Big data   advance topics - part 2.pptxBig data   advance topics - part 2.pptx
Big data advance topics - part 2.pptxMoldovan Radu Adrian
 
Hadoop and Hive at Orbitz, Hadoop World 2010
Hadoop and Hive at Orbitz, Hadoop World 2010Hadoop and Hive at Orbitz, Hadoop World 2010
Hadoop and Hive at Orbitz, Hadoop World 2010Jonathan Seidman
 
Keynote from ApacheCon NA 2011
Keynote from ApacheCon NA 2011Keynote from ApacheCon NA 2011
Keynote from ApacheCon NA 2011Hortonworks
 
Hadoop Reporting and Analysis - Jaspersoft
Hadoop Reporting and Analysis - JaspersoftHadoop Reporting and Analysis - Jaspersoft
Hadoop Reporting and Analysis - JaspersoftHortonworks
 
Introduction to Apache hadoop
Introduction to Apache hadoopIntroduction to Apache hadoop
Introduction to Apache hadoopOmar Jaber
 

Tendances (20)

Big Data & Hadoop Tutorial
Big Data & Hadoop TutorialBig Data & Hadoop Tutorial
Big Data & Hadoop Tutorial
 
Introduction to Hadoop
Introduction to HadoopIntroduction to Hadoop
Introduction to Hadoop
 
Filling the Data Lake
Filling the Data LakeFilling the Data Lake
Filling the Data Lake
 
Hadoop Overview
Hadoop Overview Hadoop Overview
Hadoop Overview
 
SQL-on-Hadoop Tutorial
SQL-on-Hadoop TutorialSQL-on-Hadoop Tutorial
SQL-on-Hadoop Tutorial
 
Azure_Business_Opportunity
Azure_Business_OpportunityAzure_Business_Opportunity
Azure_Business_Opportunity
 
Hadoop World 2011: Hadoop and RDBMS with Sqoop and Other Tools - Guy Harrison...
Hadoop World 2011: Hadoop and RDBMS with Sqoop and Other Tools - Guy Harrison...Hadoop World 2011: Hadoop and RDBMS with Sqoop and Other Tools - Guy Harrison...
Hadoop World 2011: Hadoop and RDBMS with Sqoop and Other Tools - Guy Harrison...
 
Scaling up with hadoop and banyan at ITRIX-2015, College of Engineering, Guindy
Scaling up with hadoop and banyan at ITRIX-2015, College of Engineering, GuindyScaling up with hadoop and banyan at ITRIX-2015, College of Engineering, Guindy
Scaling up with hadoop and banyan at ITRIX-2015, College of Engineering, Guindy
 
Introduction To Hadoop Ecosystem
Introduction To Hadoop EcosystemIntroduction To Hadoop Ecosystem
Introduction To Hadoop Ecosystem
 
Hadoop Ecosystem
Hadoop EcosystemHadoop Ecosystem
Hadoop Ecosystem
 
Apache Hadoop at 10
Apache Hadoop at 10Apache Hadoop at 10
Apache Hadoop at 10
 
Big data Hadoop
Big data  Hadoop   Big data  Hadoop
Big data Hadoop
 
Hadoop Architecture Options for Existing Enterprise DataWarehouse
Hadoop Architecture Options for Existing Enterprise DataWarehouseHadoop Architecture Options for Existing Enterprise DataWarehouse
Hadoop Architecture Options for Existing Enterprise DataWarehouse
 
Big Data and Hadoop Ecosystem
Big Data and Hadoop EcosystemBig Data and Hadoop Ecosystem
Big Data and Hadoop Ecosystem
 
Building a Big Data platform with the Hadoop ecosystem
Building a Big Data platform with the Hadoop ecosystemBuilding a Big Data platform with the Hadoop ecosystem
Building a Big Data platform with the Hadoop ecosystem
 
Big data advance topics - part 2.pptx
Big data   advance topics - part 2.pptxBig data   advance topics - part 2.pptx
Big data advance topics - part 2.pptx
 
Hadoop and Hive at Orbitz, Hadoop World 2010
Hadoop and Hive at Orbitz, Hadoop World 2010Hadoop and Hive at Orbitz, Hadoop World 2010
Hadoop and Hive at Orbitz, Hadoop World 2010
 
Keynote from ApacheCon NA 2011
Keynote from ApacheCon NA 2011Keynote from ApacheCon NA 2011
Keynote from ApacheCon NA 2011
 
Hadoop Reporting and Analysis - Jaspersoft
Hadoop Reporting and Analysis - JaspersoftHadoop Reporting and Analysis - Jaspersoft
Hadoop Reporting and Analysis - Jaspersoft
 
Introduction to Apache hadoop
Introduction to Apache hadoopIntroduction to Apache hadoop
Introduction to Apache hadoop
 

En vedette

алексей максимович горький
алексей максимович горькийалексей максимович горький
алексей максимович горькийDyma-teacher
 
用吸睛的圖文創造提袋率
用吸睛的圖文創造提袋率用吸睛的圖文創造提袋率
用吸睛的圖文創造提袋率imbooka1115
 
с.Pptxесенин и революция
с.Pptxесенин и революцияс.Pptxесенин и революция
с.Pptxесенин и революцияDyma-teacher
 
Studying Languages at Dalhousie 2011
Studying Languages at Dalhousie 2011Studying Languages at Dalhousie 2011
Studying Languages at Dalhousie 2011sdspasova
 
фадеев «разгром»
фадеев «разгром»фадеев «разгром»
фадеев «разгром»Dyma-teacher
 
Central Powers Collapse
Central Powers CollapseCentral Powers Collapse
Central Powers Collapsebrhughes
 
การคำนวณภาษีเงินได้แบบสิ้นปี
การคำนวณภาษีเงินได้แบบสิ้นปีการคำนวณภาษีเงินได้แบบสิ้นปี
การคำนวณภาษีเงินได้แบบสิ้นปี0821960829
 
Hybrid app简要介绍
Hybrid app简要介绍Hybrid app简要介绍
Hybrid app简要介绍Eric Xiao
 
Jasa desain rumah bali home designer architect bali
Jasa desain rumah bali   home designer   architect baliJasa desain rumah bali   home designer   architect bali
Jasa desain rumah bali home designer architect balisupriyantoedi
 
LINQS Quick Connect Stickers
LINQS Quick Connect StickersLINQS Quick Connect Stickers
LINQS Quick Connect StickersRaghvendra Saboo
 
Marsactu : Aix pose ses marques sans Marseille et la Métropole
Marsactu : Aix pose ses marques sans Marseille et la MétropoleMarsactu : Aix pose ses marques sans Marseille et la Métropole
Marsactu : Aix pose ses marques sans Marseille et la MétropoleFranck Confino
 
Infographics: Analyze, Evaluate and Create
Infographics: Analyze, Evaluate and CreateInfographics: Analyze, Evaluate and Create
Infographics: Analyze, Evaluate and CreateLinda Nitsche
 
Post World War 2 Rome - Bicycle Thieves
Post World War 2 Rome - Bicycle ThievesPost World War 2 Rome - Bicycle Thieves
Post World War 2 Rome - Bicycle Thievesstephcrame
 

En vedette (20)

1глагол
1глагол1глагол
1глагол
 
алексей максимович горький
алексей максимович горькийалексей максимович горький
алексей максимович горький
 
用吸睛的圖文創造提袋率
用吸睛的圖文創造提袋率用吸睛的圖文創造提袋率
用吸睛的圖文創造提袋率
 
с.Pptxесенин и революция
с.Pptxесенин и революцияс.Pptxесенин и революция
с.Pptxесенин и революция
 
Studying Languages at Dalhousie 2011
Studying Languages at Dalhousie 2011Studying Languages at Dalhousie 2011
Studying Languages at Dalhousie 2011
 
Leadership growth
Leadership growthLeadership growth
Leadership growth
 
фадеев «разгром»
фадеев «разгром»фадеев «разгром»
фадеев «разгром»
 
Piramida Aromatov
Piramida AromatovPiramida Aromatov
Piramida Aromatov
 
Klimatrapport Sigtunahöjden 2015
Klimatrapport Sigtunahöjden 2015Klimatrapport Sigtunahöjden 2015
Klimatrapport Sigtunahöjden 2015
 
Central Powers Collapse
Central Powers CollapseCentral Powers Collapse
Central Powers Collapse
 
ARISH CV (1)
ARISH CV (1)ARISH CV (1)
ARISH CV (1)
 
การคำนวณภาษีเงินได้แบบสิ้นปี
การคำนวณภาษีเงินได้แบบสิ้นปีการคำนวณภาษีเงินได้แบบสิ้นปี
การคำนวณภาษีเงินได้แบบสิ้นปี
 
Hybrid app简要介绍
Hybrid app简要介绍Hybrid app简要介绍
Hybrid app简要介绍
 
Jasa desain rumah bali home designer architect bali
Jasa desain rumah bali   home designer   architect baliJasa desain rumah bali   home designer   architect bali
Jasa desain rumah bali home designer architect bali
 
LINQS Quick Connect Stickers
LINQS Quick Connect StickersLINQS Quick Connect Stickers
LINQS Quick Connect Stickers
 
Marsactu : Aix pose ses marques sans Marseille et la Métropole
Marsactu : Aix pose ses marques sans Marseille et la MétropoleMarsactu : Aix pose ses marques sans Marseille et la Métropole
Marsactu : Aix pose ses marques sans Marseille et la Métropole
 
Italy in WWI
Italy in WWIItaly in WWI
Italy in WWI
 
Infographics: Analyze, Evaluate and Create
Infographics: Analyze, Evaluate and CreateInfographics: Analyze, Evaluate and Create
Infographics: Analyze, Evaluate and Create
 
Post World War 2 Rome - Bicycle Thieves
Post World War 2 Rome - Bicycle ThievesPost World War 2 Rome - Bicycle Thieves
Post World War 2 Rome - Bicycle Thieves
 
รำวงมาตรฐาน
รำวงมาตรฐานรำวงมาตรฐาน
รำวงมาตรฐาน
 

Similaire à 2015 HortonWorks MDA Roadshow Presentation

Accelerate Your Big Data Analytics Efforts with SAS and Hadoop
Accelerate Your Big Data Analytics Efforts with SAS and HadoopAccelerate Your Big Data Analytics Efforts with SAS and Hadoop
Accelerate Your Big Data Analytics Efforts with SAS and HadoopDataWorks Summit
 
Datameer6 for prospects - june 2016_v2
Datameer6 for prospects - june 2016_v2Datameer6 for prospects - june 2016_v2
Datameer6 for prospects - june 2016_v2Datameer
 
Lowering the entry point to getting going with Hadoop and obtaining business ...
Lowering the entry point to getting going with Hadoop and obtaining business ...Lowering the entry point to getting going with Hadoop and obtaining business ...
Lowering the entry point to getting going with Hadoop and obtaining business ...DataWorks Summit
 
Intro to Big Data and Apache Hadoop by Dr. Amr Awadallah at CLOUD WEEKEND '13...
Intro to Big Data and Apache Hadoop by Dr. Amr Awadallah at CLOUD WEEKEND '13...Intro to Big Data and Apache Hadoop by Dr. Amr Awadallah at CLOUD WEEKEND '13...
Intro to Big Data and Apache Hadoop by Dr. Amr Awadallah at CLOUD WEEKEND '13...TheInevitableCloud
 
Cw13 big data and apache hadoop by amr awadallah-cloudera
Cw13 big data and apache hadoop by amr awadallah-clouderaCw13 big data and apache hadoop by amr awadallah-cloudera
Cw13 big data and apache hadoop by amr awadallah-clouderainevitablecloud
 
Expand a Data warehouse with Hadoop and Big Data
Expand a Data warehouse with Hadoop and Big DataExpand a Data warehouse with Hadoop and Big Data
Expand a Data warehouse with Hadoop and Big Datajdijcks
 
Big Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential ToolsBig Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential ToolsFredReynolds2
 
Making Big Data Easy for Everyone
Making Big Data Easy for EveryoneMaking Big Data Easy for Everyone
Making Big Data Easy for EveryoneCaserta
 
Big data tim
Big data timBig data tim
Big data timT Weir
 
1° Sessione Oracle CRUI: Analytics Data Lab, the power of Big Data Investiga...
1° Sessione Oracle CRUI: Analytics Data Lab,  the power of Big Data Investiga...1° Sessione Oracle CRUI: Analytics Data Lab,  the power of Big Data Investiga...
1° Sessione Oracle CRUI: Analytics Data Lab, the power of Big Data Investiga...Jürgen Ambrosi
 
Exclusive Verizon Employee Webinar: Getting More From Your CDR Data
Exclusive Verizon Employee Webinar: Getting More From Your CDR DataExclusive Verizon Employee Webinar: Getting More From Your CDR Data
Exclusive Verizon Employee Webinar: Getting More From Your CDR DataPentaho
 
SAS Data Management for Analytics: potenzia le tue analisi e sostieni l’innov...
SAS Data Management for Analytics: potenzia le tue analisi e sostieni l’innov...SAS Data Management for Analytics: potenzia le tue analisi e sostieni l’innov...
SAS Data Management for Analytics: potenzia le tue analisi e sostieni l’innov...SAS Italy
 
Zementis hortonworks-webinar-2014-09
Zementis hortonworks-webinar-2014-09Zementis hortonworks-webinar-2014-09
Zementis hortonworks-webinar-2014-09Hortonworks
 
6 enriching your data warehouse with big data and hadoop
6 enriching your data warehouse with big data and hadoop6 enriching your data warehouse with big data and hadoop
6 enriching your data warehouse with big data and hadoopDr. Wilfred Lin (Ph.D.)
 
Apache Hadoop and its role in Big Data architecture - Himanshu Bari
Apache Hadoop and its role in Big Data architecture - Himanshu BariApache Hadoop and its role in Big Data architecture - Himanshu Bari
Apache Hadoop and its role in Big Data architecture - Himanshu Barijaxconf
 
The Forrester Wave - Big Data Hadoop
The Forrester Wave - Big Data HadoopThe Forrester Wave - Big Data Hadoop
The Forrester Wave - Big Data HadoopIBM Software India
 
Transform Your Business with Big Data and Hortonworks
Transform Your Business with Big Data and Hortonworks Transform Your Business with Big Data and Hortonworks
Transform Your Business with Big Data and Hortonworks Pactera_US
 
Hadoop Turns a Corner and Sees the Future
Hadoop Turns a Corner and Sees the FutureHadoop Turns a Corner and Sees the Future
Hadoop Turns a Corner and Sees the FutureDataWorks Summit
 

Similaire à 2015 HortonWorks MDA Roadshow Presentation (20)

Accelerate Your Big Data Analytics Efforts with SAS and Hadoop
Accelerate Your Big Data Analytics Efforts with SAS and HadoopAccelerate Your Big Data Analytics Efforts with SAS and Hadoop
Accelerate Your Big Data Analytics Efforts with SAS and Hadoop
 
Big Data
Big DataBig Data
Big Data
 
Datameer6 for prospects - june 2016_v2
Datameer6 for prospects - june 2016_v2Datameer6 for prospects - june 2016_v2
Datameer6 for prospects - june 2016_v2
 
Lowering the entry point to getting going with Hadoop and obtaining business ...
Lowering the entry point to getting going with Hadoop and obtaining business ...Lowering the entry point to getting going with Hadoop and obtaining business ...
Lowering the entry point to getting going with Hadoop and obtaining business ...
 
Intro to Big Data and Apache Hadoop by Dr. Amr Awadallah at CLOUD WEEKEND '13...
Intro to Big Data and Apache Hadoop by Dr. Amr Awadallah at CLOUD WEEKEND '13...Intro to Big Data and Apache Hadoop by Dr. Amr Awadallah at CLOUD WEEKEND '13...
Intro to Big Data and Apache Hadoop by Dr. Amr Awadallah at CLOUD WEEKEND '13...
 
Cw13 big data and apache hadoop by amr awadallah-cloudera
Cw13 big data and apache hadoop by amr awadallah-clouderaCw13 big data and apache hadoop by amr awadallah-cloudera
Cw13 big data and apache hadoop by amr awadallah-cloudera
 
View on big data technologies
View on big data technologiesView on big data technologies
View on big data technologies
 
Expand a Data warehouse with Hadoop and Big Data
Expand a Data warehouse with Hadoop and Big DataExpand a Data warehouse with Hadoop and Big Data
Expand a Data warehouse with Hadoop and Big Data
 
Big Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential ToolsBig Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential Tools
 
Making Big Data Easy for Everyone
Making Big Data Easy for EveryoneMaking Big Data Easy for Everyone
Making Big Data Easy for Everyone
 
Big data tim
Big data timBig data tim
Big data tim
 
1° Sessione Oracle CRUI: Analytics Data Lab, the power of Big Data Investiga...
1° Sessione Oracle CRUI: Analytics Data Lab,  the power of Big Data Investiga...1° Sessione Oracle CRUI: Analytics Data Lab,  the power of Big Data Investiga...
1° Sessione Oracle CRUI: Analytics Data Lab, the power of Big Data Investiga...
 
Exclusive Verizon Employee Webinar: Getting More From Your CDR Data
Exclusive Verizon Employee Webinar: Getting More From Your CDR DataExclusive Verizon Employee Webinar: Getting More From Your CDR Data
Exclusive Verizon Employee Webinar: Getting More From Your CDR Data
 
SAS Data Management for Analytics: potenzia le tue analisi e sostieni l’innov...
SAS Data Management for Analytics: potenzia le tue analisi e sostieni l’innov...SAS Data Management for Analytics: potenzia le tue analisi e sostieni l’innov...
SAS Data Management for Analytics: potenzia le tue analisi e sostieni l’innov...
 
Zementis hortonworks-webinar-2014-09
Zementis hortonworks-webinar-2014-09Zementis hortonworks-webinar-2014-09
Zementis hortonworks-webinar-2014-09
 
6 enriching your data warehouse with big data and hadoop
6 enriching your data warehouse with big data and hadoop6 enriching your data warehouse with big data and hadoop
6 enriching your data warehouse with big data and hadoop
 
Apache Hadoop and its role in Big Data architecture - Himanshu Bari
Apache Hadoop and its role in Big Data architecture - Himanshu BariApache Hadoop and its role in Big Data architecture - Himanshu Bari
Apache Hadoop and its role in Big Data architecture - Himanshu Bari
 
The Forrester Wave - Big Data Hadoop
The Forrester Wave - Big Data HadoopThe Forrester Wave - Big Data Hadoop
The Forrester Wave - Big Data Hadoop
 
Transform Your Business with Big Data and Hortonworks
Transform Your Business with Big Data and Hortonworks Transform Your Business with Big Data and Hortonworks
Transform Your Business with Big Data and Hortonworks
 
Hadoop Turns a Corner and Sees the Future
Hadoop Turns a Corner and Sees the FutureHadoop Turns a Corner and Sees the Future
Hadoop Turns a Corner and Sees the Future
 

Dernier

SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AIABDERRAOUF MEHENNI
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxComplianceQuest1
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️Delhi Call girls
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comFatema Valibhai
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...OnePlan Solutions
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...MyIntelliSource, Inc.
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...ICS
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providermohitmore19
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsArshad QA
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsAndolasoft Inc
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...harshavardhanraghave
 
CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female service
CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female serviceCALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female service
CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female serviceanilsa9823
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionSolGuruz
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...panagenda
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Steffen Staab
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxbodapatigopi8531
 
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️anilsa9823
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Modelsaagamshah0812
 

Dernier (20)

SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.js
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
 
CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female service
CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female serviceCALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female service
CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female service
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with Precision
 
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfMicrosoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptx
 
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
 

2015 HortonWorks MDA Roadshow Presentation

  • 1. Copyright © 2015, SAS Institute Inc. All rights reserved. Big Data Analytics with SAS and Hadoop Felix Liao Business Solutions Manager SAS Australia/New Zealand
  • 2. Copyr ight © 2014, SAS Institute Inc. All rights reser ved. Agenda  5 things you didn’t know about SAS (and Hadoop)
  • 3. Copyr ight © 2014, SAS Institute Inc. All rights reser ved. #1 SAS is the largest private software company in the world  1000+ customer sites in Australia & New Zealand  A market leader in the areas of Data Management, Reporting and Advanced Analytics  23% annual re-investment in R&D
  • 4. Copyr ight © 2014, SAS Institute Inc. All rights reser ved. #2 SAS has been doing machine learning for 39 years SAS is the "800-pound gorilla" in the analytics space - Gartner
  • 5. Copyr ight © 2014, SAS Institute Inc. All rights reser ved. Breadth and Depth of Analytical Capabilities Append Data Partition File Import Filter Merge SampleSAMPLE Association DMDB MultiPlot EXPLORE Graph Explore Link Analysis Path Analysis SOM/Kohonen StatExplore Variable Clustering Variable Selection Market Basket Cluster MODIFY Drop Rules Builder ReplacementPrincipal Components Interactive Binning Impute Transform Variables Decision Tree AutoNeural Neural Network Regression Partial Least Squares Dmine Regression MODEL DM Neural Ensemble Rule Induction Gradient Boosting LARS MBR Two Stage Model Import Incremental Response Survival Analysis Credit Scoring* TS Correlation TS Data Prep TS Dimension Reduction TS Decomp. TS Similarity TS Exponential Smoothing HP Explore HP Impute HP Regression HP Transform HP Variable Selection HP Neural HP Forest HP Decision Tree HP Data Partition HP GLM HP Cluster HP Principal Components HP SVM Cutoff Segment ProfileASSESS Model Comparison ScoreDecisions UTILITY Control Point Metadata SAS Code Reporter End Groups Score Code ExportStart Groups Ext Demo Input Data Open Source Integration Register Metadata Save Data
  • 6. Copyr ight © 2014, SAS Institute Inc. All rights reser ved. #3 SAS is serious and committed about Hadoop  Hadoop as catalyst for big data analytics  Bringing SAS analytics to Hadoop  Joint R&D effort with leading Hadoop vendors
  • 7. Copyr ight © 2014, SAS Institute Inc. All rights reser ved. Open Data Platform Initiative  SAS is a founding member of the open data platform (ODP) initiative  Accelerate innovations around a stable common core platform  Maximize big data adoption and productivity
  • 8. Copyr ight © 2014, SAS Institute Inc. All rights reser ved. #4 SAS is a certified workload engine on YARN We are very excited today to announce the next step in our joint journey achieved by integrating SAS HPA and LASR with the YARN resource manager so it will run as a first class citizen in the Hadoop cluster, co-existing and sharing cluster resources with other YARN enabled workloads running Hadoop and third-party YARN enabled applications. Arun C. Murthy
  • 9. Copyr ight © 2014, SAS Institute Inc. All rights reser ved.
  • 10. Copyr ight © 2014, SAS Institute Inc. All rights reser ved. SAS & Hadoop Accelerating the Analytical Life Cycle Prepare data IN Hadoop for analytics Deploy and manage model score code IN Hadoop Lift data IN to memory for analytics at scale Model data at scale in- memory WITH advanced modeling tools Explore data at scale, in- memory WITH data visualization
  • 11. Copyr ight © 2014, SAS Institute Inc. All rights reser ved. Prepare Hadoop Data: SAS Data Loader for Hadoop
  • 12. Copyr ight © 2014, SAS Institute Inc. All rights reser ved. Hadoop Data Discovery: SAS Visual Analytics
  • 13. Copyr ight © 2014, SAS Institute Inc. All rights reser ved. Model Development: SAS In-Memory Statistics for Hadoop
  • 14. Copyr ight © 2014, SAS Institute Inc. All rights reser ved. #5 SAS is delivering big data analytics today! Now we can run hundreds and thousands of models at the product level - at the SKU level - because you have the big data and analytics to support those models at that level. - Kerem Tomak (VP of Analytics) We have a lot of data, but now we can start unleashing the power of that information - Joanna Gurry (Head of Information)
  • 15. Copyr ight © 2014, SAS Institute Inc. All rights reser ved. SAS and Hortonworks - Rogers Media  40 million records per month in Hortonworks HDP  More than 600 relevant web characteristics  Processing data on 12 million customers  SAS High Performance Analytics to place better targeted ads “Several of us from Rogers in the room looked at each other, and said ‘That is really wicked; that’s cool.” Chris Dingle Senior Director of Audience Solutions Rogers Communications https://www.youtube.com/watch?v=YFtrK02VaM4
  • 16. Copyr ight © 2014, SAS Institute Inc. All rights reser ved. Five things you now know about SAS and Hadoop!  #1 SAS is the largest private software company in the world  #2 SAS has been doing machine learning for 39 years  #3 SAS is serious and committed about Hadoop  #4 SAS is a certified workload engine on YARN  #5 SAS is delivering big data analytics today
  • 17. Copyr ight © 2014, SAS Institute Inc. All rights reser ved. http://www.sas.com/au/sashadoop
  • 18. Copyr ight © 2012, SAS Institute Inc. All rights reser ved. felix.liao@sas.com @felixliao felixliao Thank You! http://www.sas.com/au/sashadoop

Notes de l'éditeur

  1. - My name is Felix ………. Been with SAS ANZ for 5 years. - Responsible for the areas of data management and all things big data related. As SAS, we are extremely excited about Hadoop and think it truly is a game changer in terms types of big data analytics it will organisations to do. We have been investing and focusing on this area for a little while and I want to tell you some of the cool and interesting things we are doing in terms of SAS and Hadoop.
  2. As I only have 25 minuets which us not a lot of time I want to focus on telling you all 5 things that you didn’t know about SAS and what SAS is doing with Hadoop. If you all walk away thinking, that was cool, didn’t know SAS do that with Hadoop, then I would be a very happy man, Let’s see how well I go. For the next 25 minuets, I want to tell you 5 things about SAS and hadoop that perhaps you did not know. Firstly Hands up people who are familiar with who SAS is and what we do. I had a funny suspicion you guys are not all that interested about just SAS so,
  3. Start with a simple one related with just SAS. Great one to remember for pub quiz and trivia We have been for quite some time. Something I didn’t know when I joined SAS. Have been in Australia New Zealand for a little while. We have products that are rated by analyst as a leader in a number of product categories. How have we done that. The combination of the R&D focus and broad product has been instrumental as we build new innovative solutions around Hadoop which we will get to. We have focused a lot of that R/D and engineering effort into better integration with Hadoop. What you also might not be aware of is that we are a very R&D and engineering focused organisations. 12-15% industry average. Continuous growth resulted in a yearly revenue of 3B last year. We are proud of the fact that we have been included in the top 50 best place to work list for the last 5 years. I think we have some of the smartest people in the industry and continue to attract top talent. We do have a number of openings so come and speak to me afterwards if you are interested.
  4. Whilst SAS has evolved over the years, and our $3B revenue comes from areas such as Data Management, Advanced Analytics, reporting and industry solutions. Analytics and data mining or machine learning as the more recent buzz word, has been the focus from day one and continue to be the centre of everthing we do. Advanced Analytics which really is a multiple discipline areas that spans around machine learning, super/unsuper. SAS has been providing solutions from day 1. Advanced analytics is really a multi-displline area that has evolved over the years to include the areas of machine learning. SAS has been the pioneer and continue to innovate in this space whether it be in the area of superverised learning (include regression) and unsupervised learning, (clustering and segmentation). We have been continue to be the 800-pound gorilla. Machine learning is most common big data application, Post childs such as Netflix, Applying it on big data with new modern architecture and technologies. For more information on Machine Learning, see the SAS.com webpage on Machine Learning and the SAS Global Forum paper by Patrick Hall (SAS R&D) for more information: http://www.sas.com/en_us/insights/analytics/machine-learning.html http://support.sas.com/resources/papers/proceedings14/SAS313-2014.pdf What many SAS customers will recognize as machine learning, I would claim is within the intersection of data mining and machine learning, which also includes tools from many fields. It’s a very rich area for data analysis algorithms. 38% of Advanced Analytics Market Share in 2013
  5. Just to show you what I mean, I have taken a snapshot of the algorim and techniques we suported through our our various solutions. What SAS bring is both the breadth, in terms of the sheet number of algorisms ( and we have a lot of those, decision tree, neural network) but also the depth in terms of of supporting the end of end deployment process. Which includes things like sampling technique and analytical model management. This is just data mining algo I should add, what I am showing here is a subset of only the data mining capabilities. And does not include things like, econometics, forecasting and optimisation algorithms. IF I have not made my point clear, I guess the point here is that at SAS WE DO ANALYTICS AND MACHINE LEARNING !! And we are proud of it. What we have done and will continue to do is enable all of our analytical capabilities to run on Hadoop. Breath in terms of algorism, (regressions, decision tree, neural network) . Depth in terms of sampling techniqies to deployment and management of analytical models. Standard linear regression modelling to neural network to Sampling techniques to model managements. The most comprehensive analytical capabilities on the market. Whether it be supervised learning (regression), unsupervised learning ( Extended a lot of our analycal model and procedure to run within a Hadoop environment, High performance analytcs or HP procedures. Complete in-memory, complete parallel and complete within a hadoop cluster
  6. Speaking of Hadoop. Point number 3 leads to our commitment and believe in the Hadoop eco system. We are deadly serious about hadoop. Internally at SAS it is one of the most important initiative in recent years. I guess I wouldn’t speaking to you all here today if we weren’t 3 points, around powerful catalyst, we want to bring SAS to hadoop but we also want to work with the community. We don’t mearly think Hadoop as another data source, we view it as a powerful platform to run all of SAS. 1, definitely not just another data source ….. All the analytical functions and applications, we are modernising them, building new architectures and bring them all across to the world of hadoop. Continue R&D focus in this space. We are committed and doing all this because customer are asking us but we also see tremendous benefits in leveraging more of Hadoop technology. What customers want for us is loud and clear, organisations recognise our strength and heritage in analytics and wants us to bring proven, mature application to hadoop. That’s what we have been working on. By committed, I don’t mean we will be contributing open source into the Hadoop community (not as yet), committed in building and enabling new applications to run natively on Hadoop Partnership with all the leading vendors, very close co-development relationship with hortonworks
  7. Recently we have taken the next step in terms of working with the Hadoop community. One of the most recent initiative we have done is being part of the founding member of the open data platform initiative of which HortonWorks is one of the other key founding members. (also includes pivotal, ibm, teradata, For those of you not familiar with the open data platform, it is a relatively new initiative led by Hortonworks and pivotal to create a common core platform (HDFS, YARN, mapreduce), for hadoop across multiple vendors and distributions. As as well as pivotal you also have the likes of IBM and teradata. We are involved because as an application vendor on hadoop we believe in what the ODP is trying to do and the benefit it will have with our customers in the long term. We want to drive deep, robust, integration into the heart of Hadoop and being part of the ODP will help us to do that. So what kinds of things are we looking at doing with other members of ODP alliance. As a major application provider on Hadoop, we see the challenges faced by organisations as they deploy hadoop applications into producton. Standardising the core components and working closely with Hortonworks will means more robust products and accelerate product release cycle. “Hadoop and the ecosystem around it have been built on new ways to attack big problems. SAS remains committed to innovation in big data analytics and to providing high-quality software that our customers can count on. SAS’ participation in the Open Data Platform Alliance aligns with these commitments, and will benefit the increasing number of organizations – and SAS customers – that are turning to Hadoop to store and process big data. With SAS software managing and analyzing data from Hadoop, our customers can solve their most pressing challenges – better interacting with their customers, fighting fraud, managing risk, improving product quality and more.” Early days
  8. Speaking of integration into the heart of Hadoop ecosystem, I want to talk about fact number 4 which is that SAS is a certified workload engine on top of Hadoop. So what kinds of things are we looking at doing with other members of ODP alliance. Deep, robust, integration into the heart of Hadoop is a top priority. Which leads us to fact number 4 that you might not be aware of. So what has been the output of our commitment in terms of product and technology I hear you ask? Modernised SAS ….. Machine learning is the ultimate holy grail of big data applications. Post childs such as Netflix, Whilst there are a raft of emerging new technologies SAS has been doing this for Whether it be supervised learning, unsupervised learning or semi-supervised learning Deep learning Model factory Applying it on big data with new modern architecture and technologies. One of the advantage of being a leader is you can innovate Start with a simple one
  9. This is a big deal for us and more important a big deal for organisations looking at using Hadoop as the big data platform Data Locality For those of you who are just learning about YARN. Think of it as the resource management layer of a operating system, where the operating system in this case is Hadoop. It helps organise manage workload in Hadoop, also help organise maximise the investment they have made with their Hadoop cluster.
  10. With the deep integration we have built with Hadoop by taking advantage of data locality. We have build applications on top of Hadoop to accelerate the analytical life cycle. Making it easier and cost effective for organizations to drive insights out of Hadoop. We are doing that be building hadoop powered applications that drive the end to end analytical life cycle. The complete analytical life cycle is important to understand, as this is the reality most companies face: - Data needs to be prepared specifically for analytics (a crucial step), then it needs to be explored in a highly efficient environment, purpose built for interactive visualization, then it needs to be modeled in a purpose built advanced analytics environment. Finally, many times the final scoring can happen where the bulk of the data reside, in Hadoop. Through it all, key metadata act as glue, ensuring proper governance of the processes and data, tracking lineage and impact analysis, so that the user can know what may result from any changes at any point in the cycle.
  11. Easy to use interface that allows to do some. Behind the scene we generate code that runs natively within Hadoop, taking advantage of the massive scalable framework work.
  12. From a data discover and visualisation perspective. Our Visual analytics offerings leverage In-memory sas server that runs within the Hadoop cluster leveraging the YARN framework. By taking an in-memory approach and bypassing MapReduce, we can make the data discovery process much more interactive. Eliminate the batch based latency of Mapreduced based work load.
  13. Taking advantage of the same in-memory architecture on hadoop. In-memory statistics are targeted more towards programmer or hardware data scientist who wants to do data manipulation and model development within a hadoop environment . By interacting with hadoop data has been loaded persisted into memory, again we allow data scientist to much more productive through a low latency programming envionment against Hadoop data
  14. Point number 5. We are making big data analytics on Hadoop a reality today. Across different industries ### IAG High performance analytics solutions. Accelerating the model development process. (17 hours to 1min) Doing risk modelling IAG saw the time it took to analyse 20 million records against 186 variables (wide), reduce from 17 hours to just one minute. This could mean where actuaries and modelers were previously restricted to a cycle of one model a week, they can look toward cycling many models each day. ### Macy’s The initial objective: stop the “one size fits all email marketing” approach, resulting in a reduction of 20% in churn subscription. This lead to generating more accurate, real-time decisions about customer preferences. The ability to gain customer insight across channels is a critical part of improving customer satisfaction and revenues, and Macys.com uses SAS to validate and guide the site's cross- and up-sell offer algorithms. 20% reduction in churn $500,000 annual savings Customer lifetime value analysis More accurate response prediction Optimized promotions
  15. Finally but definitely not the least, I want to talk about Rogers. Poster child of everthing we just talked about. The ultimate goal was to position the most adequate advertising to a given visiting customer on Rogers’ web site. Traits are a characteristics/parameter of each visit. For example, the time of a visit, the number of clicks, the target browser, the device used (iPad, Samsung, etc). The 600 traits used in the final model were actually derived from a list of 75,000 original traits. Hortonworks youtube channel.
  16. Recently we have taken the next step in terms of working with the Hadoop community. We have the resource, commitment, we have the technology and we are making it real on Hadoop. We are doing by working closely with the hadoop community and technoloy ecosystem. Which we recognise as being extremely important.
  17. Where do I find out more.To find out more, a good whitepaper to get started. Nice and easy to remember. There is a lot to what SAS is doing with Hadoop.
  18. My contact detail, I will be sticking aorund and colleague with SAS shirt. Enjoy your rest of your day here at the conference.