Enterprise Data Science at Scale
Artem Ervits
1.
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
From Data Science to Enterprise Data Science @ Scale
2.
Data Scientist Valentine Day Prediction Spark with HDP Improve Zeppelin Data Science Platform
3.
IBM + Hortonworks = Unlocking Actionable Insights
• #1 Pure Open Source Hadoop Distribution
• 1000+ customers and 2100+ ecosystem partners
• Employs the original architects, developers and operators of Hadoop from Yahoo!
• Best-in-class 24x7 customer support
• Leading professional services and training
• #1 Data Science Platform (Source: Gartner)
• OpenPOWER performance leadership
• Flexible, software-defined storage
• #1 SQL Engine for complex, analytical workloads
• Leader in On-premise and Hybrid Cloud solutions
4.
Data Science For Modern Data Architecture
• Make Zeppelin great for Spark
• Enable apps to consume predictions and become smarter
• Become easier, more accurate & faster to deploy & manage
• Bring predictive analytics to the IoT edge
• Fully support the data science life cycle
5.
Data Science Lifecycle
6.
Next Generation Data Science Problems
Multiple data sources & clusters
• Data Scientists: Where is the data I need to answer the business questions?
• Data Engineers: How do I move that data into a central repository? How do I transform and cleanse that data?
7.
Next Generation Data Science Problems
Too many tools and technologies
• Data Scientists: How do I learn the latest library/technique? I don't (want to) know Hadoop/Hive etc. How do I bring my familiar R/Python library to the new data science platform?
8.
Next Generation Data Science Problems
Socializing insights is challenging
• Data Scientists: How do I collaborate and share my work with others in the organization?
• Business Analyst: How do I move that data into a central repository? What is the best visualization to tell my story?
9.
Next Generation Data Science Problems
Going from prototype to production is cumbersome
• Data Scientists: I created this awesome machine learning model. How do I put it into production?
• Data Scientists/Data Engineers: How are my machine learning models performing, and how do I improve them?
10.
Enterprise Data Science At Scale
• Enterprise: Secured, governed and managed
• Tools: Leverage your favorite tools, technologies and libraries
• Deployment: From pilot to production
• Data: Build models using all the data
11.
Data Science Solution: Data Science Experience
Community
• Find tutorials and datasets
• Connect with Data Scientists
• Ask questions
• Read articles and papers
• Fork and share projects
Open Source
• Code in Scala/Python/R/SQL
• Zeppelin & Jupyter Notebooks
• RStudio IDE and Shiny
• Apache Spark
• Your favorite libraries
Scale & Enterprise Security
• Data Science at Scale
• Run Spark Jobs on HDP Cluster
• Secure Hadoop Support
• Ranger Atlas Support for Data
• Support for ABAC
Model Management
• Data Shaping Pipeline UI
• Auto-data preparation & modeling
• Advanced Visualizations
• Model management & deployment
• Documented Model APIs
12.
DSX Demo
13.
Use Case
• All industries are affected by churn.
• Being able to predict churn helps companies take action and keep customers longer.
• The more historical data, the better the model.
• Data is collected and labeled over time based on churn.
• Using a Random Forest, we will predict future churners.
Customer Churn Architecture
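The deck does not show the demo's model code. As a rough illustration of the random forest idea named in the use case, here is a minimal pure-Python sketch: each tree is a depth-one decision stump trained on a bootstrap sample with a random feature subset, and the forest's churn probability is the share of trees voting "churn". The feature names (monthly charges, support calls, tenure in months), the toy data, and all function names are assumptions made for illustration. A real deployment on HDP would more likely use a library such as Spark MLlib.

```python
import random
from collections import Counter

def gini(labels):
    """Gini impurity of a list of 0/1 labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    p = sum(labels) / n
    return 2.0 * p * (1.0 - p)

def majority(labels, default):
    """Most common label, or `default` when the list is empty."""
    return Counter(labels).most_common(1)[0][0] if labels else default

def fit_stump(rows, labels, feature_ids):
    """Find the (feature, threshold) split with the lowest weighted Gini impurity."""
    overall = majority(labels, 0)
    best = None
    for f in feature_ids:
        for t in sorted({r[f] for r in rows}):
            left = [y for r, y in zip(rows, labels) if r[f] <= t]
            right = [y for r, y in zip(rows, labels) if r[f] > t]
            score = (len(left) * gini(left) + len(right) * gini(right)) / len(labels)
            if best is None or score < best[0]:
                best = (score, f, t, majority(left, overall), majority(right, overall))
    return best[1:]  # (feature, threshold, left_label, right_label)

def fit_forest(rows, labels, n_trees=25, seed=0):
    """Train stumps on bootstrap samples, each with a random feature subset."""
    rng = random.Random(seed)
    n, n_features = len(rows), len(rows[0])
    k = max(1, int(n_features ** 0.5))  # features considered per tree
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(n) for _ in range(n)]  # sample rows with replacement
        feats = rng.sample(range(n_features), k)
        forest.append(fit_stump([rows[i] for i in idx], [labels[i] for i in idx], feats))
    return forest

def churn_probability(forest, row):
    """Share of trees that vote 'churn' (label 1) for this customer."""
    votes = [left if row[f] <= t else right for f, t, left, right in forest]
    return sum(votes) / len(votes)

# Hypothetical history: [monthly_charges, support_calls, tenure_months]
rows = [
    [95, 6, 2], [110, 8, 1], [88, 5, 4], [102, 7, 3], [99, 9, 6], [91, 5, 5],
    [30, 0, 36], [45, 1, 48], [25, 2, 60], [50, 1, 24], [40, 0, 30], [35, 2, 54],
]
labels = [1] * 6 + [0] * 6  # 1 = churned, 0 = stayed

forest = fit_forest(rows, labels)
likely_churner = churn_probability(forest, [120, 10, 1])
likely_loyal = churn_probability(forest, [20, 0, 72])
```

Stumps keep the sketch short; a production forest would grow deeper trees and be evaluated on held-out data.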
14.
15.
Demo Scenario
Assessing Customer Churn Probability in Real Time
• Stored long-term data on customer churn behavior
• New real-time data coming in
• Predict a customer's churn probability before they churn
• Alert the proper departments or manager
• Business monitors customer retention outlook & performance
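The scenario above can be pictured as a small routing step that sits after the model: each incoming customer gets a churn probability, and scores above a cutoff become alerts for a team or a manager. The sketch below is only an illustration. The thresholds (0.7 and 0.9), the recipient names, and the `Alert` type are assumptions, not details from the demo.

```python
from dataclasses import dataclass

@dataclass
class Alert:
    customer_id: str
    probability: float
    recipient: str

def route_alerts(scores, threshold=0.7, escalate_at=0.9):
    """Turn churn probabilities into alerts; cutoffs are illustrative assumptions."""
    alerts = []
    for customer_id, p in scores.items():
        if p >= escalate_at:
            # Very high risk: send straight to a (hypothetical) account manager.
            alerts.append(Alert(customer_id, p, "account-manager"))
        elif p >= threshold:
            # Elevated risk: send to a (hypothetical) retention team queue.
            alerts.append(Alert(customer_id, p, "retention-team"))
    return alerts

# Hypothetical scores produced by a churn model for newly arrived records.
scores = {"c1": 0.95, "c2": 0.75, "c3": 0.20}
alerts = route_alerts(scores)
recipients = [a.recipient for a in alerts]
```

In practice the cutoffs would be chosen from the cost of a missed churner versus the cost of an unnecessary retention offer.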
16.
Demo Flow
Insights from Data Science to Production
• Data Scientists: Where is the data I need to answer the business questions?
• Business Users: Where are the insights & predictions from the data?
• Admins: How do I meet SLA, Performance, .., Feature needs?
HDP Cluster, Knox
17.
Demo Scenario
Problems Solved
• Data scientists collaborate and learn new tools & frameworks
• Choice of tools, notebooks and languages
• Run favorite notebook on all data in the HDP Cluster
• Deploy the model to production
• Leverage the production model to deliver insights to business
• Monitor models and retrain models as new data comes in
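The last point, monitoring models and retraining as new data arrives, might look something like the sketch below: keep a rolling window of prediction outcomes and flag the model for retraining once accuracy over a full window drops under a floor. The class name, window size, and accuracy floor are assumptions for illustration. The deck does not say how the demo tracks model performance.

```python
from collections import deque

class ModelMonitor:
    """Rolling-accuracy check; window and floor are illustrative assumptions."""

    def __init__(self, window=100, min_accuracy=0.8):
        self.outcomes = deque(maxlen=window)  # True when a prediction was correct
        self.min_accuracy = min_accuracy

    def record(self, predicted, actual):
        """Store whether the latest prediction matched the observed outcome."""
        self.outcomes.append(predicted == actual)

    def accuracy(self):
        """Accuracy over the current window, or None before any outcome arrives."""
        return sum(self.outcomes) / len(self.outcomes) if self.outcomes else None

    def needs_retraining(self):
        """Flag retraining only once the window is full and accuracy is too low."""
        full = len(self.outcomes) == self.outcomes.maxlen
        return full and self.accuracy() < self.min_accuracy

monitor = ModelMonitor(window=4, min_accuracy=0.75)
for predicted, actual in [(1, 1), (0, 0), (1, 1), (0, 0)]:
    monitor.record(predicted, actual)
healthy = monitor.needs_retraining()

# Two misses push the oldest two hits out of the window.
for predicted, actual in [(1, 0), (0, 1)]:
    monitor.record(predicted, actual)
degraded = monitor.needs_retraining()
```

Waiting for a full window avoids triggering a retrain on the first few noisy outcomes.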
18.
Questions?