SlideShare a Scribd company logo
1 of 37
Download to read offline
S U M M I T
S E O U L
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
DataRobot, Automated ML
changes in methodology and benefits
홍운표
Customer Facing Data Scientist
DataRobot, Korea
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Agenda
1. DataRobot Introduction
Corporate Vision, Introduction
2. Data Science and Practices
Science Methodology, Data Scientist, Agony
3. Automated ML
What & How, Benefits, Live Demo
4. Future Direction
What & How
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Strategic Vision
Enabling the AI-Driven Enterprise
Where AI is applied in every business process to predict outcomes.
The AI-Driven Enterprise adapts to new conditions at incredible
speeds and continually self-optimizes based on predicting the future.
“If your competitor is rushing to build AI and you don’t, it will crush you.”
-Elon Musk
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
The Opportunity for Machine Learning in Any Business
Marketing
Predicting customer Lifetime Value (LTV)
Churn
Customer segmentation
Product mix (best product mix to reduce churn)
Cross selling/recommendation algorithms
Up selling
Channel optimization
Discount targeting
Responses rates
Reactivation likelihood
Adwords optimization and ad buying
In store traffic patterns
Aircraft scheduling
Sales
Lead prioritization
Demand forecasting
Pricing
Market Basket
Inventory management / Dynamic Pricing
Promos / Upgrades / Offers
Human Resources
Resume screening
Employee churn
Training recommendation
Talent management
Risk
Credit risk
Fraud detection
Accounts Payable Recovery
Anti-money laundering
Insurance Claims prediction
Readmission Risk
Warranty Analytics
Claim Prediction
Logistics
Procurement
Warehousing
Cost Analysis
Product life cycle
Demand Forecast
Assembly
Turnover
Banking
Insurance
Healthcare
Media
Pharma
Telco
Retail
Government
Energy
Transportation
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Filling the Gap
Accelerate the process of researching, testing and deploying predictive algorithms. Enable more people to
help research, test and deploy predictive algorithms.
KEY
Demand for predictive models
Supply of data scientists
Turn Analysts & Engineers
into Data Scientists
Increase Data Scientist
productivity Unmet demand for
Data Science
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
The world’s most advanced Enterprise Machine Learning Automation platform
2012
Founded, HQ in Boston, MA
$224M
In funding
1,000,000,000+
Models built on DataRobot Cloud
250+
Data Scientists & Engineers (of 600+)
4
#1 ranked Data Scientists
50+
Top 3 finishes
INSURANCE FINTECH HEALTHCARE MARKETING BANKING MANY MORE
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Best Practices and Technology
The top ranked Data Scientists in the world
Owen Zhang
Product Advisor
Highest: 1st
MASTER
Xavier Conort
Chief Data Scientist
Highest: 1st
MASTER
Sergey Yurgenson
Data Scientist
Highest: 1st
MASTER
Amanda Schierz
Data Scientist
Current: 1st Female, 1st in UK
MASTER
Jeremy Achin
CEO & Co-Founder
Highest: 20th
MASTER
The best technologies in the world
Tom de Godoy
CTO & Co-Founder
Highest: 20th
MASTER
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Data Scientists
Data
Scientist
Programming
Skills
Math &
Stats
Domain
Expertise
Required Capabilities
1. Knowledge of the business
2. Knowledge of the data
3. Ability to write code to gather data
4. Ability to write code to explore/inspect data
5. Ability to write code to manipulate data
6. Ability to write code to extract actionable intel
7. Ability to write code to build models
8. Ability to write code to implement models
9. Foundational statistics
10. Internals of algorithms
11. Practical knowledge and experience
12. Knowing how to interpret and explain models
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Scientific Methodology (Metaphysics)
Karl Popper
Observation/Rationale
Hypothesis
Experiment
Theory
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Data Science Methodology
Due to limited resource, call for amelioration
No target goal
A few algorithms
& prone to overfit
Aging of model
Not sufficient Explanations
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Data Science Landscape
Where do you mostly work?
Business
space
Feature
space
Algorithm
space
(LoB, DE, DS) (DE, DS)
ROI Availability Accuracy
LOB : Line of Business
DE : Data Engineer (ETL)
DS : Data Scientist
Actionable predictive model,
Valuable insights
(LOB, DS)
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Algorithm
Best Practices
Business-driven, feature-oriented analysis
Business Feature
Business
Algorithm
Business
Feature
Feature
Algorithm
Data Scientist Drives Business Drives Balanced and Promising
√
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Motivations for AutoML
Value of diverse set of algorithms
Source:
http://statweb.stanford.edu/~tibs/ElemStatLearn/
Methodology driven
Problem driven
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
And trends
Most modelling software using a
20th Century paradigm
Expert systems with
modelling intelligence
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
What is Automated Machine Learning
• 10 steps to building models
• An expert system that knows how to do
each of these 10 steps, without human
instructions
• Human friendly – not a black box
• Fast and accurate
• Replicable data science
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
What about DataRobot?
Key Points
• End to end automated machine learning – all 10 steps
are automated
• Hundreds of algorithms in the repository with new
algorithms being added regularly
• Chooses the best algorithms for your data
• Best-in-class human-friendly insights
• Widest range of deployment options
• Enterprise ready
• Automatic model reports
• Large support team around the world
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
DataRobot Workflow
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Different but powerful way of analysis
A few perspectives (many more)
Single model
Multiple models
Only interpretable
algorithm is chosen
 Linear model is
preferable
No need of Hold-out
partition : just train/test
or k-fold CV
Hold-out partition for
evaluation of several
models
Blending starts from
existing model
Interpretability is
model-agnostic
Blending is fair-basis
reflecting multiple
models performance,
with speed vs accuracy
data
Interaction should be
considered for model
performance (linear
model)
Interaction
automatically reflected
in tree-based
algorithms. If interaction
should be of importance,
DR has GA2M model
and R/Python api
support for that
parameter tuning is
limited for a model and
time-consuming
Parameter tuning is
exhaustive for all
candidate models.
One can easily confine
the search space and
quickly get the results
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Benefits : safer model
Robust model free from the risk of overfitting
Average of these 5 validation
scores is the cross validation score
The holdout is completely
hidden from the models during
the training process. After you
have selected your optimal
model, you can score your
model on this to get your
holdout score.
Partition 1
(TRAINING)
Partition 2
(TRAINING)
Partition 3
(TRAINING)
Partition 4
(TRAINING)
Partition 5
(VALIDATION)
Holdout
Partition 1
(TRAINING)
Partition 2
(TRAINING)
Partition 3
(TRAINING)
Partition 4
(VALIDATION)
Partition 5
(TRAINING)
Holdout
Partition 1
(TRAINING)
Partition 2
(TRAINING)
Partition 3
(VALIDATION)
Partition 4
(TRAINING)
Partition 5
(TRAINING)
Holdout
Partition 1
(TRAINING)
Partition 2
(VALIDATION)
Partition 3
(TRAINING)
Partition 4
(TRAINING)
Partition 5
(TRAINING)
Holdout
Partition 1
(VALIDATION)
Partition 2
(TRAINING)
Partition 3
(TRAINING)
Partition 4
(TRAINING)
Partition 5
(TRAINING)
Holdout
CV-Fold #1
CV-Fold #2
CV-Fold #3
CV-Fold #4
CV-Fold #5
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Benefits : more effort on feature space
”Feature engineering is the art of data science” (Sergey Yurgenson)
Business
space
Feature
space
Algorithm
space
(LoB, DE, DS) (DE, DS)
ROI Availability Accuracy
Actionable predictive model,
Valuable insights
(LOB, DS)
Feature Impact
Feature Effects
Prediction Explanations
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Benefits : Explainability
Model-agnostic explanation
[Feature Impact] [Feature Effect] [Prediction Explanation]
• The importance of each feature
• Coincides with domain knowledge?
• Any new insights?
• Relationship among target and a feature
• Relationship reflects domain knowledge?
• Any new insights or feature transform?
• What is the basis of prediction?
• The predictions are reliablable to
business people?
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Benefits : effective blending
Search over candidates which promises tangible improvement
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Benefits : Hyper-param Tuning
Gradient-free and effective pattern search
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Benefits : API integration
data scientists and developers can use API
Application server
Prediction worker
RestAPI,
R/Python pkg
Model Factory
Automatic
Model Refresh
Model
Diags & Viz
Feature
Engineering
App.
Integration
Custom Analysis and various Analysis
Notebook Web UIConsole US, EU region
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Demo : Bleedout prediction
Binary classification for QA
Process: Coating of thin film by covering the
surface with coating solution and drying,
followed by polymerizing with UV-light.
Problem: Unintended precipitation of
powder such as unpolymerized monomer,
antioxidant occurs causing “bleedout”. It
spoils the product and contaminates the
production line.
Data:
● Material: length of film roll
● Project type: production vs experiment
● Control: winding tension, UV-exposure
duration, O2 concentration etc
Winding in
Coating
Drying
Tension
UV exposure
(O2 purged)
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Demo : Bleedout MFG process
1) Unwinding 2) Coating 3) Drying
4) UV exposure 5) Tension Control 6) Winding
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Demo : Jupyter Notebook
Sagemaker Notebook Automatically Project Created & Run
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Full-fledged Automation
Expanded coverage of automation, full automation
Consumption
Consuming and application
of advanced analytics in the
form of dashboards,
decisions, and analytics
powered applications.
Prep, Blend,
Agg and ETL
Self-Service BI, data prep,
blending, transformation,
feature engineering, and
sharing of insights. Data
pipeline and workflow
execution.
Data
Management
Data cataloging,
organization, and
collaboration. Automatic
indexing and knowledge
gathering made available to
the entire organization.
Analytics
(Advanced, Simple)
Simple: Self-Service BI,
charts, graphs,
tables, queries.
Advanced: Automated data
investigation for insights,
predictions, and
recommendations
Deployment
Powering business
applications by providing
advanced analytics
insights, predictions,
monitoring, and refresh on
new data. Hosted as an
API, SDK, or code.
여러분의 피드백을 기다립니다!
#AWSSummit 해시태그로
소셜미디어에 여러분의
행사소감을 올려주세요.
AWS Summit Seoul 2019
모바일 앱과 QR코드를 통해
강연평가 및 설문조사에
참여해 주시기 바랍니다.
내년 Summit을 만들 여러분의
소중한 의견 부탁 드립니다.
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Thank you!
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
홍운표
woonpyo.hong@datarobot.com
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.

More Related Content

What's hot

What's hot (20)

Enterprise Knowledge Graph
Enterprise Knowledge GraphEnterprise Knowledge Graph
Enterprise Knowledge Graph
 
GPU上でのNLP向け深層学習の実装について
GPU上でのNLP向け深層学習の実装についてGPU上でのNLP向け深層学習の実装について
GPU上でのNLP向け深層学習の実装について
 
ConditionalPointDiffusion.pdf
ConditionalPointDiffusion.pdfConditionalPointDiffusion.pdf
ConditionalPointDiffusion.pdf
 
Trends_of_MLOps_tech_in_business
Trends_of_MLOps_tech_in_businessTrends_of_MLOps_tech_in_business
Trends_of_MLOps_tech_in_business
 
【論文読み会】MAUVE: Measuring the Gap Between Neural Text and Human Text using Dive...
【論文読み会】MAUVE: Measuring the Gap Between Neural Text and Human Text using Dive...【論文読み会】MAUVE: Measuring the Gap Between Neural Text and Human Text using Dive...
【論文読み会】MAUVE: Measuring the Gap Between Neural Text and Human Text using Dive...
 
つくばチャレンジ2020運営の活動報告
つくばチャレンジ2020運営の活動報告つくばチャレンジ2020運営の活動報告
つくばチャレンジ2020運営の活動報告
 
データ利活用を促進するメタデータ
データ利活用を促進するメタデータデータ利活用を促進するメタデータ
データ利活用を促進するメタデータ
 
Deep Learningを用いた経路予測の研究動向
Deep Learningを用いた経路予測の研究動向Deep Learningを用いた経路予測の研究動向
Deep Learningを用いた経路予測の研究動向
 
ナレッジグラフ推論チャレンジの紹介
ナレッジグラフ推論チャレンジの紹介ナレッジグラフ推論チャレンジの紹介
ナレッジグラフ推論チャレンジの紹介
 
Teslaにおけるコンピュータビジョン技術の調査
Teslaにおけるコンピュータビジョン技術の調査Teslaにおけるコンピュータビジョン技術の調査
Teslaにおけるコンピュータビジョン技術の調査
 
先駆者に学ぶ MLOpsの実際
先駆者に学ぶ MLOpsの実際先駆者に学ぶ MLOpsの実際
先駆者に学ぶ MLOpsの実際
 
Azure Machine Learning の使い方 (全体を理解するための入門編)
Azure Machine Learning の使い方 (全体を理解するための入門編)Azure Machine Learning の使い方 (全体を理解するための入門編)
Azure Machine Learning の使い方 (全体を理解するための入門編)
 
제 17회 보아즈(BOAZ) 빅데이터 컨퍼런스 - [굿아이디어스] : 아이디어스 작가를 위한 비지니스 대시보드
제 17회 보아즈(BOAZ) 빅데이터 컨퍼런스 - [굿아이디어스] : 아이디어스 작가를 위한 비지니스 대시보드제 17회 보아즈(BOAZ) 빅데이터 컨퍼런스 - [굿아이디어스] : 아이디어스 작가를 위한 비지니스 대시보드
제 17회 보아즈(BOAZ) 빅데이터 컨퍼런스 - [굿아이디어스] : 아이디어스 작가를 위한 비지니스 대시보드
 
초단타매매 전략 소개 및 트렌드
초단타매매 전략 소개 및 트렌드초단타매매 전략 소개 및 트렌드
초단타매매 전략 소개 및 트렌드
 
機械学習を用いた異常検知入門
機械学習を用いた異常検知入門機械学習を用いた異常検知入門
機械学習を用いた異常検知入門
 
[DL輪読会]Deep Direct Reinforcement Learning for Financial Signal Representation...
[DL輪読会]Deep Direct Reinforcement Learning for Financial Signal Representation...[DL輪読会]Deep Direct Reinforcement Learning for Financial Signal Representation...
[DL輪読会]Deep Direct Reinforcement Learning for Financial Signal Representation...
 
Teslaにおけるコンピュータビジョン技術の調査 (2)
Teslaにおけるコンピュータビジョン技術の調査 (2)Teslaにおけるコンピュータビジョン技術の調査 (2)
Teslaにおけるコンピュータビジョン技術の調査 (2)
 
【DL輪読会】DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Dri...
【DL輪読会】DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Dri...【DL輪読会】DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Dri...
【DL輪読会】DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Dri...
 
Neo4j GraphTalk Helsinki - Introduction and Graph Use Cases
Neo4j GraphTalk Helsinki - Introduction and Graph Use CasesNeo4j GraphTalk Helsinki - Introduction and Graph Use Cases
Neo4j GraphTalk Helsinki - Introduction and Graph Use Cases
 
Machine Learning for Automated Reasoning: An Overview
Machine Learning for Automated Reasoning: An OverviewMachine Learning for Automated Reasoning: An Overview
Machine Learning for Automated Reasoning: An Overview
 

Similar to Datarobot, 자동화된 분석 적용 시 분석 절차의 변화 및 효용 - 홍운표 데이터 사이언티스트, DataRobot :: AWS Summit Seoul 2019

NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...
NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...
NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...
Amazon Web Services
 

Similar to Datarobot, 자동화된 분석 적용 시 분석 절차의 변화 및 효용 - 홍운표 데이터 사이언티스트, DataRobot :: AWS Summit Seoul 2019 (20)

DataRobot - 머신러닝 자동화 플랫폼
DataRobot - 머신러닝 자동화 플랫폼DataRobot - 머신러닝 자동화 플랫폼
DataRobot - 머신러닝 자동화 플랫폼
 
Organizing for faster innovation - People, process, culture, and technology
Organizing for faster innovation - People, process, culture, and technologyOrganizing for faster innovation - People, process, culture, and technology
Organizing for faster innovation - People, process, culture, and technology
 
Machine learning for developers & data scientists with Amazon SageMaker - AIM...
Machine learning for developers & data scientists with Amazon SageMaker - AIM...Machine learning for developers & data scientists with Amazon SageMaker - AIM...
Machine learning for developers & data scientists with Amazon SageMaker - AIM...
 
Drive Digital Transformation using Machine Learning
Drive Digital Transformation using Machine LearningDrive Digital Transformation using Machine Learning
Drive Digital Transformation using Machine Learning
 
The Zen of DataOps – AWS Lake Formation and the Data Supply Chain Pipeline
The Zen of DataOps – AWS Lake Formation and the Data Supply Chain PipelineThe Zen of DataOps – AWS Lake Formation and the Data Supply Chain Pipeline
The Zen of DataOps – AWS Lake Formation and the Data Supply Chain Pipeline
 
Operating Models: How Does Your Operating Model Change and Scale in the Cloud?
Operating Models: How Does Your Operating Model Change and Scale in the Cloud?Operating Models: How Does Your Operating Model Change and Scale in the Cloud?
Operating Models: How Does Your Operating Model Change and Scale in the Cloud?
 
Pixels.camp - Machine Learning: Building Successful Products at Scale
Pixels.camp - Machine Learning: Building Successful Products at ScalePixels.camp - Machine Learning: Building Successful Products at Scale
Pixels.camp - Machine Learning: Building Successful Products at Scale
 
Deep Dive Amazon SageMaker
Deep Dive Amazon SageMakerDeep Dive Amazon SageMaker
Deep Dive Amazon SageMaker
 
Build Machine Learning Models with Amazon SageMaker (April 2019)
Build Machine Learning Models with Amazon SageMaker (April 2019)Build Machine Learning Models with Amazon SageMaker (April 2019)
Build Machine Learning Models with Amazon SageMaker (April 2019)
 
Racing with Artificial Intelligence
Racing with Artificial IntelligenceRacing with Artificial Intelligence
Racing with Artificial Intelligence
 
MLops workshop AWS
MLops workshop AWSMLops workshop AWS
MLops workshop AWS
 
Powering up agents using data science challenges and opportunities
Powering up agents using data science challenges and opportunities Powering up agents using data science challenges and opportunities
Powering up agents using data science challenges and opportunities
 
Cloud Choices Quantifying the Cost and Risk Implications of Cloud
Cloud Choices Quantifying the Cost and Risk Implications of CloudCloud Choices Quantifying the Cost and Risk Implications of Cloud
Cloud Choices Quantifying the Cost and Risk Implications of Cloud
 
Machine Learning Everywhere
Machine Learning EverywhereMachine Learning Everywhere
Machine Learning Everywhere
 
Amazon SageMaker sviluppa, addestra e distribuisci modelli di Machine Learnin...
Amazon SageMaker sviluppa, addestra e distribuisci modelli di Machine Learnin...Amazon SageMaker sviluppa, addestra e distribuisci modelli di Machine Learnin...
Amazon SageMaker sviluppa, addestra e distribuisci modelli di Machine Learnin...
 
Machine learning for developers & data scientists with Amazon SageMaker - AIM...
Machine learning for developers & data scientists with Amazon SageMaker - AIM...Machine learning for developers & data scientists with Amazon SageMaker - AIM...
Machine learning for developers & data scientists with Amazon SageMaker - AIM...
 
打造新一代的企業 IT - Transforming Enterprise IT
打造新一代的企業 IT - Transforming Enterprise IT打造新一代的企業 IT - Transforming Enterprise IT
打造新一代的企業 IT - Transforming Enterprise IT
 
Artifical Intelligence and Machine Learning 201, AWS Federal Pop-Up Loft
Artifical Intelligence and Machine Learning 201, AWS Federal Pop-Up LoftArtifical Intelligence and Machine Learning 201, AWS Federal Pop-Up Loft
Artifical Intelligence and Machine Learning 201, AWS Federal Pop-Up Loft
 
NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...
NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...
NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...
 
The Future of Enterprise IT - Lessons Learned
The Future of Enterprise IT - Lessons LearnedThe Future of Enterprise IT - Lessons Learned
The Future of Enterprise IT - Lessons Learned
 

More from Amazon Web Services Korea

More from Amazon Web Services Korea (20)

AWS Modern Infra with Storage Roadshow 2023 - Day 2
AWS Modern Infra with Storage Roadshow 2023 - Day 2AWS Modern Infra with Storage Roadshow 2023 - Day 2
AWS Modern Infra with Storage Roadshow 2023 - Day 2
 
AWS Modern Infra with Storage Roadshow 2023 - Day 1
AWS Modern Infra with Storage Roadshow 2023 - Day 1AWS Modern Infra with Storage Roadshow 2023 - Day 1
AWS Modern Infra with Storage Roadshow 2023 - Day 1
 
사례로 알아보는 Database Migration Service : 데이터베이스 및 데이터 이관, 통합, 분리, 분석의 도구 - 발표자: ...
사례로 알아보는 Database Migration Service : 데이터베이스 및 데이터 이관, 통합, 분리, 분석의 도구 - 발표자: ...사례로 알아보는 Database Migration Service : 데이터베이스 및 데이터 이관, 통합, 분리, 분석의 도구 - 발표자: ...
사례로 알아보는 Database Migration Service : 데이터베이스 및 데이터 이관, 통합, 분리, 분석의 도구 - 발표자: ...
 
Amazon Elasticache - Fully managed, Redis & Memcached Compatible Service (Lev...
Amazon Elasticache - Fully managed, Redis & Memcached Compatible Service (Lev...Amazon Elasticache - Fully managed, Redis & Memcached Compatible Service (Lev...
Amazon Elasticache - Fully managed, Redis & Memcached Compatible Service (Lev...
 
Internal Architecture of Amazon Aurora (Level 400) - 발표자: 정달영, APAC RDS Speci...
Internal Architecture of Amazon Aurora (Level 400) - 발표자: 정달영, APAC RDS Speci...Internal Architecture of Amazon Aurora (Level 400) - 발표자: 정달영, APAC RDS Speci...
Internal Architecture of Amazon Aurora (Level 400) - 발표자: 정달영, APAC RDS Speci...
 
[Keynote] 슬기로운 AWS 데이터베이스 선택하기 - 발표자: 강민석, Korea Database SA Manager, WWSO, A...
[Keynote] 슬기로운 AWS 데이터베이스 선택하기 - 발표자: 강민석, Korea Database SA Manager, WWSO, A...[Keynote] 슬기로운 AWS 데이터베이스 선택하기 - 발표자: 강민석, Korea Database SA Manager, WWSO, A...
[Keynote] 슬기로운 AWS 데이터베이스 선택하기 - 발표자: 강민석, Korea Database SA Manager, WWSO, A...
 
Demystify Streaming on AWS - 발표자: 이종혁, Sr Analytics Specialist, WWSO, AWS :::...
Demystify Streaming on AWS - 발표자: 이종혁, Sr Analytics Specialist, WWSO, AWS :::...Demystify Streaming on AWS - 발표자: 이종혁, Sr Analytics Specialist, WWSO, AWS :::...
Demystify Streaming on AWS - 발표자: 이종혁, Sr Analytics Specialist, WWSO, AWS :::...
 
Amazon EMR - Enhancements on Cost/Performance, Serverless - 발표자: 김기영, Sr Anal...
Amazon EMR - Enhancements on Cost/Performance, Serverless - 발표자: 김기영, Sr Anal...Amazon EMR - Enhancements on Cost/Performance, Serverless - 발표자: 김기영, Sr Anal...
Amazon EMR - Enhancements on Cost/Performance, Serverless - 발표자: 김기영, Sr Anal...
 
Amazon OpenSearch - Use Cases, Security/Observability, Serverless and Enhance...
Amazon OpenSearch - Use Cases, Security/Observability, Serverless and Enhance...Amazon OpenSearch - Use Cases, Security/Observability, Serverless and Enhance...
Amazon OpenSearch - Use Cases, Security/Observability, Serverless and Enhance...
 
Enabling Agility with Data Governance - 발표자: 김성연, Analytics Specialist, WWSO,...
Enabling Agility with Data Governance - 발표자: 김성연, Analytics Specialist, WWSO,...Enabling Agility with Data Governance - 발표자: 김성연, Analytics Specialist, WWSO,...
Enabling Agility with Data Governance - 발표자: 김성연, Analytics Specialist, WWSO,...
 
Amazon Redshift Deep Dive - Serverless, Streaming, ML, Auto Copy (New feature...
Amazon Redshift Deep Dive - Serverless, Streaming, ML, Auto Copy (New feature...Amazon Redshift Deep Dive - Serverless, Streaming, ML, Auto Copy (New feature...
Amazon Redshift Deep Dive - Serverless, Streaming, ML, Auto Copy (New feature...
 
From Insights to Action, How to build and maintain a Data Driven Organization...
From Insights to Action, How to build and maintain a Data Driven Organization...From Insights to Action, How to build and maintain a Data Driven Organization...
From Insights to Action, How to build and maintain a Data Driven Organization...
 
[Keynote] Accelerating Business Outcomes with AWS Data - 발표자: Saeed Gharadagh...
[Keynote] Accelerating Business Outcomes with AWS Data - 발표자: Saeed Gharadagh...[Keynote] Accelerating Business Outcomes with AWS Data - 발표자: Saeed Gharadagh...
[Keynote] Accelerating Business Outcomes with AWS Data - 발표자: Saeed Gharadagh...
 
Amazon DynamoDB - Use Cases and Cost Optimization - 발표자: 이혁, DynamoDB Special...
Amazon DynamoDB - Use Cases and Cost Optimization - 발표자: 이혁, DynamoDB Special...Amazon DynamoDB - Use Cases and Cost Optimization - 발표자: 이혁, DynamoDB Special...
Amazon DynamoDB - Use Cases and Cost Optimization - 발표자: 이혁, DynamoDB Special...
 
LG전자 - Amazon Aurora 및 RDS 블루/그린 배포를 이용한 데이터베이스 업그레이드 안정성 확보 - 발표자: 이은경 책임, L...
LG전자 - Amazon Aurora 및 RDS 블루/그린 배포를 이용한 데이터베이스 업그레이드 안정성 확보 - 발표자: 이은경 책임, L...LG전자 - Amazon Aurora 및 RDS 블루/그린 배포를 이용한 데이터베이스 업그레이드 안정성 확보 - 발표자: 이은경 책임, L...
LG전자 - Amazon Aurora 및 RDS 블루/그린 배포를 이용한 데이터베이스 업그레이드 안정성 확보 - 발표자: 이은경 책임, L...
 
KB국민카드 - 클라우드 기반 분석 플랫폼 혁신 여정 - 발표자: 박창용 과장, 데이터전략본부, AI혁신부, KB카드│강병억, Soluti...
KB국민카드 - 클라우드 기반 분석 플랫폼 혁신 여정 - 발표자: 박창용 과장, 데이터전략본부, AI혁신부, KB카드│강병억, Soluti...KB국민카드 - 클라우드 기반 분석 플랫폼 혁신 여정 - 발표자: 박창용 과장, 데이터전략본부, AI혁신부, KB카드│강병억, Soluti...
KB국민카드 - 클라우드 기반 분석 플랫폼 혁신 여정 - 발표자: 박창용 과장, 데이터전략본부, AI혁신부, KB카드│강병억, Soluti...
 
SK Telecom - 망관리 프로젝트 TANGO의 오픈소스 데이터베이스 전환 여정 - 발표자 : 박승전, Project Manager, ...
SK Telecom - 망관리 프로젝트 TANGO의 오픈소스 데이터베이스 전환 여정 - 발표자 : 박승전, Project Manager, ...SK Telecom - 망관리 프로젝트 TANGO의 오픈소스 데이터베이스 전환 여정 - 발표자 : 박승전, Project Manager, ...
SK Telecom - 망관리 프로젝트 TANGO의 오픈소스 데이터베이스 전환 여정 - 발표자 : 박승전, Project Manager, ...
 
코리안리 - 데이터 분석 플랫폼 구축 여정, 그 시작과 과제 - 발표자: 김석기 그룹장, 데이터비즈니스센터, 메가존클라우드 ::: AWS ...
코리안리 - 데이터 분석 플랫폼 구축 여정, 그 시작과 과제 - 발표자: 김석기 그룹장, 데이터비즈니스센터, 메가존클라우드 ::: AWS ...코리안리 - 데이터 분석 플랫폼 구축 여정, 그 시작과 과제 - 발표자: 김석기 그룹장, 데이터비즈니스센터, 메가존클라우드 ::: AWS ...
코리안리 - 데이터 분석 플랫폼 구축 여정, 그 시작과 과제 - 발표자: 김석기 그룹장, 데이터비즈니스센터, 메가존클라우드 ::: AWS ...
 
LG 이노텍 - Amazon Redshift Serverless를 활용한 데이터 분석 플랫폼 혁신 과정 - 발표자: 유재상 선임, LG이노...
LG 이노텍 - Amazon Redshift Serverless를 활용한 데이터 분석 플랫폼 혁신 과정 - 발표자: 유재상 선임, LG이노...LG 이노텍 - Amazon Redshift Serverless를 활용한 데이터 분석 플랫폼 혁신 과정 - 발표자: 유재상 선임, LG이노...
LG 이노텍 - Amazon Redshift Serverless를 활용한 데이터 분석 플랫폼 혁신 과정 - 발표자: 유재상 선임, LG이노...
 
[Keynote] Data Driven Organizations with AWS Data - 발표자: Agnes Panosian, Head...
[Keynote] Data Driven Organizations with AWS Data - 발표자: Agnes Panosian, Head...[Keynote] Data Driven Organizations with AWS Data - 발표자: Agnes Panosian, Head...
[Keynote] Data Driven Organizations with AWS Data - 발표자: Agnes Panosian, Head...
 

Recently uploaded

Recently uploaded (20)

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 

Datarobot, 자동화된 분석 적용 시 분석 절차의 변화 및 효용 - 홍운표 데이터 사이언티스트, DataRobot :: AWS Summit Seoul 2019

  • 1. S U M M I T S E O U L © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 2. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. DataRobot, Automated ML changes in methodology and benefits 홍운표 Customer Facing Data Scientist DataRobot, Korea
  • 3. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Agenda 1. DataRobot Introduction Corporate Vision, Introduction 2. Data Science and Practices Science Methodology, Data Scientist, Agony 3. Automated ML What & How, Benefits, Live Demo 4. Future Direction What & How
  • 4. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 5. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Strategic Vision Enabling the AI-Driven Enterprise Where AI is applied in every business process to predict outcomes. The AI-Driven Enterprise adapts to new conditions at incredible speeds and continually self-optimizes based on predicting the future. “If your competitor is rushing to build AI and you don’t, it will crush you.” -Elon Musk
  • 6. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. The Opportunity for Machine Learning in Any Business Marketing Predicting customer Lifetime Value (LTV) Churn Customer segmentation Product mix (best product mix to reduce churn) Cross selling/recommendation algorithms Up selling Channel optimization Discount targeting Responses rates Reactivation likelihood Adwords optimization and ad buying In store traffic patterns Aircraft scheduling Sales Lead prioritization Demand forecasting Pricing Market Basket Inventory management / Dynamic Pricing Promos / Upgrades / Offers Human Resources Resume screening Employee churn Training recommendation Talent management Risk Credit risk Fraud detection Accounts Payable Recovery Anti-money laundering Insurance Claims prediction Readmission Risk Warranty Analytics Claim Prediction Logistics Procurement Warehousing Cost Analysis Product life cycle Demand Forecast Assembly Turnover Banking Insurance Healthcare Media Pharma Telco Retail Government Energy Transportation
  • 7. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Filling the Gap Accelerate the process of researching, testing and deploying predictive algorithms. Enable more people to help research, test and deploy predictive algorithms. KEY Demand for predictive models Supply of data scientists Turn Analysts & Engineers into Data Scientists Increase Data Scientist productivity Unmet demand for Data Science
  • 8. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. The world’s most advanced Enterprise Machine Learning Automation platform 2012 Founded, HQ in Boston, MA $224M In funding 1,000,000,000+ Models built on DataRobot Cloud 250+ Data Scientists & Engineers (of 600+) 4 #1 ranked Data Scientists 50+ Top 3 finishes INSURANCE FINTECH HEALTHCARE MARKETING BANKING MANY MORE
  • 9. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Best Practices and Technology The top ranked Data Scientists in the world Owen Zhang Product Advisor Highest: 1st MASTER Xavier Conort Chief Data Scientist Highest: 1st MASTER Sergey Yurgenson Data Scientist Highest: 1st MASTER Amanda Schierz Data Scientist Current: 1st Female, 1st in UK MASTER Jeremy Achin CEO & Co-Founder Highest: 20th MASTER The best technologies in the world Tom de Godoy CTO & Co-Founder Highest: 20th MASTER
  • 10. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 11. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Data Scientists Data Scientist Programming Skills Math & Stats Domain Expertise Required Capabilities 1. Knowledge of the business 2. Knowledge of the data 3. Ability to write code to gather data 4. Ability to write code to explore/inspect data 5. Ability to write code to manipulate data 6. Ability to write code to extract actionable intel 7. Ability to write code to build models 8. Ability to write code to implement models 9. Foundational statistics 10. Internals of algorithms 11. Practical knowledge and experience 12. Knowing how to interpret and explain models
  • 12. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Scientific Methodology (Metaphysics) Karl Popper Observation/Rationale Hypothesis Experiment Theory
  • 13. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Data Science Methodology Due to limited resource, call for amelioration No target goal A few algorithms & prone to overfit Aging of model Not sufficient Explanations
  • 14. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Data Science Landscape Where do you mostly work? Business space Feature space Algorithm space (LoB, DE, DS) (DE, DS) ROI Availability Accuracy LOB : Line of Business DE : Data Engineer (ETL) DS : Data Scientist Actionable predictive model, Valuable insights (LOB, DS)
  • 15. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Algorithm Best Practices Business-driven, feature-oriented analysis Business Feature Business Algorithm Business Feature Feature Algorithm Data Scientist Drives Business Drives Balanced and Promising √
  • 16. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Motivations for AutoML Value of diverse set of algorithms Source: http://statweb.stanford.edu/~tibs/ElemStatLearn/ Methodology driven Problem driven
  • 17. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. And trends Most modelling software using a 20th Century paradigm Expert systems with modelling intelligence
  • 18. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 19. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. What is Automated Machine Learning • 10 steps to building models • An expert system that knows how to do each of these 10 steps, without human instructions • Human friendly – not a black box • Fast and accurate • Replicable data science
  • 20. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. What about DataRobot? Key Points • End to end automated machine learning – all 10 steps are automated • Hundreds of algorithms in the repository with new algorithms being added regularly • Chooses the best algorithms for your data • Best-in-class human-friendly insights • Widest range of deployment options • Enterprise ready • Automatic model reports • Large support team around the world
  • 21. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. DataRobot Workflow
  • 22. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Different but powerful way of analysis A few perspectives (many more) Single model Multiple models Only interpretable algorithm is chosen  Linear model is preferable No need of Hold-out partition : just train/test or k-fold CV Hold-out partition for evaluation of several models Blending starts from existing model Interpretability is model-agnostic Blending is fair-basis reflecting multiple models performance, with speed vs accuracy data Interaction should be considered for model performance (linear model) Interaction automatically reflected in tree-based algorithms. If interaction should be of importance, DR has GA2M model and R/Python api support for that parameter tuning is limited for a model and time-consuming Parameter tuning is exhaustive for all candidate models. One can easily confine the search space and quickly get the results
  • 23. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Benefits : safer model Robust model free from the risk of overfitting Average of these 5 validation scores is the cross validation score The holdout is completely hidden from the models during the training process. After you have selected your optimal model, you can score your model on this to get your holdout score. Partition 1 (TRAINING) Partition 2 (TRAINING) Partition 3 (TRAINING) Partition 4 (TRAINING) Partition 5 (VALIDATION) Holdout Partition 1 (TRAINING) Partition 2 (TRAINING) Partition 3 (TRAINING) Partition 4 (VALIDATION) Partition 5 (TRAINING) Holdout Partition 1 (TRAINING) Partition 2 (TRAINING) Partition 3 (VALIDATION) Partition 4 (TRAINING) Partition 5 (TRAINING) Holdout Partition 1 (TRAINING) Partition 2 (VALIDATION) Partition 3 (TRAINING) Partition 4 (TRAINING) Partition 5 (TRAINING) Holdout Partition 1 (VALIDATION) Partition 2 (TRAINING) Partition 3 (TRAINING) Partition 4 (TRAINING) Partition 5 (TRAINING) Holdout CV-Fold #1 CV-Fold #2 CV-Fold #3 CV-Fold #4 CV-Fold #5
  • 24. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Benefits : more effort on feature space ”Feature engineering is the art of data science” (Sergey Yurgenson) Business space Feature space Algorithm space (LoB, DE, DS) (DE, DS) ROI Availability Accuracy Actionable predictive model, Valuable insights (LOB, DS) Feature Impact Feature Effects Prediction Explanations
  • 25. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Benefits : Explainability Model-agnostic explanation [Feature Impact] [Feature Effect] [Prediction Explanation] • The importance of each feature • Coincides with domain knowledge? • Any new insights? • Relationship among target and a feature • Relationship reflects domain knowledge? • Any new insights or feature transform? • What is the basis of prediction? • The predictions are reliablable to business people?
  • 26. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Benefits : effective blending Search over candidates which promises tangible improvement
  • 27. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Benefits : Hyper-param Tuning Gradient-free and effective pattern search
  • 28. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Benefits : API integration data scientists and developers can use API Application server Prediction worker RestAPI, R/Python pkg Model Factory Automatic Model Refresh Model Diags & Viz Feature Engineering App. Integration Custom Analysis and various Analysis Notebook Web UIConsole US, EU region
  • 29. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 30. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Demo : Bleedout prediction Binary classification for QA Process: Coating of thin film by covering the surface with coating solution and drying, followed by polymerizing with UV-light. Problem: Unintended precipitation of powder such as unpolymerized monomer, antioxidant occurs causing “bleedout”. It spoils the product and contaminates the production line. Data: ● Material: length of film roll ● Project type: production vs experiment ● Control: winding tension, UV-exposure duration, O2 concentration etc Winding in Coating Drying Tension UV exposure (O2 purged)
  • 31. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Demo : Bleedout MFG process 1) Unwinding 2) Coating 3) Drying 4) UV exposure 5) Tension Control 6) Winding
  • 32. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Demo : Jupyter Notebook Sagemaker Notebook Automatically Project Created & Run
  • 33. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 34. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Full-fledged Automation Expanded coverage of automation, full automation Consumption Consuming and application of advanced analytics in the form of dashboards, decisions, and analytics powered applications. Prep, Blend, Agg and ETL Self-Service BI, data prep, blending, transformation, feature engineering, and sharing of insights. Data pipeline and workflow execution. Data Management Data cataloging, organization, and collaboration. Automatic indexing and knowledge gathering made available to the entire organization. Analytics (Advanced, Simple) Simple: Self-Service BI, charts, graphs, tables, queries. Advanced: Automated data investigation for insights, predictions, and recommendations Deployment Powering business applications by providing advanced analytics insights, predictions, monitoring, and refresh on new data. Hosted as an API, SDK, or code.
  • 35. 여러분의 피드백을 기다립니다! #AWSSummit 해시태그로 소셜미디어에 여러분의 행사소감을 올려주세요. AWS Summit Seoul 2019 모바일 앱과 QR코드를 통해 강연평가 및 설문조사에 참여해 주시기 바랍니다. 내년 Summit을 만들 여러분의 소중한 의견 부탁 드립니다. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 36. Thank you! © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. 홍운표 woonpyo.hong@datarobot.com
  • 37. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.