SlideShare une entreprise Scribd logo
1  sur  34
Sridhar Ratakonda
Founder, PredixDATA, LLC
http://www.predixdata.com
Machine learning /
Algorithms
&
Business use cases
What is Statistical learning?
Let’s say you want to associate sales based on advertising channel.
Input variables “Xn” => “TV budget”, “Radio budget”, “newspaper budget”
Output variable “Y” => Sales
Y = f(X) + ͼ
Statistical learning refers to set of ways for estimating “f”
Estimate of “f” / Prediction
In many situations, a set of inputs X are readily
available, but the output Y cannot be easily obtained.
we can predict Y using Yˆ = ˆf(X),
fˆ = estimate for f
Yˆ = resulting prediction for Y
Ex: Predicting sales based on advertisement spend
Estimate of “f” / Inference 1 of 2
In some cases we want to understand how Y changes as
a function of X1,...,Xp.
• Which predictors are associated with the response?
• What is the relationship between the response and
each predictor?
• Can the relationship between Y and each predictor
be adequately summarized using a linear equation
Estimating “f”
Broadly speaking two methods are applied:
• Parametric
• Non-Parametric
Parametric models 1 of 2
Parametric methods involve a three-step model-based
approach.
I. First, make an assumption about shape, of f. For example,
one very simple assumption is that f is linear in X: f(X) = β0
+ β1X1 + β2X2 + ... + βpXp.
II. After a model has been selected, uses the training data to
fit or train the model. Solve for parameters (β0, β1, …..)
Y ≈ β0 + β1X1 + β2X2 + ... + βpXp.
III. Apply the model to predict on test data
Parametric models 2 of 2 PROS
• Fewer observations needed
• Simpler to model
CONS
• Not flexible
income ≈ β0 + β1 × education + β2 × seniority.
Non-Parametric models 1 of 2
 Non-parametric methods do not make explicit assumptions about
the functional form of f
 Instead they seek an estimate of f that gets as close to the data
points as possible
 Accurately fits known data (train data)
 Optimized to fit existing data
 High variability for true data
Non-Parametric models 2 of 2
Smooth thin-plate spline fit
Trade-Off / Prediction accuracy and Model interpretability
Supervised Vs. Unsupervised Learning Part 1 0f 3
Supervised learning
 For each observation of the predictor measurement(s) xi,
i = 1,...,n there is an associated response measurement yi.
 linear regression, logistic regression, boosting, support
vec- regression (SVM) etc.
 Majority of statistical models fall under “supervised mode”
Supervised Vs. Unsupervised Learning Part 2 0f 3
Unsupervised learning
 Unsupervised learning describes situation in which for
every observation i = 1,...,n, we observe a vector of
measurements xi but no associated response variable
 No response variable to fit
 Ex: Cluster analysis for customer segmentation
Unsupervised Learning - Clustering
Regression Vs. Classification
Classification model use cases
 Spam Filter
 Google news classification
 Cancel cell classification (Benign, Malignant)
Machine learning process / Lab
Ex: Titanic Data set in KDNuggets
Lab: Titanic.R
Assessing model accuracy / Quality of fit
For regression model Numnber of test data
elements
Mean Squared error
Actual value
Predicted value
Assessing model accuracy / Quality of fit
For Classification models Predicted value
Actual value
Numnber of test data
elements
Top Machine learning algorithms and business
use cases
Decision trees
Structured way to arrive at a logical
conclusion
Business use cases
 Option pricing
 Pattern recognition
“R” library -> caret
Naïve Bayes Classification
Simple probabilistic classifiers
(Baye’s theorem)
Business use cases
 Sentiment analysis (ex: FB
analyses status updates)
 Classify spam mails
“R” library -> e1071
Simple Linear Regression
Business use cases
 Predicting sales
 Risk assessment
“R” library -> stats
Logistics Regression Modeling a binomial outcome with one
or more explanatory variables
 Measures the relationship between
the categorical dependent variable and
one or more independent variables
Business use cases
 Weather prediction / Credit scoring
“R” library -> MASS
Support Vector Machines (SVM)
Support Vectors are co-
ordinates of individual
observation (ex: 45,150)
SVMis a frontier which best
segregates the Male from the
Females
“R” library -> e1071
Random Forest When you can’t think of any
algorithm use “Random Forest”
“R” library -> randomForest
Simple linear regression 1 of 3
Linear regression assumes that there is approximately
a linear relationship between X and Y.
Y ≈ β0 + β1X (regressing Y on X)
(Ex) Sales ≈ β0 + β1 × TV
Predicted variable SlopeY intercept
Simple linear regression 2 of 3
Let
Then
additional $1,000 spent on TV advertising = approximately 47.5 additional units
Simple linear regression 3 of 3
Accuracy of estimates (standard error) 1 of 2
A true relationship between Y & X takes the form
Standard error
 Standard error is introduced because model is calculated using
“available data” (sample data)
 Whole population data is not known during modeling and hence
introduction of error
Accuracy of estimates (standard error) 2 of 2
Standard errors can be used to compute confidence intervals
For linear regression, the 95 % confidence interval for β1, β0
approximately takes the form:
In the case of the advertising data, the 95 % confidence interval for
β0 is [6.130, 7.935] and the 95 % confidence interval for β1 is
[0.042, 0.053].
Interpreting standard error in regression
LAB Advertising (Summary output)
Accuracy of the model
 Residual Standard Error (RSE) is used to measure
accuracy of the model
 Roughly speaking, it is the average amount that the
response will deviate from the true regression line.
Interpreting RSE &
For advertising data RSE = 3.26 i.e. 3,260 units
difference in sales
Average sales = 14,000 units
%error = 3260/14000 = 23%
indicates variability of “Y” explained using “X”
ABOUT ME
25 years in Technology Industry
LinkedIn Profile:
https://www.linkedin.com/in/ratakondas/
Experience working for multiple early stage
startups and leading global teams
Current
Principal Founder – PredixDATA
(a analytics/bigdata service company)
Board of managers – Syntilla (stealth startup)

Contenu connexe

Tendances

Machine learning session7(nb classifier k-nn)
Machine learning   session7(nb classifier k-nn)Machine learning   session7(nb classifier k-nn)
Machine learning session7(nb classifier k-nn)Abhimanyu Dwivedi
 
Handling Imbalanced Data: SMOTE vs. Random Undersampling
Handling Imbalanced Data: SMOTE vs. Random UndersamplingHandling Imbalanced Data: SMOTE vs. Random Undersampling
Handling Imbalanced Data: SMOTE vs. Random UndersamplingIRJET Journal
 
Linear Regression in R
Linear Regression in RLinear Regression in R
Linear Regression in REdureka!
 
Machine learning Algorithms
Machine learning AlgorithmsMachine learning Algorithms
Machine learning AlgorithmsWalaa Hamdy Assy
 
MachineLearning.ppt
MachineLearning.pptMachineLearning.ppt
MachineLearning.pptbutest
 
Machine learning session9(clustering)
Machine learning   session9(clustering)Machine learning   session9(clustering)
Machine learning session9(clustering)Abhimanyu Dwivedi
 
Linear Regression Algorithm | Linear Regression in R | Data Science Training ...
Linear Regression Algorithm | Linear Regression in R | Data Science Training ...Linear Regression Algorithm | Linear Regression in R | Data Science Training ...
Linear Regression Algorithm | Linear Regression in R | Data Science Training ...Edureka!
 
Machine Learning - Decision Trees
Machine Learning - Decision TreesMachine Learning - Decision Trees
Machine Learning - Decision TreesRupak Roy
 
Machine Learning Decision Tree Algorithms
Machine Learning Decision Tree AlgorithmsMachine Learning Decision Tree Algorithms
Machine Learning Decision Tree AlgorithmsRupak Roy
 
Module 5: Decision Trees
Module 5: Decision TreesModule 5: Decision Trees
Module 5: Decision TreesSara Hooker
 

Tendances (13)

Machine learning session7(nb classifier k-nn)
Machine learning   session7(nb classifier k-nn)Machine learning   session7(nb classifier k-nn)
Machine learning session7(nb classifier k-nn)
 
Handling Imbalanced Data: SMOTE vs. Random Undersampling
Handling Imbalanced Data: SMOTE vs. Random UndersamplingHandling Imbalanced Data: SMOTE vs. Random Undersampling
Handling Imbalanced Data: SMOTE vs. Random Undersampling
 
Linear Regression in R
Linear Regression in RLinear Regression in R
Linear Regression in R
 
Machine learning Algorithms
Machine learning AlgorithmsMachine learning Algorithms
Machine learning Algorithms
 
MachineLearning.ppt
MachineLearning.pptMachineLearning.ppt
MachineLearning.ppt
 
Machine learning session9(clustering)
Machine learning   session9(clustering)Machine learning   session9(clustering)
Machine learning session9(clustering)
 
Linear Regression Algorithm | Linear Regression in R | Data Science Training ...
Linear Regression Algorithm | Linear Regression in R | Data Science Training ...Linear Regression Algorithm | Linear Regression in R | Data Science Training ...
Linear Regression Algorithm | Linear Regression in R | Data Science Training ...
 
Borderline Smote
Borderline SmoteBorderline Smote
Borderline Smote
 
Machine Learning - Decision Trees
Machine Learning - Decision TreesMachine Learning - Decision Trees
Machine Learning - Decision Trees
 
Machine Learning Decision Tree Algorithms
Machine Learning Decision Tree AlgorithmsMachine Learning Decision Tree Algorithms
Machine Learning Decision Tree Algorithms
 
Machine learning session1
Machine learning   session1Machine learning   session1
Machine learning session1
 
Decision tree and random forest
Decision tree and random forestDecision tree and random forest
Decision tree and random forest
 
Module 5: Decision Trees
Module 5: Decision TreesModule 5: Decision Trees
Module 5: Decision Trees
 

En vedette

Introdução a Machine Learning e TensorFlow
Introdução a Machine Learning e TensorFlowIntrodução a Machine Learning e TensorFlow
Introdução a Machine Learning e TensorFlowGuilherme Campos
 
How to Become a Thought Leader in Your Niche
How to Become a Thought Leader in Your NicheHow to Become a Thought Leader in Your Niche
How to Become a Thought Leader in Your NicheLeslie Samuel
 
More Than A Feeling - How to Quantify Emotion in CX
More Than A Feeling - How to Quantify Emotion in CXMore Than A Feeling - How to Quantify Emotion in CX
More Than A Feeling - How to Quantify Emotion in CXMattersight
 
The Future's so Bright
The Future's so Bright  The Future's so Bright
The Future's so Bright Mattersight
 
Happy Together - The Analytics Answer to a More Engaged Workforce
Happy Together - The Analytics Answer to a More Engaged WorkforceHappy Together - The Analytics Answer to a More Engaged Workforce
Happy Together - The Analytics Answer to a More Engaged WorkforceMattersight
 
Knowing Me Knowing You - Understanding the 6 Employee Personality Types
Knowing Me Knowing You - Understanding the 6 Employee Personality TypesKnowing Me Knowing You - Understanding the 6 Employee Personality Types
Knowing Me Knowing You - Understanding the 6 Employee Personality TypesMattersight
 
Let's Stay Together - Hiring For Keeps in a Candidate-Driven Market
Let's Stay Together - Hiring For Keeps in a Candidate-Driven Market   Let's Stay Together - Hiring For Keeps in a Candidate-Driven Market
Let's Stay Together - Hiring For Keeps in a Candidate-Driven Market Mattersight
 
Machine Learning Use Cases with Azure
Machine Learning Use Cases with AzureMachine Learning Use Cases with Azure
Machine Learning Use Cases with AzureChris McHenry
 
Machine Learning and its Use Cases (dsth Meetup#3)
Machine Learning and its Use Cases (dsth Meetup#3)Machine Learning and its Use Cases (dsth Meetup#3)
Machine Learning and its Use Cases (dsth Meetup#3)Data Science Thailand
 
Deep Learning: Towards General Artificial Intelligence
Deep Learning: Towards General Artificial IntelligenceDeep Learning: Towards General Artificial Intelligence
Deep Learning: Towards General Artificial IntelligenceRukshan Batuwita
 
클라우드 뉴노멀 시대의 글로벌 혁신 기업들의 디지털 트랜스포메이션 :: 정우진 이사
클라우드 뉴노멀 시대의 글로벌 혁신 기업들의 디지털 트랜스포메이션 :: 정우진 이사클라우드 뉴노멀 시대의 글로벌 혁신 기업들의 디지털 트랜스포메이션 :: 정우진 이사
클라우드 뉴노멀 시대의 글로벌 혁신 기업들의 디지털 트랜스포메이션 :: 정우진 이사Amazon Web Services Korea
 
Les évolutions adaptatives
Les évolutions adaptativesLes évolutions adaptatives
Les évolutions adaptativesRESPONSIV
 
The Coming Intelligent Digital Assistant Era and Its Impact on Online Platforms
The Coming Intelligent Digital Assistant Era and Its Impact on Online PlatformsThe Coming Intelligent Digital Assistant Era and Its Impact on Online Platforms
The Coming Intelligent Digital Assistant Era and Its Impact on Online PlatformsCognizant
 

En vedette (16)

Introdução a Machine Learning e TensorFlow
Introdução a Machine Learning e TensorFlowIntrodução a Machine Learning e TensorFlow
Introdução a Machine Learning e TensorFlow
 
SEO UZMANI - SEO
SEO UZMANI - SEOSEO UZMANI - SEO
SEO UZMANI - SEO
 
How to Become a Thought Leader in Your Niche
How to Become a Thought Leader in Your NicheHow to Become a Thought Leader in Your Niche
How to Become a Thought Leader in Your Niche
 
Feel Good Inc.
Feel Good Inc. Feel Good Inc.
Feel Good Inc.
 
More Than A Feeling - How to Quantify Emotion in CX
More Than A Feeling - How to Quantify Emotion in CXMore Than A Feeling - How to Quantify Emotion in CX
More Than A Feeling - How to Quantify Emotion in CX
 
The Future's so Bright
The Future's so Bright  The Future's so Bright
The Future's so Bright
 
Jump for Joy
Jump for Joy Jump for Joy
Jump for Joy
 
Happy Together - The Analytics Answer to a More Engaged Workforce
Happy Together - The Analytics Answer to a More Engaged WorkforceHappy Together - The Analytics Answer to a More Engaged Workforce
Happy Together - The Analytics Answer to a More Engaged Workforce
 
Knowing Me Knowing You - Understanding the 6 Employee Personality Types
Knowing Me Knowing You - Understanding the 6 Employee Personality TypesKnowing Me Knowing You - Understanding the 6 Employee Personality Types
Knowing Me Knowing You - Understanding the 6 Employee Personality Types
 
Let's Stay Together - Hiring For Keeps in a Candidate-Driven Market
Let's Stay Together - Hiring For Keeps in a Candidate-Driven Market   Let's Stay Together - Hiring For Keeps in a Candidate-Driven Market
Let's Stay Together - Hiring For Keeps in a Candidate-Driven Market
 
Machine Learning Use Cases with Azure
Machine Learning Use Cases with AzureMachine Learning Use Cases with Azure
Machine Learning Use Cases with Azure
 
Machine Learning and its Use Cases (dsth Meetup#3)
Machine Learning and its Use Cases (dsth Meetup#3)Machine Learning and its Use Cases (dsth Meetup#3)
Machine Learning and its Use Cases (dsth Meetup#3)
 
Deep Learning: Towards General Artificial Intelligence
Deep Learning: Towards General Artificial IntelligenceDeep Learning: Towards General Artificial Intelligence
Deep Learning: Towards General Artificial Intelligence
 
클라우드 뉴노멀 시대의 글로벌 혁신 기업들의 디지털 트랜스포메이션 :: 정우진 이사
클라우드 뉴노멀 시대의 글로벌 혁신 기업들의 디지털 트랜스포메이션 :: 정우진 이사클라우드 뉴노멀 시대의 글로벌 혁신 기업들의 디지털 트랜스포메이션 :: 정우진 이사
클라우드 뉴노멀 시대의 글로벌 혁신 기업들의 디지털 트랜스포메이션 :: 정우진 이사
 
Les évolutions adaptatives
Les évolutions adaptativesLes évolutions adaptatives
Les évolutions adaptatives
 
The Coming Intelligent Digital Assistant Era and Its Impact on Online Platforms
The Coming Intelligent Digital Assistant Era and Its Impact on Online PlatformsThe Coming Intelligent Digital Assistant Era and Its Impact on Online Platforms
The Coming Intelligent Digital Assistant Era and Its Impact on Online Platforms
 

Similaire à Machine learning algorithms and business use cases

Supervised learning - Linear and Logistic Regression( AI, ML)
Supervised learning - Linear and Logistic Regression( AI, ML)Supervised learning - Linear and Logistic Regression( AI, ML)
Supervised learning - Linear and Logistic Regression( AI, ML)Rahul Pal
 
BOOTSTRAPPING TO EVALUATE RESPONSE MODELS: A SAS® MACRO
BOOTSTRAPPING TO EVALUATE RESPONSE MODELS: A SAS® MACROBOOTSTRAPPING TO EVALUATE RESPONSE MODELS: A SAS® MACRO
BOOTSTRAPPING TO EVALUATE RESPONSE MODELS: A SAS® MACROAnthony Kilili
 
Assessing Model Performance - Beginner's Guide
Assessing Model Performance - Beginner's GuideAssessing Model Performance - Beginner's Guide
Assessing Model Performance - Beginner's GuideMegan Verbakel
 
Top 100+ Google Data Science Interview Questions.pdf
Top 100+ Google Data Science Interview Questions.pdfTop 100+ Google Data Science Interview Questions.pdf
Top 100+ Google Data Science Interview Questions.pdfDatacademy.ai
 
An Introduction to boosting
An Introduction to boostingAn Introduction to boosting
An Introduction to boostingbutest
 
Sarcia idoese08
Sarcia idoese08Sarcia idoese08
Sarcia idoese08asarcia
 
MLPA for health care presentation smc
MLPA for health care presentation   smcMLPA for health care presentation   smc
MLPA for health care presentation smcShaun Comfort
 
Machine learning session4(linear regression)
Machine learning   session4(linear regression)Machine learning   session4(linear regression)
Machine learning session4(linear regression)Abhimanyu Dwivedi
 
IRJET- Disease Prediction using Machine Learning
IRJET-  Disease Prediction using Machine LearningIRJET-  Disease Prediction using Machine Learning
IRJET- Disease Prediction using Machine LearningIRJET Journal
 
PREDICTING BANKRUPTCY USING MACHINE LEARNING ALGORITHMS
PREDICTING BANKRUPTCY USING MACHINE LEARNING ALGORITHMSPREDICTING BANKRUPTCY USING MACHINE LEARNING ALGORITHMS
PREDICTING BANKRUPTCY USING MACHINE LEARNING ALGORITHMSIJCI JOURNAL
 
Machine learning session5(logistic regression)
Machine learning   session5(logistic regression)Machine learning   session5(logistic regression)
Machine learning session5(logistic regression)Abhimanyu Dwivedi
 
chap4_Parametric_Methods.ppt
chap4_Parametric_Methods.pptchap4_Parametric_Methods.ppt
chap4_Parametric_Methods.pptShayanChowdary
 
ForecastingBUS255 GoalsBy the end of this chapter, y.docx
ForecastingBUS255 GoalsBy the end of this chapter, y.docxForecastingBUS255 GoalsBy the end of this chapter, y.docx
ForecastingBUS255 GoalsBy the end of this chapter, y.docxbudbarber38650
 
WEKA: Credibility Evaluating Whats Been Learned
WEKA: Credibility Evaluating Whats Been LearnedWEKA: Credibility Evaluating Whats Been Learned
WEKA: Credibility Evaluating Whats Been LearnedDataminingTools Inc
 
WEKA:Credibility Evaluating Whats Been Learned
WEKA:Credibility Evaluating Whats Been LearnedWEKA:Credibility Evaluating Whats Been Learned
WEKA:Credibility Evaluating Whats Been Learnedweka Content
 
V. pacáková, d. brebera
V. pacáková, d. breberaV. pacáková, d. brebera
V. pacáková, d. breberalogyalaa
 

Similaire à Machine learning algorithms and business use cases (20)

Supervised learning - Linear and Logistic Regression( AI, ML)
Supervised learning - Linear and Logistic Regression( AI, ML)Supervised learning - Linear and Logistic Regression( AI, ML)
Supervised learning - Linear and Logistic Regression( AI, ML)
 
BOOTSTRAPPING TO EVALUATE RESPONSE MODELS: A SAS® MACRO
BOOTSTRAPPING TO EVALUATE RESPONSE MODELS: A SAS® MACROBOOTSTRAPPING TO EVALUATE RESPONSE MODELS: A SAS® MACRO
BOOTSTRAPPING TO EVALUATE RESPONSE MODELS: A SAS® MACRO
 
Assessing Model Performance - Beginner's Guide
Assessing Model Performance - Beginner's GuideAssessing Model Performance - Beginner's Guide
Assessing Model Performance - Beginner's Guide
 
Top 100+ Google Data Science Interview Questions.pdf
Top 100+ Google Data Science Interview Questions.pdfTop 100+ Google Data Science Interview Questions.pdf
Top 100+ Google Data Science Interview Questions.pdf
 
Bivariate Regression
Bivariate RegressionBivariate Regression
Bivariate Regression
 
Linear_Regression
Linear_RegressionLinear_Regression
Linear_Regression
 
An Introduction to boosting
An Introduction to boostingAn Introduction to boosting
An Introduction to boosting
 
Sarcia idoese08
Sarcia idoese08Sarcia idoese08
Sarcia idoese08
 
MLPA for health care presentation smc
MLPA for health care presentation   smcMLPA for health care presentation   smc
MLPA for health care presentation smc
 
Machine learning session4(linear regression)
Machine learning   session4(linear regression)Machine learning   session4(linear regression)
Machine learning session4(linear regression)
 
IRJET- Disease Prediction using Machine Learning
IRJET-  Disease Prediction using Machine LearningIRJET-  Disease Prediction using Machine Learning
IRJET- Disease Prediction using Machine Learning
 
PREDICTING BANKRUPTCY USING MACHINE LEARNING ALGORITHMS
PREDICTING BANKRUPTCY USING MACHINE LEARNING ALGORITHMSPREDICTING BANKRUPTCY USING MACHINE LEARNING ALGORITHMS
PREDICTING BANKRUPTCY USING MACHINE LEARNING ALGORITHMS
 
Machine learning session5(logistic regression)
Machine learning   session5(logistic regression)Machine learning   session5(logistic regression)
Machine learning session5(logistic regression)
 
chap4_Parametric_Methods.ppt
chap4_Parametric_Methods.pptchap4_Parametric_Methods.ppt
chap4_Parametric_Methods.ppt
 
ForecastingBUS255 GoalsBy the end of this chapter, y.docx
ForecastingBUS255 GoalsBy the end of this chapter, y.docxForecastingBUS255 GoalsBy the end of this chapter, y.docx
ForecastingBUS255 GoalsBy the end of this chapter, y.docx
 
Machine_Learning.pptx
Machine_Learning.pptxMachine_Learning.pptx
Machine_Learning.pptx
 
WEKA: Credibility Evaluating Whats Been Learned
WEKA: Credibility Evaluating Whats Been LearnedWEKA: Credibility Evaluating Whats Been Learned
WEKA: Credibility Evaluating Whats Been Learned
 
WEKA:Credibility Evaluating Whats Been Learned
WEKA:Credibility Evaluating Whats Been LearnedWEKA:Credibility Evaluating Whats Been Learned
WEKA:Credibility Evaluating Whats Been Learned
 
Errors2
Errors2Errors2
Errors2
 
V. pacáková, d. brebera
V. pacáková, d. breberaV. pacáková, d. brebera
V. pacáková, d. brebera
 

Dernier

CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramMoniSankarHazra
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...shambhavirathore45
 
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service OnlineCALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Onlineanilsa9823
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...shivangimorya083
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Delhi Call girls
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Delhi Call girls
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023ymrp368
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxolyaivanovalion
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 

Dernier (20)

CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service OnlineCALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptx
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 

Machine learning algorithms and business use cases

  • 1. Sridhar Ratakonda Founder, PredixDATA, LLC http://www.predixdata.com Machine learning / Algorithms & Business use cases
  • 2. What is Statistical learning? Let’s say you want to associate sales based on advertising channel. Input variables “Xn” => “TV budget”, “Radio budget”, “newspaper budget” Output variable “Y” => Sales Y = f(X) + ͼ Statistical learning refers to set of ways for estimating “f”
  • 3. Estimate of “f” / Prediction In many situations, a set of inputs X are readily available, but the output Y cannot be easily obtained. we can predict Y using Yˆ = ˆf(X), fˆ = estimate for f Yˆ = resulting prediction for Y Ex: Predicting sales based on advertisement spend
  • 4. Estimate of “f” / Inference 1 of 2 In some cases we want to understand how Y changes as a function of X1,...,Xp. • Which predictors are associated with the response? • What is the relationship between the response and each predictor? • Can the relationship between Y and each predictor be adequately summarized using a linear equation
  • 5. Estimating “f” Broadly speaking two methods are applied: • Parametric • Non-Parametric
  • 6. Parametric models 1 of 2 Parametric methods involve a three-step model-based approach. I. First, make an assumption about shape, of f. For example, one very simple assumption is that f is linear in X: f(X) = β0 + β1X1 + β2X2 + ... + βpXp. II. After a model has been selected, uses the training data to fit or train the model. Solve for parameters (β0, β1, …..) Y ≈ β0 + β1X1 + β2X2 + ... + βpXp. III. Apply the model to predict on test data
  • 7. Parametric models 2 of 2 PROS • Fewer observations needed • Simpler to model CONS • Not flexible income ≈ β0 + β1 × education + β2 × seniority.
  • 8. Non-Parametric models 1 of 2  Non-parametric methods do not make explicit assumptions about the functional form of f  Instead they seek an estimate of f that gets as close to the data points as possible  Accurately fits known data (train data)  Optimized to fit existing data  High variability for true data
  • 9. Non-Parametric models 2 of 2 Smooth thin-plate spline fit
  • 10. Trade-Off / Prediction accuracy and Model interpretability
  • 11. Supervised Vs. Unsupervised Learning Part 1 0f 3 Supervised learning  For each observation of the predictor measurement(s) xi, i = 1,...,n there is an associated response measurement yi.  linear regression, logistic regression, boosting, support vec- regression (SVM) etc.  Majority of statistical models fall under “supervised mode”
  • 12. Supervised Vs. Unsupervised Learning Part 2 0f 3 Unsupervised learning  Unsupervised learning describes situation in which for every observation i = 1,...,n, we observe a vector of measurements xi but no associated response variable  No response variable to fit  Ex: Cluster analysis for customer segmentation
  • 15. Classification model use cases  Spam Filter  Google news classification  Cancel cell classification (Benign, Malignant)
  • 16. Machine learning process / Lab Ex: Titanic Data set in KDNuggets Lab: Titanic.R
  • 17. Assessing model accuracy / Quality of fit For regression model Numnber of test data elements Mean Squared error Actual value Predicted value
  • 18. Assessing model accuracy / Quality of fit For Classification models Predicted value Actual value Numnber of test data elements
  • 19. Top Machine learning algorithms and business use cases
  • 20. Decision trees Structured way to arrive at a logical conclusion Business use cases  Option pricing  Pattern recognition “R” library -> caret
  • 21. Naïve Bayes Classification Simple probabilistic classifiers (Baye’s theorem) Business use cases  Sentiment analysis (ex: FB analyses status updates)  Classify spam mails “R” library -> e1071
  • 22. Simple Linear Regression Business use cases  Predicting sales  Risk assessment “R” library -> stats
  • 23. Logistics Regression Modeling a binomial outcome with one or more explanatory variables  Measures the relationship between the categorical dependent variable and one or more independent variables Business use cases  Weather prediction / Credit scoring “R” library -> MASS
  • 24. Support Vector Machines (SVM) Support Vectors are co- ordinates of individual observation (ex: 45,150) SVMis a frontier which best segregates the Male from the Females “R” library -> e1071
  • 25. Random Forest When you can’t think of any algorithm use “Random Forest” “R” library -> randomForest
  • 26. Simple linear regression 1 of 3 Linear regression assumes that there is approximately a linear relationship between X and Y. Y ≈ β0 + β1X (regressing Y on X) (Ex) Sales ≈ β0 + β1 × TV Predicted variable SlopeY intercept
  • 27. Simple linear regression 2 of 3 Let Then additional $1,000 spent on TV advertising = approximately 47.5 additional units
  • 29. Accuracy of estimates (standard error) 1 of 2 A true relationship between Y & X takes the form Standard error  Standard error is introduced because model is calculated using “available data” (sample data)  Whole population data is not known during modeling and hence introduction of error
  • 30. Accuracy of estimates (standard error) 2 of 2 Standard errors can be used to compute confidence intervals For linear regression, the 95 % confidence interval for β1, β0 approximately takes the form: In the case of the advertising data, the 95 % confidence interval for β0 is [6.130, 7.935] and the 95 % confidence interval for β1 is [0.042, 0.053].
  • 31. Interpreting standard error in regression LAB Advertising (Summary output)
  • 32. Accuracy of the model  Residual Standard Error (RSE) is used to measure accuracy of the model  Roughly speaking, it is the average amount that the response will deviate from the true regression line.
  • 33. Interpreting RSE & For advertising data RSE = 3.26 i.e. 3,260 units difference in sales Average sales = 14,000 units %error = 3260/14000 = 23% indicates variability of “Y” explained using “X”
  • 34. ABOUT ME 25 years in Technology Industry LinkedIn Profile: https://www.linkedin.com/in/ratakondas/ Experience working for multiple early stage startups and leading global teams Current Principal Founder – PredixDATA (a analytics/bigdata service company) Board of managers – Syntilla (stealth startup)

Notes de l'éditeur

  1. Ex; Logistics regression SVM Naïve Bayes Classifier
  2. Ex; Logistics regression SVM Naïve Bayes Classifier
  3. Mainly classification but regression possible (regression trees)
  4. Classification problem
  5. Regression & class