SlideShare a Scribd company logo
1 of 24
Download to read offline
Machine Learning in
Python using Scikit-
Learn
Submitted by Amol Agrawal as proposal for
Workshop in PyCon India 2016
Machine Learning
“Statistics on Steroids”
Exploring and recognizing patterns using data
analysis through computer science and statistics.
Lots of computer science ! Even more statistics!
Scikit-Learn
“The Black Box”
A Python module for all your Machine Learning
needs.
Easy to understand, easier to implement.
0. Beginning -
What? Where? Why? How?
.
➔ What is Machine Learning?
➔ What are the aims of this
workshop?
➔ What this workshop is not!
➔ Setup and basic config.
1. Scikit - Learn -
The module, our savior.
.
➔ About the module.
➔ But why Scikit - Learn ??
➔ Dependencies.
➔ Some basic terminologies.
➔ Let’s begin !
2. Machine Learning -
Or How I Learned To Stop
Worrying and Love the DATA
.
➔ Introduction
➔ Core Terminologies
➔ Definitions.
➔ Usage.
Machine Learning can be
broadly categorized into
TWO
CATEGORIES.
Let’s look at them.
3. Supervised Learning -
Classification and
Regression Definition
Herein, we have labeled
data, i.e, for each X we
know what the value of
Y will be !
So for new unlabelled
data, we can use the
labelled data to make
some predictions. .
3.1 Classification
Nearest Neighbors and
Support Vector
Machines
Definition
The labels have discrete
values.
Eg:
Is Mail Spam or Not
Bug or Not
3.2 Regression -
Linear Regression and
Random Forests Definition
The labels lie in a
continuous range:
Eg.
Stock Market Prediction
Housing Price Prediction
4. Unsupervised Learning -
Dimensionality Reduction
and Clustering Definition
We have NO LABELS !
Patterns must be found
and labels must be
generated on their own.
Time to freak out? NO !
4.1 Dimensionality Reduction -
Principal Component Analysis
Definition
When the dataset has a
large number of
features, out of which
some maybe redundant,
to save computational
power… Reduce the
Dimensions !
Eg. Working on Images
or Text
4.2 Clustering -
KMeans
Definition:
Divide the dataset into
clusters based on the
feature set.
Eg.
Find prominent colors in
an Image.
Cluster Documents
5. Model Selection and
Validation -
What model to use
and Hypertuning
Definition
What model to use
when? Which algorithm
to use when?
How to tune the
parameters?
How to get the best
results?
6. Real Word Examples-
Showcase of working
examples of how
ML is being used.
Definition:
Some real world
examples of ML
implemented and
showcased.
7. Where to go from here ?-
How to incorporate ML in
your work
Words of Caution
Definition:
How to integrate ML into
your work.
Do’s and Dont’s
8. Workflow
How the workshop will be conducted.:
➔ Ipython-Notebooks
Code will be made available for each
aforementioned section through
Ipython-Notebooks which makes it
easy to follow along, tweak and run it
quickly.
Scikit - Learn library will be utilized
throughout to show how it’s inherently
implemented algorithms can be used.
➔ Exercises
The notebooks will be littered with
incomplete code blocks for the
participants to attempt and solve.
9. Requirements
➔ Basic knowledge of Python
10. Takeaway
➔ Machine Learning Concepts
➔ How to use Scikit - Learn
➔ How to create ML models
quickly using Scikit - Learn.
Thank You.

More Related Content

What's hot

Yellowbrick: Steering machine learning with visual transformers
Yellowbrick: Steering machine learning with visual transformersYellowbrick: Steering machine learning with visual transformers
Yellowbrick: Steering machine learning with visual transformers
Rebecca Bilbro
 
(Py)testing the Limits of Machine Learning
(Py)testing the Limits of Machine Learning(Py)testing the Limits of Machine Learning
(Py)testing the Limits of Machine Learning
Rebecca Bilbro
 

What's hot (20)

Scikit-Learn: Machine Learning in Python
Scikit-Learn: Machine Learning in PythonScikit-Learn: Machine Learning in Python
Scikit-Learn: Machine Learning in Python
 
Machine Learning Overview
Machine Learning OverviewMachine Learning Overview
Machine Learning Overview
 
Data Product Architectures
Data Product ArchitecturesData Product Architectures
Data Product Architectures
 
Yellowbrick: Steering machine learning with visual transformers
Yellowbrick: Steering machine learning with visual transformersYellowbrick: Steering machine learning with visual transformers
Yellowbrick: Steering machine learning with visual transformers
 
TensorFlow in 3 sentences
TensorFlow in 3 sentencesTensorFlow in 3 sentences
TensorFlow in 3 sentences
 
IRJET- Unabridged Review of Supervised Machine Learning Regression and Classi...
IRJET- Unabridged Review of Supervised Machine Learning Regression and Classi...IRJET- Unabridged Review of Supervised Machine Learning Regression and Classi...
IRJET- Unabridged Review of Supervised Machine Learning Regression and Classi...
 
Visualizing the model selection process
Visualizing the model selection processVisualizing the model selection process
Visualizing the model selection process
 
Using Deep Learning to Find Similar Dresses
Using Deep Learning to Find Similar DressesUsing Deep Learning to Find Similar Dresses
Using Deep Learning to Find Similar Dresses
 
Dynamics in graph analysis (PyData Carolinas 2016)
Dynamics in graph analysis (PyData Carolinas 2016)Dynamics in graph analysis (PyData Carolinas 2016)
Dynamics in graph analysis (PyData Carolinas 2016)
 
Ppt shuai
Ppt shuaiPpt shuai
Ppt shuai
 
Siamese networks
Siamese networksSiamese networks
Siamese networks
 
Neural networks and google tensor flow
Neural networks and google tensor flowNeural networks and google tensor flow
Neural networks and google tensor flow
 
(Py)testing the Limits of Machine Learning
(Py)testing the Limits of Machine Learning(Py)testing the Limits of Machine Learning
(Py)testing the Limits of Machine Learning
 
Clustering: A Scikit Learn Tutorial
Clustering: A Scikit Learn TutorialClustering: A Scikit Learn Tutorial
Clustering: A Scikit Learn Tutorial
 
Melanie Warrick, Deep Learning Engineer, Skymind.io at MLconf SF - 11/13/15
Melanie Warrick, Deep Learning Engineer, Skymind.io at MLconf SF - 11/13/15Melanie Warrick, Deep Learning Engineer, Skymind.io at MLconf SF - 11/13/15
Melanie Warrick, Deep Learning Engineer, Skymind.io at MLconf SF - 11/13/15
 
Europython - Machine Learning for dummies with Python
Europython - Machine Learning for dummies with PythonEuropython - Machine Learning for dummies with Python
Europython - Machine Learning for dummies with Python
 
Automated Machine Learning (Auto ML)
Automated Machine Learning (Auto ML)Automated Machine Learning (Auto ML)
Automated Machine Learning (Auto ML)
 
Unsupervised Learning: Clustering
Unsupervised Learning: Clustering Unsupervised Learning: Clustering
Unsupervised Learning: Clustering
 
Google Developer Groups Talk - TensorFlow
Google Developer Groups Talk - TensorFlowGoogle Developer Groups Talk - TensorFlow
Google Developer Groups Talk - TensorFlow
 
Escaping the Black Box
Escaping the Black BoxEscaping the Black Box
Escaping the Black Box
 

Similar to Introduction to Machine Learning in Python using Scikit-Learn

Building Powerful and Intelligent Applications with Azure Machine Learning
Building Powerful and Intelligent Applications with Azure Machine LearningBuilding Powerful and Intelligent Applications with Azure Machine Learning
Building Powerful and Intelligent Applications with Azure Machine Learning
David Walker, CSM,CSD,MCP,MCAD,MCSD,MVP
 
A Comprehensive Learning Path to Become a Data Science 2021.pptx
A Comprehensive Learning Path to Become a Data Science 2021.pptxA Comprehensive Learning Path to Become a Data Science 2021.pptx
A Comprehensive Learning Path to Become a Data Science 2021.pptx
RajSingh512965
 
Building Powerful and Intelligent Applications with Azure Machine Learning
Building Powerful and Intelligent Applications with Azure Machine LearningBuilding Powerful and Intelligent Applications with Azure Machine Learning
Building Powerful and Intelligent Applications with Azure Machine Learning
David Walker, CSM,CSD,MCP,MCAD,MCSD,MVP
 
Leveraging Open Source Automated Data Science Tools
Leveraging Open Source Automated Data Science ToolsLeveraging Open Source Automated Data Science Tools
Leveraging Open Source Automated Data Science Tools
Domino Data Lab
 
Topic based and structured authoring - slides
Topic based and structured authoring - slidesTopic based and structured authoring - slides
Topic based and structured authoring - slides
Neil Perlin
 

Similar to Introduction to Machine Learning in Python using Scikit-Learn (20)

Andrew NG machine learning
Andrew NG machine learningAndrew NG machine learning
Andrew NG machine learning
 
Building Powerful and Intelligent Applications with Azure Machine Learning
Building Powerful and Intelligent Applications with Azure Machine LearningBuilding Powerful and Intelligent Applications with Azure Machine Learning
Building Powerful and Intelligent Applications with Azure Machine Learning
 
A Comprehensive Learning Path to Become a Data Science 2021.pptx
A Comprehensive Learning Path to Become a Data Science 2021.pptxA Comprehensive Learning Path to Become a Data Science 2021.pptx
A Comprehensive Learning Path to Become a Data Science 2021.pptx
 
How to use Artificial Intelligence with Python? Edureka
How to use Artificial Intelligence with Python? EdurekaHow to use Artificial Intelligence with Python? Edureka
How to use Artificial Intelligence with Python? Edureka
 
Machine learning: A Walk Through School Exams
Machine learning: A Walk Through School ExamsMachine learning: A Walk Through School Exams
Machine learning: A Walk Through School Exams
 
Introduction To TensorFlow
Introduction To TensorFlowIntroduction To TensorFlow
Introduction To TensorFlow
 
Building Powerful and Intelligent Applications with Azure Machine Learning
Building Powerful and Intelligent Applications with Azure Machine LearningBuilding Powerful and Intelligent Applications with Azure Machine Learning
Building Powerful and Intelligent Applications with Azure Machine Learning
 
Afternoons with Azure - Azure Machine Learning
Afternoons with Azure - Azure Machine Learning Afternoons with Azure - Azure Machine Learning
Afternoons with Azure - Azure Machine Learning
 
Apple Machine Learning
Apple Machine LearningApple Machine Learning
Apple Machine Learning
 
How to implement artificial intelligence solutions
How to implement artificial intelligence solutionsHow to implement artificial intelligence solutions
How to implement artificial intelligence solutions
 
Leveraging Open Source Automated Data Science Tools
Leveraging Open Source Automated Data Science ToolsLeveraging Open Source Automated Data Science Tools
Leveraging Open Source Automated Data Science Tools
 
ML crash course
ML crash courseML crash course
ML crash course
 
Walk through of azure machine learning studio new features
Walk through of azure machine learning studio new featuresWalk through of azure machine learning studio new features
Walk through of azure machine learning studio new features
 
Artificial Intelligence with Python | Edureka
Artificial Intelligence with Python | EdurekaArtificial Intelligence with Python | Edureka
Artificial Intelligence with Python | Edureka
 
Topic based and structured authoring - slides
Topic based and structured authoring - slidesTopic based and structured authoring - slides
Topic based and structured authoring - slides
 
Topic based and structured authoring - slides
Topic based and structured authoring - slidesTopic based and structured authoring - slides
Topic based and structured authoring - slides
 
Python and data analytics
Python and data analyticsPython and data analytics
Python and data analytics
 
Machine Learning for Startups without PhDs
Machine Learning for Startups without PhDsMachine Learning for Startups without PhDs
Machine Learning for Startups without PhDs
 
Machine Learning for Startups without PhDs
Machine Learning for Startups without PhDsMachine Learning for Startups without PhDs
Machine Learning for Startups without PhDs
 
Tuning the Untunable - Insights on Deep Learning Optimization
Tuning the Untunable - Insights on Deep Learning OptimizationTuning the Untunable - Insights on Deep Learning Optimization
Tuning the Untunable - Insights on Deep Learning Optimization
 

Recently uploaded

1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
Chris Hunter
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
heathfieldcps1
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
kauryashika82
 

Recently uploaded (20)

1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural ResourcesEnergy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 

Introduction to Machine Learning in Python using Scikit-Learn

  • 1. Machine Learning in Python using Scikit- Learn Submitted by Amol Agrawal as proposal for Workshop in PyCon India 2016
  • 2. Machine Learning “Statistics on Steroids” Exploring and recognizing patterns using data analysis through computer science and statistics. Lots of computer science ! Even more statistics!
  • 3. Scikit-Learn “The Black Box” A Python module for all your Machine Learning needs. Easy to understand, easier to implement.
  • 4. 0. Beginning - What? Where? Why? How?
  • 5. . ➔ What is Machine Learning? ➔ What are the aims of this workshop? ➔ What this workshop is not! ➔ Setup and basic config.
  • 6. 1. Scikit - Learn - The module, our savior.
  • 7. . ➔ About the module. ➔ But why Scikit - Learn ?? ➔ Dependencies. ➔ Some basic terminologies. ➔ Let’s begin !
  • 8. 2. Machine Learning - Or How I Learned To Stop Worrying and Love the DATA
  • 9. . ➔ Introduction ➔ Core Terminologies ➔ Definitions. ➔ Usage.
  • 10. Machine Learning can be broadly categorized into TWO CATEGORIES. Let’s look at them.
  • 11.
  • 12. 3. Supervised Learning - Classification and Regression Definition Herein, we have labeled data, i.e, for each X we know what the value of Y will be ! So for new unlabelled data, we can use the labelled data to make some predictions. .
  • 13. 3.1 Classification Nearest Neighbors and Support Vector Machines Definition The labels have discrete values. Eg: Is Mail Spam or Not Bug or Not
  • 14. 3.2 Regression - Linear Regression and Random Forests Definition The labels lie in a continuous range: Eg. Stock Market Prediction Housing Price Prediction
  • 15. 4. Unsupervised Learning - Dimensionality Reduction and Clustering Definition We have NO LABELS ! Patterns must be found and labels must be generated on their own. Time to freak out? NO !
  • 16. 4.1 Dimensionality Reduction - Principal Component Analysis Definition When the dataset has a large number of features, out of which some maybe redundant, to save computational power… Reduce the Dimensions ! Eg. Working on Images or Text
  • 17. 4.2 Clustering - KMeans Definition: Divide the dataset into clusters based on the feature set. Eg. Find prominent colors in an Image. Cluster Documents
  • 18. 5. Model Selection and Validation - What model to use and Hypertuning Definition What model to use when? Which algorithm to use when? How to tune the parameters? How to get the best results?
  • 19. 6. Real Word Examples- Showcase of working examples of how ML is being used. Definition: Some real world examples of ML implemented and showcased.
  • 20. 7. Where to go from here ?- How to incorporate ML in your work Words of Caution Definition: How to integrate ML into your work. Do’s and Dont’s
  • 21. 8. Workflow How the workshop will be conducted.: ➔ Ipython-Notebooks Code will be made available for each aforementioned section through Ipython-Notebooks which makes it easy to follow along, tweak and run it quickly. Scikit - Learn library will be utilized throughout to show how it’s inherently implemented algorithms can be used. ➔ Exercises The notebooks will be littered with incomplete code blocks for the participants to attempt and solve.
  • 22. 9. Requirements ➔ Basic knowledge of Python
  • 23. 10. Takeaway ➔ Machine Learning Concepts ➔ How to use Scikit - Learn ➔ How to create ML models quickly using Scikit - Learn.