Data science is one of the most sought-after career options globally right now, with data scientists earning an average of 15 to 18 lakh annually. This deck explains the fundamentals of data science and the role of a data scientist.
The deck also introduces the Certificate Masterclass in Data Science with Python by Spotle Learn. The course is designed by experts for people who want to build a career in data science. It teaches the fundamental knowledge and practical skills that data science careers require, through videos, live projects, interactive classes and integrated internships.
Introduction To Data Science With Python
1. Introduction to Data Science with Python
Connecting … in Analytics
Spotle.ai Study Material
Spotle.ai/Learn
2. Instructors
Mousum Dutta, Chief Data Scientist, Spotle.ai; IIT Kharagpur
Dr. Sourish Das, Assistant Professor, Chennai Mathematical Institute
Dr. Subhas Kulkarni
Distinguished Data Scientists
3. All around us, data is exploding…
Over 2.5 quintillion bytes of data are created every single day.
Source: domo.com
5. Every day your digital traces are being mined to…
• Recommend your next buy
• Fight terrorism
• Win matches
• Predict cyclones
• Prevent fraud
• Solve homelessness
• Invent the next superdrug
6. And data scientists are leading the expedition
• Fight terrorism
• Invent the next superdrug
• Solve homelessness
• Predict cyclones
• Win matches
• Recommend your next buy
Demand for data scientists/engineers to grow 40% by 2020
Source: US department of statistics
7. What is Data Science?
[Figure: Venn diagram. Labels: Domain Expertise; Statistics; Mathematics; Computer Science; Machine Learning; Traditional Software; Data Analysis; Data Science]
"Data science is a multi-disciplinary field that uses scientific methods, processes, algorithms and systems to extract knowledge and insights from data in various forms, both structured and unstructured, similar to data mining."
- Wikipedia
8. Data Science interconnects disconnected processes to build automated analytical systems
Data Collection
• Usually happens outside of the analytical system
• Results of primary and secondary research/customer data
Extract, Transform and Load (ETL)
• Extracting the data from various sources and converting it into machine-readable formats
• Implemented by BI programmers
Core Analytics
• The statistician creates new models based on the data, or fits existing models to it, to derive insights
Reporting
• Reports are generated from the data, summarising past performance and forecasting future performance
• This step is delivered by business analysts/domain experts
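The four stages above can be chained into one small automated pipeline. The following is a minimal sketch using only the Python standard library; the CSV text, the column names and the function names are all hypothetical, invented for illustration, and the "analytics" stage is reduced to a simple per-region total.

```python
import csv
import io
import sqlite3

# Hypothetical raw export, standing in for data collected outside the system.
RAW_CSV = """customer,region,amount
Asha,East,120.50
Ravi,West,80
Meera,East,99.50
"""

def extract(text):
    """Extract: parse the raw CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text)))

def transform(rows):
    """Transform: convert the amount column from text to float."""
    return [(r["customer"], r["region"], float(r["amount"])) for r in rows]

def load(records):
    """Load: store the cleaned records in an in-memory SQLite table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE sales (customer TEXT, region TEXT, amount REAL)")
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", records)
    return conn

def report(conn):
    """Reporting: total sales per region, sorted by region name."""
    query = "SELECT region, SUM(amount) FROM sales GROUP BY region ORDER BY region"
    return dict(conn.execute(query).fetchall())

conn = load(transform(extract(RAW_CSV)))
totals = report(conn)
print(totals)
```

Each stage is a separate function so that the hand-offs between roles on the slide become explicit function calls.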
9. Data Scientist – Skillsets
A better programmer than a statistician
A better statistician than a programmer
Master integrator
• Statistics
• Analytics
• Programming (Python, R)
• Business Analysis/Domain Expertise
• Data Mining
• Artificial Intelligence/Machine Learning
10. Data Science: Core Disciplines and Techniques
11. Statistical decision making
Hypothesis Testing: Testing Alternative Hypotheses
Is it true that vitamin C has the ability to cure or prevent the common cold? Or is it just a myth?
Analysis of Variance: Comparing Multiple Options
Buying a new mobile phone but not sure how it stacks up against the alternatives?
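Both questions above can be sketched in a few lines of standard-library Python. This is an illustration under stated assumptions, not a definitive analysis: all the figures are invented, the hypothesis test is done as a permutation test (one of several possible tests), and the function names are hypothetical. The one-way ANOVA F statistic is computed directly from its definition, the ratio of between-group to within-group variance.

```python
import random
from statistics import mean

def permutation_p_value(a, b, n_rounds=2000, seed=0):
    """Two-sided permutation test for a difference in group means."""
    rng = random.Random(seed)
    observed = abs(mean(a) - mean(b))
    pooled = list(a) + list(b)
    hits = 0
    for _ in range(n_rounds):
        rng.shuffle(pooled)  # reassign group labels at random
        diff = abs(mean(pooled[:len(a)]) - mean(pooled[len(a):]))
        if diff >= observed:
            hits += 1
    return hits / n_rounds

def anova_f(groups):
    """One-way ANOVA F statistic: between-group over within-group variance."""
    all_values = [x for g in groups for x in g]
    grand_mean = mean(all_values)
    k, n = len(groups), len(all_values)
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)
    return (ss_between / (k - 1)) / (ss_within / (n - k))

# Hypothetical days of cold symptoms for two groups (invented data).
vitamin_c = [5, 6, 4, 7, 5, 6, 5, 4]
placebo = [6, 7, 5, 8, 6, 7, 6, 7]
print("p-value:", permutation_p_value(vitamin_c, placebo))

# Hypothetical battery life in hours for three phones (invented data).
phones = [[10, 11, 12], [11, 12, 13], [14, 15, 16]]
print("F statistic:", anova_f(phones))
```

A small p-value suggests the difference between the groups is unlikely to be due to chance alone; a large F statistic suggests that at least one option differs from the others.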
12. Predictive modelling
The science of analysing current and historical facts to predict future or otherwise unknown events.
For example, estimating the mileage of a car at the prototype design phase.
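As a minimal sketch of the mileage example, the code below fits a straight line by ordinary least squares to a handful of invented (weight, mileage) pairs and uses it to estimate the mileage of a new design. The data, the choice of weight as the only predictor, and the function names are assumptions made for illustration; a real model would use more features and more data.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope

def predict(x, intercept, slope):
    """Estimate y for a new x from the fitted line."""
    return intercept + slope * x

# Hypothetical historical cars: kerb weight (kg) and mileage (km/l).
weights = [900, 1100, 1300, 1500]
mileages = [22, 19, 17, 14]

intercept, slope = fit_line(weights, mileages)

# Estimated mileage for a hypothetical 1200 kg prototype.
print(predict(1200, intercept, slope))
```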
13. Exploratory analysis
Analysing data sets to find their key characteristics.
For example, it is used to identify, from an existing list of customers, the group most likely to buy an add-on product. This forms the basis of customer segmentation in market research.
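A first exploratory pass often just summarises the data by group. The sketch below, which assumes an invented customer list with hypothetical fields (plan, monthly spend, whether the add-on was bought), computes the size, average spend and add-on purchase rate of each plan.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical customers: (plan, monthly spend, bought the add-on?).
CUSTOMERS = [
    ("basic", 20, False),
    ("basic", 25, False),
    ("basic", 30, True),
    ("premium", 80, True),
    ("premium", 90, True),
    ("premium", 70, False),
    ("premium", 100, True),
]

def summarise(customers):
    """Per-plan count, mean spend and add-on purchase rate."""
    by_plan = defaultdict(list)
    for plan, spend, bought in customers:
        by_plan[plan].append((spend, bought))
    summary = {}
    for plan, rows in by_plan.items():
        summary[plan] = {
            "count": len(rows),
            "mean_spend": mean(spend for spend, _ in rows),
            "addon_rate": sum(bought for _, bought in rows) / len(rows),
        }
    return summary

summary = summarise(CUSTOMERS)
for plan, stats in summary.items():
    print(plan, stats)
```

In this invented data the premium plan has the higher add-on rate, so it would be the natural segment to target first.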
14. Classification analysis
Part of predictive modelling, this technique uses a training data set to learn how to assign data to groups of similar items.
Here the target outputs are classes.
For example, it is used to determine whether a new email is spam or not.
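The spam example can be sketched with a tiny naive Bayes classifier written in plain Python. Naive Bayes is one of many classification techniques and is chosen here only because it is short; the four training emails, the labels and the function names are invented for illustration.

```python
import math
from collections import Counter

# Hypothetical labelled training emails: (text, class).
TRAINING = [
    ("win a free prize now", "spam"),
    ("free offer click now", "spam"),
    ("meeting agenda for monday", "ham"),
    ("project report attached", "ham"),
]

def train(examples):
    """Count words per class and how many emails each class has."""
    word_counts = {}
    class_counts = Counter()
    for text, label in examples:
        class_counts[label] += 1
        word_counts.setdefault(label, Counter()).update(text.split())
    return word_counts, class_counts

def classify(text, word_counts, class_counts):
    """Return the class with the highest naive Bayes log-probability."""
    vocab = {w for counts in word_counts.values() for w in counts}
    total_emails = sum(class_counts.values())
    scores = {}
    for label, counts in word_counts.items():
        score = math.log(class_counts[label] / total_emails)  # class prior
        n_words = sum(counts.values())
        for word in text.split():
            # Add-one smoothing avoids log(0) for words unseen in this class.
            score += math.log((counts[word] + 1) / (n_words + len(vocab)))
        scores[label] = score
    return max(scores, key=scores.get)

word_counts, class_counts = train(TRAINING)
print(classify("free prize offer", word_counts, class_counts))
print(classify("monday project meeting", word_counts, class_counts))
```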
15. Time-series analysis
Time series analysis is the technique of analysing trends across successive periods.
For example, time series analysis is used to analyse sales trends over previous quarters and to forecast sales for the next quarter.
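The quarterly sales example can be sketched as follows. The sales figures are invented, and the forecasting rule (last value plus the average quarter-on-quarter change) is a deliberately naive assumption chosen for brevity; it ignores seasonality and is not a recommended production method.

```python
def moving_average(series, window):
    """Smooth a series by averaging each run of `window` consecutive values."""
    return [
        sum(series[i:i + window]) / window
        for i in range(len(series) - window + 1)
    ]

def forecast_next(series):
    """Naive forecast: last value plus the average period-on-period change."""
    changes = [later - earlier for earlier, later in zip(series, series[1:])]
    return series[-1] + sum(changes) / len(changes)

# Hypothetical sales for six consecutive quarters (invented data).
quarterly_sales = [100, 110, 120, 130, 140, 150]

print(moving_average(quarterly_sales, window=3))
print(forecast_next(quarterly_sales))
```

The moving average shows the underlying trend, and the forecast extends that trend by one quarter.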
17. The case for Python in Data Science
✓ High usability
✓ Flexibility – integrate your data applications into websites
✓ Readable and simple – easier to learn, code and debug
✓ Availability of powerful data analysis and visualization packages
✓ Bundle your analysis in one file with the IPython notebook
Python Libraries For Data Science
• NumPy: used for scientific computing
• Pandas: used to manipulate data
• scikit-learn: used in machine learning
• Matplotlib: used to make graphics
• Statsmodels: used to explore data and estimate statistical models
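To give a flavour of two of these libraries, here is a minimal sketch that assumes NumPy and pandas are installed. The sales figures and region labels are invented for illustration.

```python
import numpy as np
import pandas as pd

# Hypothetical quarterly sales figures, invented for illustration.
sales = np.array([120.0, 135.0, 150.0, 165.0])

# NumPy: vectorised arithmetic over the whole array at once.
growth = np.diff(sales) / sales[:-1]

# Pandas: labelled, tabular data with grouping and aggregation.
df = pd.DataFrame({
    "region": ["East", "West", "East", "West"],
    "sales": sales,
})
by_region = df.groupby("region")["sales"].sum()

print(growth.round(3))
print(by_region)
```

The same growth calculation in plain Python would need an explicit loop; NumPy expresses it as a single array operation.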
18. Certificate Masterclass in Data Science with Python – What will you learn?
✓ Understand different data analysis problems and algorithms.
✓ Get a structured approach to converting high-level data analytics problem statements into well-defined solution workflows.
✓ Learn how to break a problem statement down into smaller components and solve each with an appropriate algorithm.
✓ Learn how to execute data science projects in Python, one of the most popular languages for data science right now.
✓ Understand the basics:
• Linear algebra, which is critical for data science algorithms.
• Fundamentals of statistics, the backbone of data science.