SlideShare a Scribd company logo
1 of 39
Data Science
Unit 1
By : Professor Lili Saghafi
https://professorlilisaghafiquantumcomputing.wor
dpress.com
proflilisaghafi@gmail.com
https://sites.google.com/site/professorlilisaghafi/home
@Lili_PLS
Introduction
• The course is designed as an introduction
to programming and statistics for students
from many different majors.
• It teaches practical techniques that apply
across many disciplines and also serves
as the technical foundation for more
advanced courses in data
science, statistics, and computer science.
Programming Prerequisite
• No prior programming experience is necessary,
but many of the programming techniques
covered in this course do not appear in a typical
introduction to programming.
• The programming content of this course focuses
on manipulating data tables, rather than
building software applications.
• Students who take the course after taking other
programming courses often learn a new
approach to programming that they haven't
encountered before.
Statistic Prerequisite
• No prior statistics experience is necessary, but
many of the statistical inference
techniques covered in this course do not appear
in an undergraduate statistics curriculum.
• The techniques in this course rely heavily on
sampling and simulation, and they require
computers to carry out.
• Students who have taken statistics courses
before often learn new methods to complement
what they already know.
Understanding problem domains
Prerequisite
• Data science is more than just a combination
of programming and statistics.
• Effective data science requires understanding
problem domains and correctly interpreting
domain-specific approaches.
• The examples in this course are largely drawn
from real-world data sets, and one of the main
goals of this course is to develop the ability to
apply analysis and prediction techniques to real-
world scenarios.
NO Prerequisite
• This course is designed specifically
for those who have not previously
taken statistics or computer science
courses.
Equipment and Supplies
• A computer
• R Studio (https://cran.r-project.org/ )
• Math Player
• NVDA reader
• SAS or Python
• MS Azure
• A browser that supports Jupyter (Project Jupyter exists to develop
open-source software, open-standards, and services for interactive
computing across dozens of programming languages).
https://jupyter.org/
• Jupyter notebooks to complete lab assignments.
• We highly recommend using Google Chrome to complete Jupyter
notebook lab assignments. https://jupyter.org/
Using Jupyter Notebooks on
Microsoft Azure
• https://notebooks.azure.com
• an overview of using Jupyter Notebooks
with Python 3.
• For further information on Jupyter
Notebooks see the documentation
at http://Jupyter.org.
R
Jupyter notebook
• Jupyter notebook, and it's not running on your computer. Instead,
Google has generously donated compute cloud credits so that we
can run your code on Google's machines in order to execute
whatever examples you want, including all of the labs for the
course.So, thanks, Google!
• You'll learn about how to use this Jupyter environment in the labs.
https://jupyter.org/try OR https://jupyter.org/install.html
• For now, all you need to know is that you can run whatever
examples you want by clicking on a cell, holding down shift, and
pressing return or enter. So in this case, we told the computer to add
two and two together and that made four.
• Now the examples are going to get a lot more interesting soon, and
you'll learn how to use this environment, which is one of the most
popular environments for data science work out there in the world
today.
Jupyter notebook
• Thanks to Google's support, all of the
software relevant to the course is already
pre-installed on their systems that you
have access to, so you can start working
on examples right without having to install
anything. https://jupyter.org/try
Azure Machine Learning Studio
Azure AI Gallery.
Recommended Text
• Analytics, Data Science, &
Artificial Intelligence: Systems
for Decision Support, 11/E
– Ramesh Sharda
– Dursun Delen, Oklahoma
State University
– Efraim Turban, Oklahoma
State University
– ISBN-10: 0135192013 • ISBN-
13: 9780135192016
– ©2020 • Pearson • Cloth, 832
pp
– Published 02/11/2019 •
– Reading this text is not
required but it is helpful for
mastering the course material.
Why Data science?
• It's about taking large data sets and trying
to make them useful or
informative,especially for understanding
the world or making informed decisions.
• We need to use ideas from computing,
ideas from statistics and also domain
knowledge that informs what the data
really represents.
Domain knowledge
• You can't do an analysis in the legal
domain without understanding something
about the law, so that's what we mean by
domain knowledge.
• It's that you really have to understand
when you have a data set, some big table
of numbers and descriptions, what's really
going on behind those numbers and what
they represent about the world.
So what is data science?
• What do you get when you combine
computing and statistics and domain
knowledge together?
• You get a science that's about drawing
useful conclusions from data using
computation as our primary tool.
Data science,
as a practice,
has three core
activities.
1-Exploration
• Exploration is figuring out what patterns exist in
the data.
• When you have many observations about some
phenomenon, what can you conclude about the
phenomenon itself?
• Instead of just looking at large tables of
numbers, we'll draw data visualizations because
it's much easier to interpret lot of information at
once if it's portrayed in some kind of visualway.
2-Statistical Inference
• Once we've found a pattern, we need to perform
statistical inference, and that's because some
patterns are there just by chance and some are
there because they're a reflection of some
underlying process that's really interesting about
the world.
• The goal of statistical inference is to quantify
whether the patterns that we observe during
the exploration phase are reliable.
• If we collected more data, would we see this
pattern again or not?
Randomization
• The primary tool we have is
randomization because by simulating
random processes, we can see what kinds
of patterns appear just by chance.
3-Prediction
• And if the pattern we observe is not the kind of
thing that could just appear by chance, then we
can conclude that it's because of some robust or
reliable pattern in the underlying phenomenon
we want to study.
• We'll perform prediction.
• This is where we have partial information about
something we want to know, and we want to
guess about the things we don't know yet.
Machine Learning
• We are making informed guesses, quantitative
guesses using a discipline called machine
learning.
• Normally when we write programs, we just focus
on the particular logic of what the computer
should do, but machine learning is about not
programming every detail, but instead using
the data to make decisions or choice within
that program.
A form of prediction
• So when we write a program, for instance, to
recognize speech or automatically translate
languages or control a car or a robot, we don't
actually write down all the details of what to do,
but instead use examples from the world to help
computers automatically learn how to behave.
• And that's a form of prediction, one that we'll
talk about in this course.
Three stages in this course
• And these three stages correspond to
how we'll approach the material in this
course.
1. First talk about how to identify patterns,
2. then we'll talk about quantifying whether
those patterns are reliable.
3. And finally, based on the patterns we've
discovered, the reliable ones can help us
make informed guesses about the
information that we wish we knew.
On the way to become a Data
scientist
• Once you can do all that, you're well on
your way to being a data scientist.
• Now in the process of doing all these
things, it's important that you learn how to
program a computer, because computing
underlies each step of the way and
learning to program is just an essential
part of participating in this discipline.
Data Science
Thanks
Professor Lili Saghafi
proflilisaghafi@gmail.com
https://sites.google.com/site/professorlilisaghafi/home
@Lili_PLS

More Related Content

What's hot

End-to-End Machine Learning Project
End-to-End Machine Learning ProjectEnd-to-End Machine Learning Project
End-to-End Machine Learning ProjectEng Teong Cheah
 
Top 10 Data Science Practitioner Pitfalls
Top 10 Data Science Practitioner PitfallsTop 10 Data Science Practitioner Pitfalls
Top 10 Data Science Practitioner PitfallsSri Ambati
 
Machine learning and big data
Machine learning and big dataMachine learning and big data
Machine learning and big dataPoo Kuan Hoong
 
CRISP-DM - Agile Approach To Data Mining Projects
CRISP-DM - Agile Approach To Data Mining ProjectsCRISP-DM - Agile Approach To Data Mining Projects
CRISP-DM - Agile Approach To Data Mining ProjectsMichał Łopuszyński
 
Aaa ped-1- Python: Introduction to AI, Python and Colab
Aaa ped-1- Python: Introduction to AI, Python and ColabAaa ped-1- Python: Introduction to AI, Python and Colab
Aaa ped-1- Python: Introduction to AI, Python and ColabAminaRepo
 
Barga ACM DEBS 2013 Keynote
Barga ACM DEBS 2013 KeynoteBarga ACM DEBS 2013 Keynote
Barga ACM DEBS 2013 KeynoteRoger Barga
 
Machine Learning 101 | Essential Tools for Machine Learning
Machine Learning 101 | Essential Tools for Machine LearningMachine Learning 101 | Essential Tools for Machine Learning
Machine Learning 101 | Essential Tools for Machine LearningHafiz Muhammad Attaullah
 
H2O World - Top 10 Data Science Pitfalls - Mark Landry
H2O World - Top 10 Data Science Pitfalls - Mark LandryH2O World - Top 10 Data Science Pitfalls - Mark Landry
H2O World - Top 10 Data Science Pitfalls - Mark LandrySri Ambati
 
From Raw Data to Deployed Product. Fast & Agile with CRISP-DM
From Raw Data to Deployed Product. Fast & Agile with CRISP-DMFrom Raw Data to Deployed Product. Fast & Agile with CRISP-DM
From Raw Data to Deployed Product. Fast & Agile with CRISP-DMMichał Łopuszyński
 
A Friendly Introduction to Machine Learning
A Friendly Introduction to Machine LearningA Friendly Introduction to Machine Learning
A Friendly Introduction to Machine LearningHaptik
 
Machine learning, Machine learning training bootcamp
Machine learning, Machine learning training bootcampMachine learning, Machine learning training bootcamp
Machine learning, Machine learning training bootcampTonex
 
Data Driven Engineering 2014
Data Driven Engineering 2014Data Driven Engineering 2014
Data Driven Engineering 2014Roger Barga
 
Correlation does not mean causation
Correlation does not mean causationCorrelation does not mean causation
Correlation does not mean causationPeter Varhol
 
Barga Data Science lecture 1
Barga Data Science lecture 1Barga Data Science lecture 1
Barga Data Science lecture 1Roger Barga
 
Artificial Intelligence with Python | Edureka
Artificial Intelligence with Python | EdurekaArtificial Intelligence with Python | Edureka
Artificial Intelligence with Python | EdurekaEdureka!
 
What Is Data Science? | Introduction to Data Science | Data Science For Begin...
What Is Data Science? | Introduction to Data Science | Data Science For Begin...What Is Data Science? | Introduction to Data Science | Data Science For Begin...
What Is Data Science? | Introduction to Data Science | Data Science For Begin...Simplilearn
 
Automating fetal heart monitor using machine learning
Automating fetal heart monitor using machine learningAutomating fetal heart monitor using machine learning
Automating fetal heart monitor using machine learningTamjid Rayhan
 

What's hot (20)

End-to-End Machine Learning Project
End-to-End Machine Learning ProjectEnd-to-End Machine Learning Project
End-to-End Machine Learning Project
 
Top 10 Data Science Practitioner Pitfalls
Top 10 Data Science Practitioner PitfallsTop 10 Data Science Practitioner Pitfalls
Top 10 Data Science Practitioner Pitfalls
 
Machine learning and big data
Machine learning and big dataMachine learning and big data
Machine learning and big data
 
Machine Learning Algorithms (Part 1)
Machine Learning Algorithms (Part 1)Machine Learning Algorithms (Part 1)
Machine Learning Algorithms (Part 1)
 
CRISP-DM - Agile Approach To Data Mining Projects
CRISP-DM - Agile Approach To Data Mining ProjectsCRISP-DM - Agile Approach To Data Mining Projects
CRISP-DM - Agile Approach To Data Mining Projects
 
Managing machine learning
Managing machine learningManaging machine learning
Managing machine learning
 
Aaa ped-1- Python: Introduction to AI, Python and Colab
Aaa ped-1- Python: Introduction to AI, Python and ColabAaa ped-1- Python: Introduction to AI, Python and Colab
Aaa ped-1- Python: Introduction to AI, Python and Colab
 
Barga ACM DEBS 2013 Keynote
Barga ACM DEBS 2013 KeynoteBarga ACM DEBS 2013 Keynote
Barga ACM DEBS 2013 Keynote
 
Machine Learning 101 | Essential Tools for Machine Learning
Machine Learning 101 | Essential Tools for Machine LearningMachine Learning 101 | Essential Tools for Machine Learning
Machine Learning 101 | Essential Tools for Machine Learning
 
H2O World - Top 10 Data Science Pitfalls - Mark Landry
H2O World - Top 10 Data Science Pitfalls - Mark LandryH2O World - Top 10 Data Science Pitfalls - Mark Landry
H2O World - Top 10 Data Science Pitfalls - Mark Landry
 
From Raw Data to Deployed Product. Fast & Agile with CRISP-DM
From Raw Data to Deployed Product. Fast & Agile with CRISP-DMFrom Raw Data to Deployed Product. Fast & Agile with CRISP-DM
From Raw Data to Deployed Product. Fast & Agile with CRISP-DM
 
A Friendly Introduction to Machine Learning
A Friendly Introduction to Machine LearningA Friendly Introduction to Machine Learning
A Friendly Introduction to Machine Learning
 
Software Testing
Software TestingSoftware Testing
Software Testing
 
Machine learning, Machine learning training bootcamp
Machine learning, Machine learning training bootcampMachine learning, Machine learning training bootcamp
Machine learning, Machine learning training bootcamp
 
Data Driven Engineering 2014
Data Driven Engineering 2014Data Driven Engineering 2014
Data Driven Engineering 2014
 
Correlation does not mean causation
Correlation does not mean causationCorrelation does not mean causation
Correlation does not mean causation
 
Barga Data Science lecture 1
Barga Data Science lecture 1Barga Data Science lecture 1
Barga Data Science lecture 1
 
Artificial Intelligence with Python | Edureka
Artificial Intelligence with Python | EdurekaArtificial Intelligence with Python | Edureka
Artificial Intelligence with Python | Edureka
 
What Is Data Science? | Introduction to Data Science | Data Science For Begin...
What Is Data Science? | Introduction to Data Science | Data Science For Begin...What Is Data Science? | Introduction to Data Science | Data Science For Begin...
What Is Data Science? | Introduction to Data Science | Data Science For Begin...
 
Automating fetal heart monitor using machine learning
Automating fetal heart monitor using machine learningAutomating fetal heart monitor using machine learning
Automating fetal heart monitor using machine learning
 

Similar to Data science unit 1 By: Professor Lili Saghafi

Afternoons with Azure - Azure Machine Learning
Afternoons with Azure - Azure Machine Learning Afternoons with Azure - Azure Machine Learning
Afternoons with Azure - Azure Machine Learning CCG
 
Using machine learning to try and predict taxi availability by Narahari Allam...
Using machine learning to try and predict taxi availability by Narahari Allam...Using machine learning to try and predict taxi availability by Narahari Allam...
Using machine learning to try and predict taxi availability by Narahari Allam...PYCON MY PLT
 
Deep learning with tensorflow
Deep learning with tensorflowDeep learning with tensorflow
Deep learning with tensorflowCharmi Chokshi
 
Machine learning basics by akanksha bali
Machine learning basics by akanksha baliMachine learning basics by akanksha bali
Machine learning basics by akanksha baliAkanksha Bali
 
Machine learning basics
Machine learning basics Machine learning basics
Machine learning basics Akanksha Bali
 
Teaching the cloud to think
Teaching the cloud to thinkTeaching the cloud to think
Teaching the cloud to thinkJosh Gillespie
 
IPT Tools 2
IPT Tools 2IPT Tools 2
IPT Tools 2MR Z
 
From SQL to Python - A Beginner's Guide to Making the Switch
From SQL to Python - A Beginner's Guide to Making the SwitchFrom SQL to Python - A Beginner's Guide to Making the Switch
From SQL to Python - A Beginner's Guide to Making the SwitchRachel Berryman
 
"Startups, comment gérer une équipe de développeurs" par Laurent Cerveau
"Startups, comment gérer une équipe de développeurs" par Laurent Cerveau"Startups, comment gérer une équipe de développeurs" par Laurent Cerveau
"Startups, comment gérer une équipe de développeurs" par Laurent CerveauTheFamily
 
Introduction to Machine learning
Introduction to Machine learningIntroduction to Machine learning
Introduction to Machine learningKnoldus Inc.
 
Data Science.pptx NEW COURICUUMN IN DATA
Data Science.pptx NEW COURICUUMN IN DATAData Science.pptx NEW COURICUUMN IN DATA
Data Science.pptx NEW COURICUUMN IN DATAjaved75
 
MACHINE LEARNING PRESENTATION (ARTIFICIAL INTELLIGENCE)
MACHINE LEARNING PRESENTATION (ARTIFICIAL INTELLIGENCE)MACHINE LEARNING PRESENTATION (ARTIFICIAL INTELLIGENCE)
MACHINE LEARNING PRESENTATION (ARTIFICIAL INTELLIGENCE)MAHIRA
 
Five Things I Learned While Building Anomaly Detection Tools - Toufic Boubez ...
Five Things I Learned While Building Anomaly Detection Tools - Toufic Boubez ...Five Things I Learned While Building Anomaly Detection Tools - Toufic Boubez ...
Five Things I Learned While Building Anomaly Detection Tools - Toufic Boubez ...tboubez
 
Learn to Code with MIT App Inventor ( PDFDrive ).pdf
Learn to Code with MIT App Inventor ( PDFDrive ).pdfLearn to Code with MIT App Inventor ( PDFDrive ).pdf
Learn to Code with MIT App Inventor ( PDFDrive ).pdfNemoPalleschi
 
Andrew NG machine learning
Andrew NG machine learningAndrew NG machine learning
Andrew NG machine learningShareDocView.com
 
Unit 1-ML (1) (1).pptx
Unit 1-ML (1) (1).pptxUnit 1-ML (1) (1).pptx
Unit 1-ML (1) (1).pptxChitrachitrap
 

Similar to Data science unit 1 By: Professor Lili Saghafi (20)

Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Afternoons with Azure - Azure Machine Learning
Afternoons with Azure - Azure Machine Learning Afternoons with Azure - Azure Machine Learning
Afternoons with Azure - Azure Machine Learning
 
Using machine learning to try and predict taxi availability by Narahari Allam...
Using machine learning to try and predict taxi availability by Narahari Allam...Using machine learning to try and predict taxi availability by Narahari Allam...
Using machine learning to try and predict taxi availability by Narahari Allam...
 
Deep learning with tensorflow
Deep learning with tensorflowDeep learning with tensorflow
Deep learning with tensorflow
 
Machine learning basics by akanksha bali
Machine learning basics by akanksha baliMachine learning basics by akanksha bali
Machine learning basics by akanksha bali
 
Machine learning basics
Machine learning basics Machine learning basics
Machine learning basics
 
Teaching the cloud to think
Teaching the cloud to thinkTeaching the cloud to think
Teaching the cloud to think
 
IPT Tools 2
IPT Tools 2IPT Tools 2
IPT Tools 2
 
From SQL to Python - A Beginner's Guide to Making the Switch
From SQL to Python - A Beginner's Guide to Making the SwitchFrom SQL to Python - A Beginner's Guide to Making the Switch
From SQL to Python - A Beginner's Guide to Making the Switch
 
tensorflow.pptx
tensorflow.pptxtensorflow.pptx
tensorflow.pptx
 
"Startups, comment gérer une équipe de développeurs" par Laurent Cerveau
"Startups, comment gérer une équipe de développeurs" par Laurent Cerveau"Startups, comment gérer une équipe de développeurs" par Laurent Cerveau
"Startups, comment gérer une équipe de développeurs" par Laurent Cerveau
 
Introduction to Machine learning
Introduction to Machine learningIntroduction to Machine learning
Introduction to Machine learning
 
Data Science.pptx NEW COURICUUMN IN DATA
Data Science.pptx NEW COURICUUMN IN DATAData Science.pptx NEW COURICUUMN IN DATA
Data Science.pptx NEW COURICUUMN IN DATA
 
Unit no_1.pptx
Unit no_1.pptxUnit no_1.pptx
Unit no_1.pptx
 
MACHINE LEARNING PRESENTATION (ARTIFICIAL INTELLIGENCE)
MACHINE LEARNING PRESENTATION (ARTIFICIAL INTELLIGENCE)MACHINE LEARNING PRESENTATION (ARTIFICIAL INTELLIGENCE)
MACHINE LEARNING PRESENTATION (ARTIFICIAL INTELLIGENCE)
 
Five Things I Learned While Building Anomaly Detection Tools - Toufic Boubez ...
Five Things I Learned While Building Anomaly Detection Tools - Toufic Boubez ...Five Things I Learned While Building Anomaly Detection Tools - Toufic Boubez ...
Five Things I Learned While Building Anomaly Detection Tools - Toufic Boubez ...
 
Binary crosswords
Binary crosswordsBinary crosswords
Binary crosswords
 
Learn to Code with MIT App Inventor ( PDFDrive ).pdf
Learn to Code with MIT App Inventor ( PDFDrive ).pdfLearn to Code with MIT App Inventor ( PDFDrive ).pdf
Learn to Code with MIT App Inventor ( PDFDrive ).pdf
 
Andrew NG machine learning
Andrew NG machine learningAndrew NG machine learning
Andrew NG machine learning
 
Unit 1-ML (1) (1).pptx
Unit 1-ML (1) (1).pptxUnit 1-ML (1) (1).pptx
Unit 1-ML (1) (1).pptx
 

More from Professor Lili Saghafi

Artificial Intelligence and the importance of Data, By : Prof. Lili Saghafi
Artificial Intelligence and the importance of Data,  By : Prof. Lili SaghafiArtificial Intelligence and the importance of Data,  By : Prof. Lili Saghafi
Artificial Intelligence and the importance of Data, By : Prof. Lili SaghafiProfessor Lili Saghafi
 
Introduction to Quantum Computing Lecture 1: Fundamentals
Introduction to Quantum Computing Lecture 1: FundamentalsIntroduction to Quantum Computing Lecture 1: Fundamentals
Introduction to Quantum Computing Lecture 1: FundamentalsProfessor Lili Saghafi
 
Quantum Computing Quantum Internet 2020_unit 1 By: Prof. Lili Saghafi
Quantum Computing Quantum Internet 2020_unit 1 By: Prof. Lili SaghafiQuantum Computing Quantum Internet 2020_unit 1 By: Prof. Lili Saghafi
Quantum Computing Quantum Internet 2020_unit 1 By: Prof. Lili SaghafiProfessor Lili Saghafi
 
Programming Languages Categories / Programming Paradigm By: Prof. Lili Saghafi
Programming Languages Categories / Programming Paradigm By: Prof. Lili Saghafi Programming Languages Categories / Programming Paradigm By: Prof. Lili Saghafi
Programming Languages Categories / Programming Paradigm By: Prof. Lili Saghafi Professor Lili Saghafi
 
Introduction to blockchain lesson 2 By: Professor Lili Saghafi
Introduction to blockchain lesson 2 By: Professor Lili SaghafiIntroduction to blockchain lesson 2 By: Professor Lili Saghafi
Introduction to blockchain lesson 2 By: Professor Lili SaghafiProfessor Lili Saghafi
 
Introduction to Blockchain Technology By Professor Lili Saghafi
Introduction to Blockchain Technology By Professor Lili SaghafiIntroduction to Blockchain Technology By Professor Lili Saghafi
Introduction to Blockchain Technology By Professor Lili SaghafiProfessor Lili Saghafi
 
Cyber Security and Post Quantum Cryptography By: Professor Lili Saghafi
Cyber Security and Post Quantum Cryptography By: Professor Lili SaghafiCyber Security and Post Quantum Cryptography By: Professor Lili Saghafi
Cyber Security and Post Quantum Cryptography By: Professor Lili SaghafiProfessor Lili Saghafi
 
Machine learning by using python lesson 3 Confusion Matrix By : Professor Lil...
Machine learning by using python lesson 3 Confusion Matrix By : Professor Lil...Machine learning by using python lesson 3 Confusion Matrix By : Professor Lil...
Machine learning by using python lesson 3 Confusion Matrix By : Professor Lil...Professor Lili Saghafi
 
Machine learning by using python lesson 2 Neural Networks By Professor Lili S...
Machine learning by using python lesson 2 Neural Networks By Professor Lili S...Machine learning by using python lesson 2 Neural Networks By Professor Lili S...
Machine learning by using python lesson 2 Neural Networks By Professor Lili S...Professor Lili Saghafi
 
Machine learning by using python Lesson One Part 2 By Professor Lili Saghafi
Machine learning by using python Lesson One Part 2 By Professor Lili SaghafiMachine learning by using python Lesson One Part 2 By Professor Lili Saghafi
Machine learning by using python Lesson One Part 2 By Professor Lili SaghafiProfessor Lili Saghafi
 
Machine learning by using python By: Professor Lili Saghafi
Machine learning by using python By: Professor Lili SaghafiMachine learning by using python By: Professor Lili Saghafi
Machine learning by using python By: Professor Lili SaghafiProfessor Lili Saghafi
 
What is digital humanities ,By: Professor Lili Saghafi
What is digital humanities ,By: Professor Lili SaghafiWhat is digital humanities ,By: Professor Lili Saghafi
What is digital humanities ,By: Professor Lili SaghafiProfessor Lili Saghafi
 
Effective Algorithm for n Fibonacci Number By: Professor Lili Saghafi
Effective Algorithm for n Fibonacci Number By: Professor Lili SaghafiEffective Algorithm for n Fibonacci Number By: Professor Lili Saghafi
Effective Algorithm for n Fibonacci Number By: Professor Lili SaghafiProfessor Lili Saghafi
 
Computer Security Cyber Security DOS_DDOS Attacks By: Professor Lili Saghafi
Computer Security Cyber Security DOS_DDOS Attacks By: Professor Lili SaghafiComputer Security Cyber Security DOS_DDOS Attacks By: Professor Lili Saghafi
Computer Security Cyber Security DOS_DDOS Attacks By: Professor Lili SaghafiProfessor Lili Saghafi
 
New Assessments in Higher Education with Computers by: Prof Lili Saghafi
New Assessments in Higher Education with Computers by: Prof Lili Saghafi New Assessments in Higher Education with Computers by: Prof Lili Saghafi
New Assessments in Higher Education with Computers by: Prof Lili Saghafi Professor Lili Saghafi
 
Arrow of time by Professor Lili Saghafi
Arrow of time by Professor Lili SaghafiArrow of time by Professor Lili Saghafi
Arrow of time by Professor Lili SaghafiProfessor Lili Saghafi
 

More from Professor Lili Saghafi (20)

Artificial Intelligence and the importance of Data, By : Prof. Lili Saghafi
Artificial Intelligence and the importance of Data,  By : Prof. Lili SaghafiArtificial Intelligence and the importance of Data,  By : Prof. Lili Saghafi
Artificial Intelligence and the importance of Data, By : Prof. Lili Saghafi
 
Software Engineering
Software EngineeringSoftware Engineering
Software Engineering
 
Ai
AiAi
Ai
 
Introduction to Quantum Computing Lecture 1: Fundamentals
Introduction to Quantum Computing Lecture 1: FundamentalsIntroduction to Quantum Computing Lecture 1: Fundamentals
Introduction to Quantum Computing Lecture 1: Fundamentals
 
Quantum Computing Quantum Internet 2020_unit 1 By: Prof. Lili Saghafi
Quantum Computing Quantum Internet 2020_unit 1 By: Prof. Lili SaghafiQuantum Computing Quantum Internet 2020_unit 1 By: Prof. Lili Saghafi
Quantum Computing Quantum Internet 2020_unit 1 By: Prof. Lili Saghafi
 
Programming Languages Categories / Programming Paradigm By: Prof. Lili Saghafi
Programming Languages Categories / Programming Paradigm By: Prof. Lili Saghafi Programming Languages Categories / Programming Paradigm By: Prof. Lili Saghafi
Programming Languages Categories / Programming Paradigm By: Prof. Lili Saghafi
 
Introduction to blockchain lesson 2 By: Professor Lili Saghafi
Introduction to blockchain lesson 2 By: Professor Lili SaghafiIntroduction to blockchain lesson 2 By: Professor Lili Saghafi
Introduction to blockchain lesson 2 By: Professor Lili Saghafi
 
Introduction to Blockchain Technology By Professor Lili Saghafi
Introduction to Blockchain Technology By Professor Lili SaghafiIntroduction to Blockchain Technology By Professor Lili Saghafi
Introduction to Blockchain Technology By Professor Lili Saghafi
 
Cyber Security and Post Quantum Cryptography By: Professor Lili Saghafi
Cyber Security and Post Quantum Cryptography By: Professor Lili SaghafiCyber Security and Post Quantum Cryptography By: Professor Lili Saghafi
Cyber Security and Post Quantum Cryptography By: Professor Lili Saghafi
 
Machine learning by using python lesson 3 Confusion Matrix By : Professor Lil...
Machine learning by using python lesson 3 Confusion Matrix By : Professor Lil...Machine learning by using python lesson 3 Confusion Matrix By : Professor Lil...
Machine learning by using python lesson 3 Confusion Matrix By : Professor Lil...
 
Machine learning by using python lesson 2 Neural Networks By Professor Lili S...
Machine learning by using python lesson 2 Neural Networks By Professor Lili S...Machine learning by using python lesson 2 Neural Networks By Professor Lili S...
Machine learning by using python lesson 2 Neural Networks By Professor Lili S...
 
Machine learning by using python Lesson One Part 2 By Professor Lili Saghafi
Machine learning by using python Lesson One Part 2 By Professor Lili SaghafiMachine learning by using python Lesson One Part 2 By Professor Lili Saghafi
Machine learning by using python Lesson One Part 2 By Professor Lili Saghafi
 
Machine learning by using python By: Professor Lili Saghafi
Machine learning by using python By: Professor Lili SaghafiMachine learning by using python By: Professor Lili Saghafi
Machine learning by using python By: Professor Lili Saghafi
 
What is digital humanities ,By: Professor Lili Saghafi
What is digital humanities ,By: Professor Lili SaghafiWhat is digital humanities ,By: Professor Lili Saghafi
What is digital humanities ,By: Professor Lili Saghafi
 
Effective Algorithm for n Fibonacci Number By: Professor Lili Saghafi
Effective Algorithm for n Fibonacci Number By: Professor Lili SaghafiEffective Algorithm for n Fibonacci Number By: Professor Lili Saghafi
Effective Algorithm for n Fibonacci Number By: Professor Lili Saghafi
 
Computer Security Cyber Security DOS_DDOS Attacks By: Professor Lili Saghafi
Computer Security Cyber Security DOS_DDOS Attacks By: Professor Lili SaghafiComputer Security Cyber Security DOS_DDOS Attacks By: Professor Lili Saghafi
Computer Security Cyber Security DOS_DDOS Attacks By: Professor Lili Saghafi
 
New Assessments in Higher Education with Computers by: Prof Lili Saghafi
New Assessments in Higher Education with Computers by: Prof Lili Saghafi New Assessments in Higher Education with Computers by: Prof Lili Saghafi
New Assessments in Higher Education with Computers by: Prof Lili Saghafi
 
Quick Sort By Prof Lili Saghafi
Quick Sort By Prof Lili SaghafiQuick Sort By Prof Lili Saghafi
Quick Sort By Prof Lili Saghafi
 
Arrow of time by Professor Lili Saghafi
Arrow of time by Professor Lili SaghafiArrow of time by Professor Lili Saghafi
Arrow of time by Professor Lili Saghafi
 
Pointers by: Professor Lili Saghafi
Pointers by: Professor Lili SaghafiPointers by: Professor Lili Saghafi
Pointers by: Professor Lili Saghafi
 

Recently uploaded

DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptxDIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptxMichelleTuguinay1
 
Using Grammatical Signals Suitable to Patterns of Idea Development
Using Grammatical Signals Suitable to Patterns of Idea DevelopmentUsing Grammatical Signals Suitable to Patterns of Idea Development
Using Grammatical Signals Suitable to Patterns of Idea Developmentchesterberbo7
 
Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1GloryAnnCastre1
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management SystemChristalin Nelson
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONHumphrey A Beña
 
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnvESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnvRicaMaeCastro1
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)lakshayb543
 
How to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 DatabaseHow to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 DatabaseCeline George
 
Textual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHSTextual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHSMae Pangan
 
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Association for Project Management
 
IPCRF/RPMS 2024 Classroom Observation tool is your access to the new performa...
IPCRF/RPMS 2024 Classroom Observation tool is your access to the new performa...IPCRF/RPMS 2024 Classroom Observation tool is your access to the new performa...
IPCRF/RPMS 2024 Classroom Observation tool is your access to the new performa...MerlizValdezGeronimo
 
Expanded definition: technical and operational
Expanded definition: technical and operationalExpanded definition: technical and operational
Expanded definition: technical and operationalssuser3e220a
 
Multi Domain Alias In the Odoo 17 ERP Module
Multi Domain Alias In the Odoo 17 ERP ModuleMulti Domain Alias In the Odoo 17 ERP Module
Multi Domain Alias In the Odoo 17 ERP ModuleCeline George
 
Narcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdfNarcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdfPrerana Jadhav
 
How to Fix XML SyntaxError in Odoo the 17
How to Fix XML SyntaxError in Odoo the 17How to Fix XML SyntaxError in Odoo the 17
How to Fix XML SyntaxError in Odoo the 17Celine George
 
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQ-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQuiz Club NITW
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4MiaBumagat1
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptxmary850239
 

Recently uploaded (20)

DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptxDIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
 
Using Grammatical Signals Suitable to Patterns of Idea Development
Using Grammatical Signals Suitable to Patterns of Idea DevelopmentUsing Grammatical Signals Suitable to Patterns of Idea Development
Using Grammatical Signals Suitable to Patterns of Idea Development
 
Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management System
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
 
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptxINCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
 
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnvESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
 
How to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 DatabaseHow to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 Database
 
prashanth updated resume 2024 for Teaching Profession
prashanth updated resume 2024 for Teaching Professionprashanth updated resume 2024 for Teaching Profession
prashanth updated resume 2024 for Teaching Profession
 
Textual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHSTextual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHS
 
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
 
IPCRF/RPMS 2024 Classroom Observation tool is your access to the new performa...
IPCRF/RPMS 2024 Classroom Observation tool is your access to the new performa...IPCRF/RPMS 2024 Classroom Observation tool is your access to the new performa...
IPCRF/RPMS 2024 Classroom Observation tool is your access to the new performa...
 
Expanded definition: technical and operational
Expanded definition: technical and operationalExpanded definition: technical and operational
Expanded definition: technical and operational
 
Multi Domain Alias In the Odoo 17 ERP Module
Multi Domain Alias In the Odoo 17 ERP ModuleMulti Domain Alias In the Odoo 17 ERP Module
Multi Domain Alias In the Odoo 17 ERP Module
 
Narcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdfNarcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdf
 
How to Fix XML SyntaxError in Odoo the 17
How to Fix XML SyntaxError in Odoo the 17How to Fix XML SyntaxError in Odoo the 17
How to Fix XML SyntaxError in Odoo the 17
 
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQ-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx
 

Data science unit 1 By: Professor Lili Saghafi

  • 1. Data Science Unit 1 By : Professor Lili Saghafi https://professorlilisaghafiquantumcomputing.wor dpress.com proflilisaghafi@gmail.com https://sites.google.com/site/professorlilisaghafi/home @Lili_PLS
  • 2. Introduction • The course is designed as an introduction to programming and statistics for students from many different majors. • It teaches practical techniques that apply across many disciplines and also serves as the technical foundation for more advanced courses in data science, statistics, and computer science.
  • 3. Programming Prerequisite • No prior programming experience is necessary, but many of the programming techniques covered in this course do not appear in a typical introduction to programming. • The programming content of this course focuses on manipulating data tables, rather than building software applications. • Students who take the course after taking other programming courses often learn a new approach to programming that they haven't encountered before.
  • 4.
  • 5. Statistic Prerequisite • No prior statistics experience is necessary, but many of the statistical inference techniques covered in this course do not appear in an undergraduate statistics curriculum. • The techniques in this course rely heavily on sampling and simulation, and they require computers to carry out. • Students who have taken statistics courses before often learn new methods to complement what they already know.
  • 6. Understanding problem domains Prerequisite • Data science is more than just a combination of programming and statistics. • Effective data science requires understanding problem domains and correctly interpreting domain-specific approaches. • The examples in this course are largely drawn from real-world data sets, and one of the main goals of this course is to develop the ability to apply analysis and prediction techniques to real- world scenarios.
  • 7. NO Prerequisite • This course is designed specifically for those who have not previously taken statistics or computer science courses.
  • 8. Equipment and Supplies • A computer • R Studio (https://cran.r-project.org/ ) • Math Player • NVDA reader • SAS or Python • MS Azure • A browser that supports Jupyter (Project Jupyter exists to develop open-source software, open-standards, and services for interactive computing across dozens of programming languages). https://jupyter.org/ • Jupyter notebooks to complete lab assignments. • We highly recommend using Google Chrome to complete Jupyter notebook lab assignments. https://jupyter.org/
  • 9. Using Jupyter Notebooks on Microsoft Azure • https://notebooks.azure.com • an overview of using Jupyter Notebooks with Python 3. • For further information on Jupyter Notebooks see the documentation at http://Jupyter.org.
  • 10. R
  • 11.
  • 12.
  • 13.
  • 14.
  • 15.
  • 16.
  • 17.
  • 18.
  • 19.
  • 20.
  • 21. Jupyter notebook • Jupyter notebook, and it's not running on your computer. Instead, Google has generously donated compute cloud credits so that we can run your code on Google's machines in order to execute whatever examples you want, including all of the labs for the course.So, thanks, Google! • You'll learn about how to use this Jupyter environment in the labs. https://jupyter.org/try OR https://jupyter.org/install.html • For now, all you need to know is that you can run whatever examples you want by clicking on a cell, holding down shift, and pressing return or enter. So in this case, we told the computer to add two and two together and that made four. • Now the examples are going to get a lot more interesting soon, and you'll learn how to use this environment, which is one of the most popular environments for data science work out there in the world today.
  • 22. Jupyter notebook • Thanks to Google's support, all of the software relevant to the course is already pre-installed on their systems that you have access to, so you can start working on examples right without having to install anything. https://jupyter.org/try
  • 25. Recommended Text • Analytics, Data Science, & Artificial Intelligence: Systems for Decision Support, 11/E – Ramesh Sharda – Dursun Delen, Oklahoma State University – Efraim Turban, Oklahoma State University – ISBN-10: 0135192013 • ISBN- 13: 9780135192016 – ©2020 • Pearson • Cloth, 832 pp – Published 02/11/2019 • – Reading this text is not required but it is helpful for mastering the course material.
  • 26.
  • 27. Why Data science? • It's about taking large data sets and trying to make them useful or informative,especially for understanding the world or making informed decisions. • We need to use ideas from computing, ideas from statistics and also domain knowledge that informs what the data really represents.
  • 28. Domain knowledge • You can't do an analysis in the legal domain without understanding something about the law, so that's what we mean by domain knowledge. • It's that you really have to understand when you have a data set, some big table of numbers and descriptions, what's really going on behind those numbers and what they represent about the world.
  • 29. So what is data science? • What do you get when you combine computing and statistics and domain knowledge together? • You get a science that's about drawing useful conclusions from data using computation as our primary tool.
  • 30. Data science, as a practice, has three core activities.
  • 31. 1-Exploration • Exploration is figuring out what patterns exist in the data. • When you have many observations about some phenomenon, what can you conclude about the phenomenon itself? • Instead of just looking at large tables of numbers, we'll draw data visualizations because it's much easier to interpret lot of information at once if it's portrayed in some kind of visualway.
  • 32. 2-Statistical Inference • Once we've found a pattern, we need to perform statistical inference, and that's because some patterns are there just by chance and some are there because they're a reflection of some underlying process that's really interesting about the world. • The goal of statistical inference is to quantify whether the patterns that we observe during the exploration phase are reliable. • If we collected more data, would we see this pattern again or not?
  • 33. Randomization • The primary tool we have is randomization because by simulating random processes, we can see what kinds of patterns appear just by chance.
  • 34. 3-Prediction • And if the pattern we observe is not the kind of thing that could just appear by chance, then we can conclude that it's because of some robust or reliable pattern in the underlying phenomenon we want to study. • We'll perform prediction. • This is where we have partial information about something we want to know, and we want to guess about the things we don't know yet.
  • 35. Machine Learning • We are making informed guesses, quantitative guesses using a discipline called machine learning. • Normally when we write programs, we just focus on the particular logic of what the computer should do, but machine learning is about not programming every detail, but instead using the data to make decisions or choice within that program.
  • 36. A form of prediction • So when we write a program, for instance, to recognize speech or automatically translate languages or control a car or a robot, we don't actually write down all the details of what to do, but instead use examples from the world to help computers automatically learn how to behave. • And that's a form of prediction, one that we'll talk about in this course.
  • 37. Three stages in this course • And these three stages correspond to how we'll approach the material in this course. 1. First talk about how to identify patterns, 2. then we'll talk about quantifying whether those patterns are reliable. 3. And finally, based on the patterns we've discovered, the reliable ones can help us make informed guesses about the information that we wish we knew.
  • 38. On the way to become a Data scientist • Once you can do all that, you're well on your way to being a data scientist. • Now in the process of doing all these things, it's important that you learn how to program a computer, because computing underlies each step of the way and learning to program is just an essential part of participating in this discipline.
  • 39. Data Science Thanks Professor Lili Saghafi proflilisaghafi@gmail.com https://sites.google.com/site/professorlilisaghafi/home @Lili_PLS