SlideShare une entreprise Scribd logo
1  sur  21
Télécharger pour lire hors ligne
IIPGH Webinar
Getting Started With Data Science | AI
What Shall we talk about today?
● Data Science | AI, what is it
about and key enablers!
● Why should we care?
● Opportunities?
● Careers & Learning Paths,
Technology Stack
● Questions
Emmanuel Asimadi
easimadi
Data Science | AI, what is it about?
Let’s keep it simple
- The science of deriving value from data
Data Scientist (n.): Person who is better at statistics than any
software engineer and better at software engineering than any
statistician ~ Josh Wills tweet 2012
Data science is an interdisciplinary field that uses scientific
methods, processes, algorithms and systems to extract
knowledge and insights from data in various forms, both
structured and unstructured ~ wikipedia
How did we get? -
some of the transformations that enabled data science
Solution
Data Science to the rescue.
Apache Spark, Machine Learning, Deep Learning...etc.
Vertical
Scaling
Monolytic Database Systems, SQL, BI,
scale-up get a bigger Machine
Horizontal
Scaling
Hadoop mapreduce, scale horizontally, get more
commodity machines not bigger server, Separation
of Storage from compute
Web Services /
REST API
Enabling applications to share data via well defined
end-points. These have underachieved because the goal
was for discoverable and even autonomous web services.
Cloud Computing
& Big Data
pioneered by Amazon, rent computing & storage
resources, Store Big Data cheaply, Big Data [Volume,
Velocity, Variety]
Artificial Intelligence
Source: nvidia Additional Reading: Data Science Central (holds a different view)
Artificial Intelligence
General techniques to get machines to
achieve human level intelligence.
evaluated by TEST like
- Turing TEST
- Robot College student TEST
- Employment TEST
Machine Learning
Primarily statistical & other techniques
that help machines learn from
“experience”.
Deep Learning
A subset of machine learning
algorithms based on neural networks
mimicking how the brain works.
Essentially building more complex
functions.
You are already familiar with Machine Learning!!
3
5
9
?
1
2
4
5
f(x)
= 11
f(x) = 2X + 1 is the ML Model.
The goal of training is to find this function f(x).
Input
X
Output
Y
In reality
Input
X
Output
Y
Car Make Car Age Car Price
Why Should We Care?
Your Organisation’s Data on STEROIDS
Deriving new* value and MONETISING data
Traditionally
- Data as a cost center
New Paradigm
● Data is an asset (can enable new Revenue streams)
● Data is a key differentiator
● Data Creates a new barrier to entry for your
competition
Your Organisation’s Data on STEROIDS
Deriving new* value and MONETISING data
Analytic Spectrum
Where does your business sit?
Why Should We Care?
Its pervasive and affects every industry - Medicine, Literature, Journalism, Agriculture name it.
Careers & Learning Paths
Opportunities
Source: Data Science Central
Job Profiles
Business Problem Production
Domain Experts Business Analysts Data Scientist Data EngineerData Architect
Devops
Machine Learning EngineerBI Developer
Visualisation Developer
Software Developer
linux
Cloud computing
OS
Networking
SQL
Python
R
Scala
Qlik
Tableau
Looker
Data Visualisation
Apache Spark
NoSQL
Statistics & Quantitative
Analysis Apache Hadoop
Machine Learning
PowerBI
Data Mining
Creative Problem Solving
Business awareness
Sample Technical Skills:
Technology Stack biased towards Amazon AWS :) but captures key concepts
Source: AWS Tutorial
Other Cloud
Providers can meet
similar requirement
- google, microsoft
azure, etc
None of the major cloud
providers has a
datacenter in Africa.
- It is an issue if you have
data locality concerns.
- Most technologies are
open source so can be
implemented local
datacenters.
Example Learning Path Apache Spark - a big data framework
Foundation Deep Dive
What’s New Spark 2.x
Review of what’s new in Spark,
Data Structures, Key Concepts
and Operations. language!!!
Working With Spark
ML Models
Intuitively Understand, featurize,
build, evaluate and deploy Spark
Machine Learning (ML) Models.
Structured Streaming
Understand Streaming and deploy
your own ML-based structured
streaming application
Natural Language
Processing
Build Natural Language
applications working with spark
and other libraries.
Deep Learning
Understand & Apply
Spark Vs Deep learning
Use-cases
SparkSQL & Graphs
Working with SparkSQL, GraphX
and Graphframes
Example Learning Path Python For Data Science
Foundation Deep Dive
Intro to Data Science & Python
Key Concepts, Basics of Programming,
Data Structures (collections), Operations
(comprehensions) and Navigating help.
Data Science Libraries
Pandas, Numpy, Matplotlib,
seaborn, bokeh, sklearn, scipy
Machine Learning
Sklearn, Scipy… etc
Natural Language
Processing
Spacy, NLTK
Explore & Apply
Deep learning etc.
explore and keep
applying knowledge
Similar for R, Scala etc
Emml !!!! such (job) opportunities (almost) don’t exist in our world!
How can we find or create them?
Data Science Competitions
Kaggle
- Competitions $
- learn
Freelancer / Entrepreneur $$
- Upwork* (it works)
- BOAMI
- Guru, gigster Etc
Employment $$
- Demonstrate value and
get employed
Citizen Data Science Community
Using Data for Social Good & Learning
call-for-analysis
Analyse and build data applications
Output: inform, recommend act for social
good, learn, enable new business
call-for-data
Help us collect and publish interesting
Ghana Datasets for the community.
Output: published dataset
call-for-exploration
Explore and Curate the dataset for the
community
Output: cleaned dataset
01
03 02
#citizenDataScienceGh
#cdsgh
https://ds4good.github.io/ghana-datasets/ https://bit.ly/2BM74MK https://www.kaggle.com/citizen-ds-ghana
https://public.tableau.com/profile/datanix.ds4good#!/
Recab
● What is Data Science | AI and its
Motivation!
● Opportunities
● Career & Learning Paths, Technology Stack
● What can I do to take advantage?
Cloud Notebook Environments
don’t about worry about laptop capability or installations*
● Kaggle
● Google Colaboratory
● Azure Notebooks
And many more...
Apache Spark
don’t about worry about laptop capability or installations*
● Databrick Community Edition (Notebooks) - creators of spark
Resources
● interesting books/videos - O'reilly Media,Packt...etc
● MOOC - edX,Coursera,Udemy, Udacity...
● Social Media - not organised can be distracting.
● Blogs: towardsdatascience, KD Nuggets,analyticVidhya
● Youtube: socratica :), google adventures of AI...etc
Wanting to learn everything is like wanting to learn the dictionary
Any Interesting Resources?
You don’t need much start exploring
Questions

Contenu connexe

Tendances

Data science a practitioner's perspective
Data science  a practitioner's perspectiveData science  a practitioner's perspective
Data science a practitioner's perspectiveAmir Ziai
 
From Lab to Factory: Or how to turn data into value
From Lab to Factory: Or how to turn data into valueFrom Lab to Factory: Or how to turn data into value
From Lab to Factory: Or how to turn data into valuePeadar Coyle
 
Mastering the variety dimension of Big Data with semantic technologies: high ...
Mastering the variety dimension of Big Data with semantic technologies: high ...Mastering the variety dimension of Big Data with semantic technologies: high ...
Mastering the variety dimension of Big Data with semantic technologies: high ...Artificial Intelligence Institute at UofSC
 
YHORG Presentation 23 February 2016
YHORG Presentation 23 February 2016YHORG Presentation 23 February 2016
YHORG Presentation 23 February 2016Richard Vidgen
 
Data science as a professional career
Data science as a professional careerData science as a professional career
Data science as a professional careerDavid Rostcheck
 
data scientist the sexiest job of the 21st century
data scientist the sexiest job of the 21st centurydata scientist the sexiest job of the 21st century
data scientist the sexiest job of the 21st centuryFrank Kienle
 
NDC Oslo : A Practical Introduction to Data Science
NDC Oslo : A Practical Introduction to Data ScienceNDC Oslo : A Practical Introduction to Data Science
NDC Oslo : A Practical Introduction to Data ScienceMark West
 
DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION
DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION
DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION Elvis Muyanja
 
AI, Knowledge Representation and Graph Databases -
 Key Trends in Data Science
AI, Knowledge Representation and Graph Databases -
 Key Trends in Data ScienceAI, Knowledge Representation and Graph Databases -
 Key Trends in Data Science
AI, Knowledge Representation and Graph Databases -
 Key Trends in Data ScienceOptum
 
Human-in-the-loop: a design pattern for managing teams which leverage ML by P...
Human-in-the-loop: a design pattern for managing teams which leverage ML by P...Human-in-the-loop: a design pattern for managing teams which leverage ML by P...
Human-in-the-loop: a design pattern for managing teams which leverage ML by P...Big Data Spain
 
The State of Artificial Intelligence in 2018: A Good Old Fashioned Report
The State of Artificial Intelligence in 2018: A Good Old Fashioned ReportThe State of Artificial Intelligence in 2018: A Good Old Fashioned Report
The State of Artificial Intelligence in 2018: A Good Old Fashioned ReportNathan Benaich
 
Data+Science : A First Course
Data+Science : A First CourseData+Science : A First Course
Data+Science : A First CourseArnab Majumdar
 
The Evolution of Data Science
The Evolution of Data ScienceThe Evolution of Data Science
The Evolution of Data ScienceKenny Daniel
 
GeeCon Prague 2018 - A Practical-ish Introduction to Data Science
GeeCon Prague 2018 - A Practical-ish Introduction to Data ScienceGeeCon Prague 2018 - A Practical-ish Introduction to Data Science
GeeCon Prague 2018 - A Practical-ish Introduction to Data ScienceMark West
 
Data Science, Machine Learning, and H2O
Data Science, Machine Learning, and H2OData Science, Machine Learning, and H2O
Data Science, Machine Learning, and H2OSri Ambati
 

Tendances (20)

Data science a practitioner's perspective
Data science  a practitioner's perspectiveData science  a practitioner's perspective
Data science a practitioner's perspective
 
From Lab to Factory: Or how to turn data into value
From Lab to Factory: Or how to turn data into valueFrom Lab to Factory: Or how to turn data into value
From Lab to Factory: Or how to turn data into value
 
Mastering the variety dimension of Big Data with semantic technologies: high ...
Mastering the variety dimension of Big Data with semantic technologies: high ...Mastering the variety dimension of Big Data with semantic technologies: high ...
Mastering the variety dimension of Big Data with semantic technologies: high ...
 
YHORG Presentation 23 February 2016
YHORG Presentation 23 February 2016YHORG Presentation 23 February 2016
YHORG Presentation 23 February 2016
 
data science
data sciencedata science
data science
 
Data science as a professional career
Data science as a professional careerData science as a professional career
Data science as a professional career
 
data scientist the sexiest job of the 21st century
data scientist the sexiest job of the 21st centurydata scientist the sexiest job of the 21st century
data scientist the sexiest job of the 21st century
 
Ml intro
Ml introMl intro
Ml intro
 
NDC Oslo : A Practical Introduction to Data Science
NDC Oslo : A Practical Introduction to Data ScienceNDC Oslo : A Practical Introduction to Data Science
NDC Oslo : A Practical Introduction to Data Science
 
DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION
DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION
DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION
 
AI, Knowledge Representation and Graph Databases -
 Key Trends in Data Science
AI, Knowledge Representation and Graph Databases -
 Key Trends in Data ScienceAI, Knowledge Representation and Graph Databases -
 Key Trends in Data Science
AI, Knowledge Representation and Graph Databases -
 Key Trends in Data Science
 
Human-in-the-loop: a design pattern for managing teams which leverage ML by P...
Human-in-the-loop: a design pattern for managing teams which leverage ML by P...Human-in-the-loop: a design pattern for managing teams which leverage ML by P...
Human-in-the-loop: a design pattern for managing teams which leverage ML by P...
 
The State of Artificial Intelligence in 2018: A Good Old Fashioned Report
The State of Artificial Intelligence in 2018: A Good Old Fashioned ReportThe State of Artificial Intelligence in 2018: A Good Old Fashioned Report
The State of Artificial Intelligence in 2018: A Good Old Fashioned Report
 
Data+Science : A First Course
Data+Science : A First CourseData+Science : A First Course
Data+Science : A First Course
 
The Evolution of Data Science
The Evolution of Data ScienceThe Evolution of Data Science
The Evolution of Data Science
 
Data Science: Past, Present, and Future
Data Science: Past, Present, and FutureData Science: Past, Present, and Future
Data Science: Past, Present, and Future
 
GeeCon Prague 2018 - A Practical-ish Introduction to Data Science
GeeCon Prague 2018 - A Practical-ish Introduction to Data ScienceGeeCon Prague 2018 - A Practical-ish Introduction to Data Science
GeeCon Prague 2018 - A Practical-ish Introduction to Data Science
 
Data Science, Machine Learning, and H2O
Data Science, Machine Learning, and H2OData Science, Machine Learning, and H2O
Data Science, Machine Learning, and H2O
 
Data Science
Data ScienceData Science
Data Science
 
Data science
Data scienceData science
Data science
 

Similaire à IIPGH Webinar 1: Getting Started With Data Science

Data science presentation
Data science presentationData science presentation
Data science presentationMSDEVMTL
 
Brochure data science learning path board-infinity (1)
Brochure   data science learning path board-infinity (1)Brochure   data science learning path board-infinity (1)
Brochure data science learning path board-infinity (1)NirupamNishant2
 
Workshop_Presentation.pptx
Workshop_Presentation.pptxWorkshop_Presentation.pptx
Workshop_Presentation.pptxRUDRAPRASADSABAR
 
Data Workflows for Machine Learning - SF Bay Area ML
Data Workflows for Machine Learning - SF Bay Area MLData Workflows for Machine Learning - SF Bay Area ML
Data Workflows for Machine Learning - SF Bay Area MLPaco Nathan
 
Understanding the New World of Cognitive Computing
Understanding the New World of Cognitive ComputingUnderstanding the New World of Cognitive Computing
Understanding the New World of Cognitive ComputingDATAVERSITY
 
The Python ecosystem for data science - Landscape Overview
The Python ecosystem for data science - Landscape OverviewThe Python ecosystem for data science - Landscape Overview
The Python ecosystem for data science - Landscape OverviewDr. Ananth Krishnamoorthy
 
Data Workflows for Machine Learning - Seattle DAML
Data Workflows for Machine Learning - Seattle DAMLData Workflows for Machine Learning - Seattle DAML
Data Workflows for Machine Learning - Seattle DAMLPaco Nathan
 
7 ideas on encouraging advanced analytics
7 ideas on encouraging advanced analytics7 ideas on encouraging advanced analytics
7 ideas on encouraging advanced analyticsMark Tabladillo
 
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactData Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactDr. Sunil Kr. Pandey
 
Benefiting from Semantic AI along the data life cycle
Benefiting from Semantic AI along the data life cycleBenefiting from Semantic AI along the data life cycle
Benefiting from Semantic AI along the data life cycleMartin Kaltenböck
 
OSCON 2014: Data Workflows for Machine Learning
OSCON 2014: Data Workflows for Machine LearningOSCON 2014: Data Workflows for Machine Learning
OSCON 2014: Data Workflows for Machine LearningPaco Nathan
 
Big data webinar may23 nrit by sunil
Big data webinar may23 nrit by sunilBig data webinar may23 nrit by sunil
Big data webinar may23 nrit by sunilSujit Ghosh
 
Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02
Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02
Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02BIWUG
 
How to build your own Delve: combining machine learning, big data and SharePoint
How to build your own Delve: combining machine learning, big data and SharePointHow to build your own Delve: combining machine learning, big data and SharePoint
How to build your own Delve: combining machine learning, big data and SharePointJoris Poelmans
 
The Future of Data Science
The Future of Data ScienceThe Future of Data Science
The Future of Data ScienceDataWorks Summit
 

Similaire à IIPGH Webinar 1: Getting Started With Data Science (20)

Data science presentation
Data science presentationData science presentation
Data science presentation
 
Brochure data science learning path board-infinity (1)
Brochure   data science learning path board-infinity (1)Brochure   data science learning path board-infinity (1)
Brochure data science learning path board-infinity (1)
 
Data science and Machine learning Booklet
Data science and Machine learning BookletData science and Machine learning Booklet
Data science and Machine learning Booklet
 
Workshop_Presentation.pptx
Workshop_Presentation.pptxWorkshop_Presentation.pptx
Workshop_Presentation.pptx
 
Data Workflows for Machine Learning - SF Bay Area ML
Data Workflows for Machine Learning - SF Bay Area MLData Workflows for Machine Learning - SF Bay Area ML
Data Workflows for Machine Learning - SF Bay Area ML
 
Understanding the New World of Cognitive Computing
Understanding the New World of Cognitive ComputingUnderstanding the New World of Cognitive Computing
Understanding the New World of Cognitive Computing
 
The Python ecosystem for data science - Landscape Overview
The Python ecosystem for data science - Landscape OverviewThe Python ecosystem for data science - Landscape Overview
The Python ecosystem for data science - Landscape Overview
 
Data Workflows for Machine Learning - Seattle DAML
Data Workflows for Machine Learning - Seattle DAMLData Workflows for Machine Learning - Seattle DAML
Data Workflows for Machine Learning - Seattle DAML
 
Data Scientist Enablement roadmap 1.0
Data Scientist Enablement roadmap 1.0Data Scientist Enablement roadmap 1.0
Data Scientist Enablement roadmap 1.0
 
7 ideas on encouraging advanced analytics
7 ideas on encouraging advanced analytics7 ideas on encouraging advanced analytics
7 ideas on encouraging advanced analytics
 
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactData Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
 
Benefiting from Semantic AI along the data life cycle
Benefiting from Semantic AI along the data life cycleBenefiting from Semantic AI along the data life cycle
Benefiting from Semantic AI along the data life cycle
 
Proposed Talk Outline for Pycon2017
Proposed Talk Outline for Pycon2017 Proposed Talk Outline for Pycon2017
Proposed Talk Outline for Pycon2017
 
On Big Data
On Big DataOn Big Data
On Big Data
 
OSCON 2014: Data Workflows for Machine Learning
OSCON 2014: Data Workflows for Machine LearningOSCON 2014: Data Workflows for Machine Learning
OSCON 2014: Data Workflows for Machine Learning
 
Big data webinar may23 nrit by sunil
Big data webinar may23 nrit by sunilBig data webinar may23 nrit by sunil
Big data webinar may23 nrit by sunil
 
Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02
Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02
Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02
 
How to build your own Delve: combining machine learning, big data and SharePoint
How to build your own Delve: combining machine learning, big data and SharePointHow to build your own Delve: combining machine learning, big data and SharePoint
How to build your own Delve: combining machine learning, big data and SharePoint
 
Data science
Data science Data science
Data science
 
The Future of Data Science
The Future of Data ScienceThe Future of Data Science
The Future of Data Science
 

Dernier

Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowVadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowgargpaaro
 
Harnessing the Power of GenAI for BI and Reporting.pptx
Harnessing the Power of GenAI for BI and Reporting.pptxHarnessing the Power of GenAI for BI and Reporting.pptx
Harnessing the Power of GenAI for BI and Reporting.pptxParas Gupta
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums
 
Aspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraAspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraGovindSinghDasila
 
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制vexqp
 
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...Health
 
The-boAt-Story-Navigating-the-Waves-of-Innovation.pptx
The-boAt-Story-Navigating-the-Waves-of-Innovation.pptxThe-boAt-Story-Navigating-the-Waves-of-Innovation.pptx
The-boAt-Story-Navigating-the-Waves-of-Innovation.pptxVivek487417
 
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...Elaine Werffeli
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNKTimothy Spann
 
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样wsppdmt
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制vexqp
 
SR-101-01012024-EN.docx Federal Constitution of the Swiss Confederation
SR-101-01012024-EN.docx  Federal Constitution  of the Swiss ConfederationSR-101-01012024-EN.docx  Federal Constitution  of the Swiss Confederation
SR-101-01012024-EN.docx Federal Constitution of the Swiss ConfederationEfruzAsilolu
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...nirzagarg
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制vexqp
 
Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1ranjankumarbehera14
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteedamy56318795
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 

Dernier (20)

Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowVadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
 
Sequential and reinforcement learning for demand side management by Margaux B...
Sequential and reinforcement learning for demand side management by Margaux B...Sequential and reinforcement learning for demand side management by Margaux B...
Sequential and reinforcement learning for demand side management by Margaux B...
 
Harnessing the Power of GenAI for BI and Reporting.pptx
Harnessing the Power of GenAI for BI and Reporting.pptxHarnessing the Power of GenAI for BI and Reporting.pptx
Harnessing the Power of GenAI for BI and Reporting.pptx
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Aspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraAspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - Almora
 
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
 
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
 
The-boAt-Story-Navigating-the-Waves-of-Innovation.pptx
The-boAt-Story-Navigating-the-Waves-of-Innovation.pptxThe-boAt-Story-Navigating-the-Waves-of-Innovation.pptx
The-boAt-Story-Navigating-the-Waves-of-Innovation.pptx
 
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit RiyadhCytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
 
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
 
SR-101-01012024-EN.docx Federal Constitution of the Swiss Confederation
SR-101-01012024-EN.docx  Federal Constitution  of the Swiss ConfederationSR-101-01012024-EN.docx  Federal Constitution  of the Swiss Confederation
SR-101-01012024-EN.docx Federal Constitution of the Swiss Confederation
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
 
Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 

IIPGH Webinar 1: Getting Started With Data Science

  • 1. IIPGH Webinar Getting Started With Data Science | AI
  • 2. What Shall we talk about today? ● Data Science | AI, what is it about and key enablers! ● Why should we care? ● Opportunities? ● Careers & Learning Paths, Technology Stack ● Questions Emmanuel Asimadi easimadi
  • 3. Data Science | AI, what is it about? Let’s keep it simple - The science of deriving value from data Data Scientist (n.): Person who is better at statistics than any software engineer and better at software engineering than any statistician ~ Josh Wills tweet 2012 Data science is an interdisciplinary field that uses scientific methods, processes, algorithms and systems to extract knowledge and insights from data in various forms, both structured and unstructured ~ wikipedia
  • 4. How did we get? - some of the transformations that enabled data science Solution Data Science to the rescue. Apache Spark, Machine Learning, Deep Learning...etc. Vertical Scaling Monolytic Database Systems, SQL, BI, scale-up get a bigger Machine Horizontal Scaling Hadoop mapreduce, scale horizontally, get more commodity machines not bigger server, Separation of Storage from compute Web Services / REST API Enabling applications to share data via well defined end-points. These have underachieved because the goal was for discoverable and even autonomous web services. Cloud Computing & Big Data pioneered by Amazon, rent computing & storage resources, Store Big Data cheaply, Big Data [Volume, Velocity, Variety]
  • 5. Artificial Intelligence Source: nvidia Additional Reading: Data Science Central (holds a different view) Artificial Intelligence General techniques to get machines to achieve human level intelligence. evaluated by TEST like - Turing TEST - Robot College student TEST - Employment TEST Machine Learning Primarily statistical & other techniques that help machines learn from “experience”. Deep Learning A subset of machine learning algorithms based on neural networks mimicking how the brain works. Essentially building more complex functions.
  • 6. You are already familiar with Machine Learning!! 3 5 9 ? 1 2 4 5 f(x) = 11 f(x) = 2X + 1 is the ML Model. The goal of training is to find this function f(x). Input X Output Y In reality Input X Output Y Car Make Car Age Car Price
  • 8. Your Organisation’s Data on STEROIDS Deriving new* value and MONETISING data Traditionally - Data as a cost center New Paradigm ● Data is an asset (can enable new Revenue streams) ● Data is a key differentiator ● Data Creates a new barrier to entry for your competition
  • 9. Your Organisation’s Data on STEROIDS Deriving new* value and MONETISING data Analytic Spectrum Where does your business sit?
  • 10. Why Should We Care? Its pervasive and affects every industry - Medicine, Literature, Journalism, Agriculture name it.
  • 13. Job Profiles Business Problem Production Domain Experts Business Analysts Data Scientist Data EngineerData Architect Devops Machine Learning EngineerBI Developer Visualisation Developer Software Developer linux Cloud computing OS Networking SQL Python R Scala Qlik Tableau Looker Data Visualisation Apache Spark NoSQL Statistics & Quantitative Analysis Apache Hadoop Machine Learning PowerBI Data Mining Creative Problem Solving Business awareness Sample Technical Skills:
  • 14. Technology Stack biased towards Amazon AWS :) but captures key concepts Source: AWS Tutorial Other Cloud Providers can meet similar requirement - google, microsoft azure, etc None of the major cloud providers has a datacenter in Africa. - It is an issue if you have data locality concerns. - Most technologies are open source so can be implemented local datacenters.
  • 15. Example Learning Path Apache Spark - a big data framework Foundation Deep Dive What’s New Spark 2.x Review of what’s new in Spark, Data Structures, Key Concepts and Operations. language!!! Working With Spark ML Models Intuitively Understand, featurize, build, evaluate and deploy Spark Machine Learning (ML) Models. Structured Streaming Understand Streaming and deploy your own ML-based structured streaming application Natural Language Processing Build Natural Language applications working with spark and other libraries. Deep Learning Understand & Apply Spark Vs Deep learning Use-cases SparkSQL & Graphs Working with SparkSQL, GraphX and Graphframes
  • 16. Example Learning Path Python For Data Science Foundation Deep Dive Intro to Data Science & Python Key Concepts, Basics of Programming, Data Structures (collections), Operations (comprehensions) and Navigating help. Data Science Libraries Pandas, Numpy, Matplotlib, seaborn, bokeh, sklearn, scipy Machine Learning Sklearn, Scipy… etc Natural Language Processing Spacy, NLTK Explore & Apply Deep learning etc. explore and keep applying knowledge Similar for R, Scala etc
  • 17. Emml !!!! such (job) opportunities (almost) don’t exist in our world! How can we find or create them? Data Science Competitions Kaggle - Competitions $ - learn Freelancer / Entrepreneur $$ - Upwork* (it works) - BOAMI - Guru, gigster Etc Employment $$ - Demonstrate value and get employed
  • 18. Citizen Data Science Community Using Data for Social Good & Learning call-for-analysis Analyse and build data applications Output: inform, recommend act for social good, learn, enable new business call-for-data Help us collect and publish interesting Ghana Datasets for the community. Output: published dataset call-for-exploration Explore and Curate the dataset for the community Output: cleaned dataset 01 03 02 #citizenDataScienceGh #cdsgh https://ds4good.github.io/ghana-datasets/ https://bit.ly/2BM74MK https://www.kaggle.com/citizen-ds-ghana https://public.tableau.com/profile/datanix.ds4good#!/
  • 19. Recab ● What is Data Science | AI and its Motivation! ● Opportunities ● Career & Learning Paths, Technology Stack ● What can I do to take advantage?
  • 20. Cloud Notebook Environments don’t about worry about laptop capability or installations* ● Kaggle ● Google Colaboratory ● Azure Notebooks And many more... Apache Spark don’t about worry about laptop capability or installations* ● Databrick Community Edition (Notebooks) - creators of spark Resources ● interesting books/videos - O'reilly Media,Packt...etc ● MOOC - edX,Coursera,Udemy, Udacity... ● Social Media - not organised can be distracting. ● Blogs: towardsdatascience, KD Nuggets,analyticVidhya ● Youtube: socratica :), google adventures of AI...etc Wanting to learn everything is like wanting to learn the dictionary Any Interesting Resources? You don’t need much start exploring