SlideShare une entreprise Scribd logo
1  sur  17
DATA SCIENCE The highest paid job of future
Presented By:
Dr. Hemant Kumar Singh
Associate Professor & Head
Deptt. of Computer Science & Engineering
DATA SCIENCE
• Data Science is a new term. But in the same sense as
Columbus was discovered NEW Continent 1000 years ago
• A multi-disciplinary field that uses scientific methods,
processes, algorithms and systems to extract knowledge and
insights from structured and unstructured data.
• Google knows us more than our parents
DATA ALL AROUND
Lots of data is being collected and warehoused
• –Scientific Experiments
• –Internet of Things
• –Web data, e-commerce
• –Financial transactions, bank/credit transactions
• –Online trading and purchasing
• –Social Network
BIG DATA?
50
times
2010 2020
35 ZB
SOURCES OF BIG DATA
BIG DATA
Black
box Data
IOT
Social Media
Transactions
Call Record
Data
–……many more!
12+ TBs
of tweet data
every day
25+ TBs of
log data
every day
?
TBs
of
data
every
day
2+
billion
people on
the Web
by end
2011
30 billion RFID
tags today
(1.3B in 2005)
4.6
billion
camera
phones
world wide
100s of
millions
of GPS
enabled
devices
sold
annually
MEASUREMENTS
Unit of
Measure
Approximate
size
Mathematical
Representation
Kilobyte(KB) 103 210
Megabyte(MB) 106 220
Gigabyte(GB) 109 230
Terabyte(TB) 1012 240
Petabyte(PT) 1015 250
Exabyte(EB) 1018 260
Zetabyte(ZB) 1021 270
Yottabyte(YB) 1024 280
SOME FACTS ABOUT DATA
• 2.5 quintillion data bytes daily in 2020. 463 exabytes of data will be
generated each day by people as of 2025.
• There were 4.66 billion active internet users around the world in
January 2021.
• There were 319 million new internet users in 2020
• The end of 2021 could see two trillion Google searches.
SOME FACTS ABOUT DATA
• Five hundred thousand new Tweets were posted every day in 2020.
• Facebook generated four petabytes of data every day in 2020
• Almost 70% of GDPs will have undergone digitization by 2022.
• China had 3.17 billion IoT devices in 2020
• Cloud data storage around the world will amount to 200+ Zettabytes
by 2025.
AI-ML-DL-DS
DL
AI
ML
DS
AI enables machine to think
Self driving car is an AI application
It provides statistical tools to
explore the data (gives capability
to learn)
Create neural network(mimic human
brain)
Apply ML/DL tools and
some mathematical
models
PILLARS OF DATA SCIENCE
• Data science (DS) is a multidisciplinary field of study with
goal to address the challenges in big data
• Data science principles apply to all data –big and small
WHY LEARN DATA SCIENCE?
According to Simon Quinton, “If Analytics is the Engine, then
Data is the Fuel of the 21st century.” Without data, businesses
would not be able to uncover useful insights that could help
streamline their business.
• Computer Science field titles are Software development
engineer, software developer, Java developer, systems
engineer and network engineer
• Those who work in the field of data science may have titles
such as data scientist, data architect, data engineer,
business analyst and data analyst.
HOW IS DATA SCIENCE DIFFERENT FROM COMPUTER SCIENCE
Some typical computer science-related job duties include:
• Testing, documenting and debugging code
• Creating or modifying software and mobile apps
• Designing components of an application and integrating
them into a larger overall product
• Collaborating with a team of programmers to build and
optimize code
• Some typical data science-related job duties include:
• Collecting, “cleaning” and organizing data sets
• Building data models
• Asking and answering questions with large scale data
analysis
• Creating data visualizations and presenting findings to
stakeholders
• A Data Scientist is a data expert that extrapolates insights
from large data sets to help organizations solve complex
problems. To do so, Data Scientists combine
computer science, mathematics, statistics, and modeling
with a strong understanding of their business and industry
to unlock new opportunities and strategies.
WHAT DO DATA SCIENTISTS DO
Query Please

Contenu connexe

Tendances

Big data : Coudbells.com
Big data : Coudbells.comBig data : Coudbells.com
Big data : Coudbells.com
Cloudbells.com
 
Issues on Big Data & Cloud Computing
Issues on Big Data & Cloud Computing Issues on Big Data & Cloud Computing
Issues on Big Data & Cloud Computing
Seungyun Lee
 

Tendances (19)

Guide to big data analytics
Guide to big data analyticsGuide to big data analytics
Guide to big data analytics
 
Fit for the Digital Future
Fit for the Digital FutureFit for the Digital Future
Fit for the Digital Future
 
Big data : Coudbells.com
Big data : Coudbells.comBig data : Coudbells.com
Big data : Coudbells.com
 
A Review of Big data for Social Policy Decision Making
A Review of Big data for Social Policy Decision Making A Review of Big data for Social Policy Decision Making
A Review of Big data for Social Policy Decision Making
 
Dr Ohad Barzilay
Dr Ohad BarzilayDr Ohad Barzilay
Dr Ohad Barzilay
 
Issues on Big Data & Cloud Computing
Issues on Big Data & Cloud Computing Issues on Big Data & Cloud Computing
Issues on Big Data & Cloud Computing
 
Big data
Big dataBig data
Big data
 
SymEx 2015 - Agile Process for Big Data Analytic
SymEx 2015 - Agile Process for Big Data AnalyticSymEx 2015 - Agile Process for Big Data Analytic
SymEx 2015 - Agile Process for Big Data Analytic
 
Big data intro.pptx
Big data intro.pptxBig data intro.pptx
Big data intro.pptx
 
Your brain is too small to manage your business
Your brain is too small to manage your business Your brain is too small to manage your business
Your brain is too small to manage your business
 
IOT DATA AND BIG DATA
IOT DATA AND BIG DATAIOT DATA AND BIG DATA
IOT DATA AND BIG DATA
 
Big Data and Computer Science Education
Big Data and Computer Science EducationBig Data and Computer Science Education
Big Data and Computer Science Education
 
Big Data introduction - Café Numérique Bruxelles
Big Data introduction - Café Numérique BruxellesBig Data introduction - Café Numérique Bruxelles
Big Data introduction - Café Numérique Bruxelles
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
big data
big data big data
big data
 
Hadoop, Iot and Analytics- The Three Musketeers
Hadoop, Iot and Analytics- The Three MusketeersHadoop, Iot and Analytics- The Three Musketeers
Hadoop, Iot and Analytics- The Three Musketeers
 
Data science
Data scienceData science
Data science
 
Evolution of big data technology
Evolution of big data technologyEvolution of big data technology
Evolution of big data technology
 
Présentation de Bruno Schroder au 20e #mforum (07/12/2016)
Présentation de Bruno Schroder au 20e #mforum (07/12/2016)Présentation de Bruno Schroder au 20e #mforum (07/12/2016)
Présentation de Bruno Schroder au 20e #mforum (07/12/2016)
 

Similaire à Data science

Similaire à Data science (20)

Datascience
DatascienceDatascience
Datascience
 
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactData Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
 
Data science and Artificial Intelligence
Data science and Artificial IntelligenceData science and Artificial Intelligence
Data science and Artificial Intelligence
 
Business Intelligence & Predictive Analytic by Prof. Lili Saghafi
Business Intelligence & Predictive Analytic by Prof. Lili SaghafiBusiness Intelligence & Predictive Analytic by Prof. Lili Saghafi
Business Intelligence & Predictive Analytic by Prof. Lili Saghafi
 
SuanIct-Bigdata desktop-final
SuanIct-Bigdata desktop-finalSuanIct-Bigdata desktop-final
SuanIct-Bigdata desktop-final
 
Algorithm Marketplace and the new "Algorithm Economy"
Algorithm Marketplace and the new "Algorithm Economy"Algorithm Marketplace and the new "Algorithm Economy"
Algorithm Marketplace and the new "Algorithm Economy"
 
SKILLWISE-BIGDATA ANALYSIS
SKILLWISE-BIGDATA ANALYSISSKILLWISE-BIGDATA ANALYSIS
SKILLWISE-BIGDATA ANALYSIS
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Big data
Big dataBig data
Big data
 
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
 
bigdatappt.pptx
bigdatappt.pptxbigdatappt.pptx
bigdatappt.pptx
 
BI, AI/ML, Use Cases, Business Impact and how to get started
BI, AI/ML, Use Cases, Business Impact and how to get startedBI, AI/ML, Use Cases, Business Impact and how to get started
BI, AI/ML, Use Cases, Business Impact and how to get started
 
Sr. Jon Ander, Internet de las Cosas y Big Data: ¿hacia dónde va la Industria?
Sr. Jon Ander, Internet de las Cosas y Big Data: ¿hacia dónde va la Industria? Sr. Jon Ander, Internet de las Cosas y Big Data: ¿hacia dónde va la Industria?
Sr. Jon Ander, Internet de las Cosas y Big Data: ¿hacia dónde va la Industria?
 
GK NU CS 101 Session 1B (1).ppt
GK NU CS 101 Session 1B (1).pptGK NU CS 101 Session 1B (1).ppt
GK NU CS 101 Session 1B (1).ppt
 
QuickView #3 - Big Data
QuickView #3 - Big DataQuickView #3 - Big Data
QuickView #3 - Big Data
 
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
 
Bigdatappt 140225061440-phpapp01
Bigdatappt 140225061440-phpapp01Bigdatappt 140225061440-phpapp01
Bigdatappt 140225061440-phpapp01
 
DataScience_introduction.pdf
DataScience_introduction.pdfDataScience_introduction.pdf
DataScience_introduction.pdf
 
Big data-ppt-
Big data-ppt-Big data-ppt-
Big data-ppt-
 
Big data ppt
Big  data pptBig  data ppt
Big data ppt
 

Dernier

1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 
Gardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch LetterGardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch Letter
MateoGardella
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 

Dernier (20)

psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
Gardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch LetterGardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch Letter
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 

Data science

  • 1. DATA SCIENCE The highest paid job of future Presented By: Dr. Hemant Kumar Singh Associate Professor & Head Deptt. of Computer Science & Engineering
  • 2. DATA SCIENCE • Data Science is a new term. But in the same sense as Columbus was discovered NEW Continent 1000 years ago • A multi-disciplinary field that uses scientific methods, processes, algorithms and systems to extract knowledge and insights from structured and unstructured data. • Google knows us more than our parents
  • 3. DATA ALL AROUND Lots of data is being collected and warehoused • –Scientific Experiments • –Internet of Things • –Web data, e-commerce • –Financial transactions, bank/credit transactions • –Online trading and purchasing • –Social Network
  • 5. SOURCES OF BIG DATA BIG DATA Black box Data IOT Social Media Transactions Call Record Data –……many more!
  • 6. 12+ TBs of tweet data every day 25+ TBs of log data every day ? TBs of data every day 2+ billion people on the Web by end 2011 30 billion RFID tags today (1.3B in 2005) 4.6 billion camera phones world wide 100s of millions of GPS enabled devices sold annually
  • 7. MEASUREMENTS Unit of Measure Approximate size Mathematical Representation Kilobyte(KB) 103 210 Megabyte(MB) 106 220 Gigabyte(GB) 109 230 Terabyte(TB) 1012 240 Petabyte(PT) 1015 250 Exabyte(EB) 1018 260 Zetabyte(ZB) 1021 270 Yottabyte(YB) 1024 280
  • 8. SOME FACTS ABOUT DATA • 2.5 quintillion data bytes daily in 2020. 463 exabytes of data will be generated each day by people as of 2025. • There were 4.66 billion active internet users around the world in January 2021. • There were 319 million new internet users in 2020 • The end of 2021 could see two trillion Google searches.
  • 9. SOME FACTS ABOUT DATA • Five hundred thousand new Tweets were posted every day in 2020. • Facebook generated four petabytes of data every day in 2020 • Almost 70% of GDPs will have undergone digitization by 2022. • China had 3.17 billion IoT devices in 2020 • Cloud data storage around the world will amount to 200+ Zettabytes by 2025.
  • 10. AI-ML-DL-DS DL AI ML DS AI enables machine to think Self driving car is an AI application It provides statistical tools to explore the data (gives capability to learn) Create neural network(mimic human brain) Apply ML/DL tools and some mathematical models
  • 11. PILLARS OF DATA SCIENCE • Data science (DS) is a multidisciplinary field of study with goal to address the challenges in big data • Data science principles apply to all data –big and small
  • 12. WHY LEARN DATA SCIENCE? According to Simon Quinton, “If Analytics is the Engine, then Data is the Fuel of the 21st century.” Without data, businesses would not be able to uncover useful insights that could help streamline their business.
  • 13. • Computer Science field titles are Software development engineer, software developer, Java developer, systems engineer and network engineer • Those who work in the field of data science may have titles such as data scientist, data architect, data engineer, business analyst and data analyst. HOW IS DATA SCIENCE DIFFERENT FROM COMPUTER SCIENCE
  • 14. Some typical computer science-related job duties include: • Testing, documenting and debugging code • Creating or modifying software and mobile apps • Designing components of an application and integrating them into a larger overall product • Collaborating with a team of programmers to build and optimize code
  • 15. • Some typical data science-related job duties include: • Collecting, “cleaning” and organizing data sets • Building data models • Asking and answering questions with large scale data analysis • Creating data visualizations and presenting findings to stakeholders
  • 16. • A Data Scientist is a data expert that extrapolates insights from large data sets to help organizations solve complex problems. To do so, Data Scientists combine computer science, mathematics, statistics, and modeling with a strong understanding of their business and industry to unlock new opportunities and strategies. WHAT DO DATA SCIENTISTS DO