Dr. Windu Gata, M.Kom
Data Science
Profile
Dr. Windu Gata, M.Kom
Internal Trainer Multimatics
IT Consultant since 1995
Lecturer since 2003
Computer Researcher since 2008
Profile
Data Warehouse
Data Science
Data Warehouse and Data Science
Introduction Data Science
• What and Why Data Science
• Main Role and Data Science Method
• History and Data Science Implementation
Introduction Data Science
• Data (noun)
• facts and statistics collected together for reference or analysis
• the quantities, characters, or symbols on which operations are performed by a
computer, which may be stored and transmitted in the form of electrical signals and
recorded on magnetic, optical, or mechanical recording media.
• (philosophy) things known or assumed as facts, making the basis of reasoning or
calculation.
• mid 17th century (as a term in philosophy): from Latin, plural of datum
• Datum (noun)
• a piece of information.
• a fixed starting point of a scale or operation
Introduction Data Science
• Da.ta (Indonesian)
• n. true and factual information; example: collecting data to obtain
information about the lives of farmers
• n. information or factual material that can serve as the basis for study
(analysis or conclusions)
• n. (computing) information in a form that can be processed by a computer,
such as digital representations of text, numbers, graphic images, or sound
Introduction Data Science
Impact
• Data → Information → Knowledge → Insight → Wisdom
Introduction Data Science
• What Data Science
Data science combines math and
statistics, specialized programming,
advanced analytics, artificial
intelligence (AI), and machine
learning with specific subject matter
expertise to uncover actionable insights
hidden in an organization’s data. These
insights can be used to guide decision
making and strategic planning.
What is Data Science | IBM
Introduction Data Science
Data Science
Problem Solver
Introduction Data Science
• Why Data Science
1. Helps In Analyzing Needs & Resources
2. Helps Acquire Customers
3. Improves A Business’ Efficiency & Effectiveness
4. Derives Valuable Insights
5. Can Slash Marketing Costs
6. Helps To Make More Informed Decisions
7. Helps To Solve The Problem Of Big Data
8. Can Help Improve Business Productivity
9. Can Help A Business Expand Internationally
10. Helps A Business Against Competition
11. Can Help Improve Customer Service
12. Enables Personalized Service
13. Enhances The Workplace Experience
14. Applications In Healthcare & Biotech
15. A Part Of Research & Development
15 Reasons Why Data Science Is Important - Curious Desire
CRISP-DM
CRISP-DM
Business Understanding
The Business Understanding phase focuses on understanding
the objectives and requirements of the project
1. Determine business objectives: You should first “thoroughly understand, from a business perspective, what
the customer really wants to accomplish,” and then define business success criteria.
2. Assess situation: Determine resource availability and project requirements, assess risks and contingencies,
and conduct a cost-benefit analysis.
3. Determine data mining goals: In addition to defining the business objectives, you should also define what
success looks like from a technical data mining perspective.
4. Produce project plan: Select technologies and tools and define detailed plans for each project phase.
CRISP-DM
Data Understanding
The Data Understanding phase focuses on identifying, collecting, and analyzing the data sets
that can help you accomplish the project goals
1. Collect initial data: Acquire the necessary data and (if necessary) load it into your analysis tool.
2. Describe data: Examine the data and document its surface properties like data format, number of
records, or field identities.
3. Explore data: Dig deeper into the data. Query it, visualize it, and identify relationships among the data.
4. Verify data quality: How clean/dirty is the data? Document any quality issues.
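
The describe and verify tasks above can be sketched in a few lines of Python. This is only a minimal illustration using the standard library; the sample records, the field names (`height_cm`, `weight_kg`), and the helper `describe` are hypothetical and do not come from the slides.

```python
import csv
import io

# Hypothetical sample standing in for the "initial data" collected in task 1.
RAW_CSV = """id,height_cm,weight_kg
1,170,65
2,,80
3,182,abc
4,165,58
"""

def describe(rows):
    """Report record count, field names, and simple per-field quality issues."""
    fields = list(rows[0].keys()) if rows else []
    issues = {name: {"missing": 0, "non_numeric": 0} for name in fields}
    for row in rows:
        for name in fields:
            value = row[name].strip()
            if value == "":
                issues[name]["missing"] += 1
            else:
                try:
                    float(value)
                except ValueError:
                    issues[name]["non_numeric"] += 1
    return {"records": len(rows), "fields": fields, "issues": issues}

rows = list(csv.DictReader(io.StringIO(RAW_CSV)))
report = describe(rows)
print(report["records"], report["fields"])
for name, counts in report["issues"].items():
    print(name, counts)
```

The resulting report is the kind of artifact one might attach to the "document any quality issues" step; a real project would likely use a dataframe library and richer checks.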
CRISP-DM
Data Preparation
Prepares the final data set(s) for modeling
1. Select data: Determine which data sets will be used and document reasons for inclusion/exclusion.
2. Clean data: Often this is the lengthiest task. Without it, you’ll likely fall victim to garbage-in, garbage-out.
A common practice during this task is to correct, impute, or remove erroneous values.
3. Construct data: Derive new attributes that will be helpful. For example, derive someone’s body mass
index from height and weight fields.
4. Integrate data: Create new data sets by combining data from multiple sources.
5. Format data: Re-format data as necessary. For example, you might convert string values that store
numbers to numeric values so that you can perform mathematical operations.
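
The clean, construct, and format tasks above might look like the following sketch. The records and helper names (`to_float`, `prepare`) are hypothetical, and the cleaning policy shown here (impute missing heights with the median, drop rows whose weight cannot be parsed) is just one assumed choice among many.

```python
from statistics import median

# Hypothetical raw records; numbers arrive as strings, some missing or invalid.
raw = [
    {"id": "1", "height_cm": "170", "weight_kg": "65"},
    {"id": "2", "height_cm": "", "weight_kg": "80"},
    {"id": "3", "height_cm": "182", "weight_kg": "abc"},
    {"id": "4", "height_cm": "165", "weight_kg": "58"},
]

def to_float(text):
    """Format data: convert a numeric string to float, or None if unparseable."""
    try:
        return float(text)
    except (TypeError, ValueError):
        return None

def prepare(records):
    # Format: string fields become numeric values (None marks a bad value).
    parsed = [
        {
            "id": r["id"],
            "height_cm": to_float(r["height_cm"]),
            "weight_kg": to_float(r["weight_kg"]),
        }
        for r in records
    ]
    # Clean (assumed policy): impute missing heights with the median height.
    fill = median(p["height_cm"] for p in parsed if p["height_cm"] is not None)
    cleaned = []
    for p in parsed:
        if p["weight_kg"] is None:
            continue  # Clean (assumed policy): remove rows with unusable weight.
        height = p["height_cm"] if p["height_cm"] is not None else fill
        # Construct: derive body mass index = weight (kg) / height (m) squared.
        bmi = p["weight_kg"] / (height / 100) ** 2
        cleaned.append({**p, "height_cm": height, "bmi": round(bmi, 1)})
    return cleaned

prepared = prepare(raw)
for row in prepared:
    print(row)
```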
CRISP-DM
Modeling
Build and assess various models based on several different
modeling techniques
1. Select modeling techniques: Determine which algorithms to try (e.g. regression, neural net).
2. Generate test design: Depending on your modeling approach, you might need to split the data into training,
test, and validation sets.
3. Build model: As glamorous as this might sound, this might just be executing a few lines of code like
“reg = LinearRegression().fit(X, y)”.
4. Assess model: Generally, multiple models are competing against each other, and the data scientist needs
to interpret the model results based on domain knowledge, the pre-defined success criteria, and the test
design.
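
To make the test design, build, and assess tasks concrete, here is a dependency-free sketch. The slide's `LinearRegression().fit(X, y)` appears to refer to scikit-learn; this example swaps in a hand-written one-feature least-squares fit so it runs without extra packages. The synthetic data, the 75/25 split, and the helper names (`fit_line`, `mean_abs_error`) are assumptions for illustration.

```python
import random

def fit_line(xs, ys):
    """Ordinary least squares for a single feature: returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def mean_abs_error(model, xs, ys):
    """Average absolute difference between predictions and targets."""
    slope, intercept = model
    return sum(abs(slope * x + intercept - y) for x, y in zip(xs, ys)) / len(xs)

# Hypothetical data: y = 3x + 2 plus small uniform noise.
rng = random.Random(0)
xs = [float(i) for i in range(40)]
ys = [3.0 * x + 2.0 + rng.uniform(-1.0, 1.0) for x in xs]

# Generate test design: shuffle the indices and hold out 25% for testing.
indices = list(range(len(xs)))
rng.shuffle(indices)
cut = int(len(indices) * 0.75)
train, test = indices[:cut], indices[cut:]

# Build model: fit on the training split only.
model = fit_line([xs[i] for i in train], [ys[i] for i in train])

# Assess model: measure error on the held-out split.
test_mae = mean_abs_error(model, [xs[i] for i in test], [ys[i] for i in test])
print(f"slope={model[0]:.2f} intercept={model[1]:.2f} test MAE={test_mae:.2f}")
```

Evaluating on data the model never saw during fitting is what lets competing models be compared fairly in the assessment task.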
Data Mining
CRISP-DM
Evaluation
Whereas the Assess Model task of the Modeling phase focuses
on technical model assessment, the Evaluation phase looks
more broadly at which model best meets the business needs and what
to do next
1. Evaluate results: Do the models meet the business success criteria? Which one(s) should we approve for
the business?
2. Review process: Review the work accomplished. Was anything overlooked? Were all steps properly
executed? Summarize findings and correct anything if needed.
3. Determine next steps: Based on the previous three tasks, determine whether to proceed to
deployment, iterate further, or initiate new projects.
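
One way to picture the evaluate-results and next-steps tasks is as a check of technical results against business success criteria. Everything in this sketch is hypothetical: the candidate names, their error figures, and the two criteria (`MAX_ACCEPTABLE_MAE`, `REQUIRE_EXPLAINABLE`) are invented for illustration only.

```python
# Hypothetical technical results handed over from the Modeling phase.
candidates = [
    {"name": "linear_regression", "test_mae": 4.2, "explainable": True},
    {"name": "neural_net", "test_mae": 3.1, "explainable": False},
    {"name": "baseline_mean", "test_mae": 9.8, "explainable": True},
]

# Hypothetical business success criteria defined during Business Understanding.
MAX_ACCEPTABLE_MAE = 5.0
REQUIRE_EXPLAINABLE = True

def meets_criteria(result):
    """True if a candidate satisfies every business success criterion."""
    if result["test_mae"] > MAX_ACCEPTABLE_MAE:
        return False
    return result["explainable"] or not REQUIRE_EXPLAINABLE

approved = [c["name"] for c in candidates if meets_criteria(c)]
next_step = "proceed to deployment" if approved else "iterate further"
print(approved, "->", next_step)
```

Note that the technically best model (lowest error) is not approved here because it fails a business criterion, which is the distinction the Evaluation phase draws.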
CRISP-DM
Deployment
A model is not particularly useful unless the customer can
access its results.
1. Plan deployment: Develop and document a plan for deploying the model.
2. Plan monitoring and maintenance: Develop a thorough monitoring and maintenance plan to avoid
issues during the operational phase (or post-project phase) of a model.
3. Produce final report: The project team documents a summary of the project which might include a final
presentation of data mining results.
4. Review project: Conduct a project retrospective about what went well, what could have been better,
and how to improve in the future.
Data Mining Tools
Data Mining - Regression
Price Prediction
Forecasting
Production lead time prediction
Data Mining - Classification
Disease Diagnosis
Fraud Detection
Sentiment Analysis
Predictive Maintenance
Credit Risk Analysis
Etc
Data Mining - Association
Market Basket Analysis
Medicine Basket Analysis
Etc
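
As a rough illustration of what market basket analysis computes, the sketch below derives support and confidence for a single rule. The baskets and the helper `rule_stats` are hypothetical; real projects typically use a dedicated association-rule algorithm rather than counting pairs by hand.

```python
from collections import Counter
from itertools import combinations

# Hypothetical transactions; each basket is a set of purchased items.
baskets = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
    {"bread", "milk"},
]

# Count how often each item and each unordered item pair occurs.
item_counts = Counter(item for b in baskets for item in b)
pair_counts = Counter(
    pair for b in baskets for pair in combinations(sorted(b), 2)
)

def rule_stats(antecedent, consequent):
    """Support and confidence for the rule antecedent -> consequent."""
    pair = tuple(sorted((antecedent, consequent)))
    support = pair_counts[pair] / len(baskets)
    confidence = pair_counts[pair] / item_counts[antecedent]
    return support, confidence

support, confidence = rule_stats("bread", "milk")
print(f"bread -> milk: support={support:.2f}, confidence={confidence:.2f}")
```

Support is the share of all baskets containing both items; confidence is the share of baskets with the antecedent that also contain the consequent.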
Data Mining - Clustering
Customer Segmentation
Network traffic
Price Risk Clustering
Etc
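
Customer segmentation by clustering can be sketched with a tiny one-dimensional k-means. This is an assumed technique chosen for illustration (the slides do not name an algorithm), and the spend values, starting centroids, and function `kmeans_1d` are hypothetical.

```python
def kmeans_1d(values, centroids, iterations=10):
    """Very small k-means on scalar values with fixed starting centroids."""
    clusters = [[] for _ in centroids]
    for _ in range(iterations):
        # Assignment step: put each value in the cluster of its nearest centroid.
        clusters = [[] for _ in centroids]
        for v in values:
            nearest = min(range(len(centroids)), key=lambda i: abs(v - centroids[i]))
            clusters[nearest].append(v)
        # Update step: move each centroid to the mean of its cluster.
        centroids = [
            sum(c) / len(c) if c else centroids[i] for i, c in enumerate(clusters)
        ]
    return centroids, clusters

# Hypothetical monthly spend per customer.
spend = [12, 15, 14, 90, 95, 88, 300, 310]
centroids, segments = kmeans_1d(spend, centroids=[0.0, 100.0, 400.0])
for center, members in zip(centroids, segments):
    print(f"segment around {center:.1f}: {members}")
```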
Data Mining – Data Model