SlideShare a Scribd company logo
1 of 58
Download to read offline
# D A T A
N I G H T
–Eric Schmidt
“The biggest disruptor that we’re sure
about is the arrival of big data and
machine intelligence everywhere.”
–Someone posted this slide on Twitter
“Big data is like teenage sex:
everyone talks about it, nobody really
knows how to do it, everyone thinks
everyone else is doing it, so everyone
claims they are doing it…”
“big data”
attribut 1 attribut 2 attribut 3
client 1
client 2
client 3
attribut 1 attribut 2 attribut 3 …
client 1
client 2
client 3
…
“open data”
nom latitude longitude
station 1
station 2
station 3
…
Big Data 1.0
collecter
stocker
traiter
visualiser
collecter
stocker
traiter
visualiser
collecter!
stocker
traiter
visualiser
“big”
“Ce n’est pas la taille qui compte…”
collecter
stocker
traiter
visualiser
comprendre
Minutes Textos Achats Data Age Churn?
148 72 0 33.6 50 TRUE
85 66 0 26.6 31 FALSE
183 64 0 23.3 32 TRUE
89 66 9.4 28.1 21 FALSE
115 0 0 35.3 29 FALSE
166 72 17.5 25.8 51 TRUE
100 0 0 30 32 TRUE
118 84 23 45.8 31 TRUE
171 110 24 45.4 54 TRUE
159 64 0 27.4 40 FALSE
Big Data 2.0
–Data Science for Business
Once firms have become capable of
processing massive data in a flexible
fashion, they should begin asking: “What
can I now do that I couldn’t do before, or
do better than I could do before?”
–Waqar Hasan, Apigee Insights
“Predictive is the ‘killer app’ for big data.”
–Mike Gualtieri, Principal Analyst at Forrester
“Predictive apps are the next big thing in
app development.”
• “Quel est le sentiment de ce tweet?”
• “Ce client va-t’il nous quitter dans le mois qui
vient?”
• “Cet email est-il du spam?”
=> classification
• “Combien vaut cette maison?”
=> régression
Bedrooms Bathrooms Surface (foot²) Year built Type Price ($)
3 1 860 1950 house 565,000
3 1 1012 1951 house
2 1.5 968 1976 townhouse 447,000
4 1315 1950 house 648,000
3 2 1599 1964 house
3 2 987 1951 townhouse 790,000
1 1 530 2007 condo 122,000
4 2 1574 1964 house 835,000
4 2001 house 855,000
3 2.5 1472 2005 house
4 3.5 1714 2005 townhouse
2 2 1113 1999 condo
1 769 1999 condo 315,000
Bedrooms Bathrooms Surface (foot²) Year built Type Price ($)
3 1 860 1950 house 565,000
3 1 1012 1951 house
2 1.5 968 1976 townhouse 447,000
4 1315 1950 house 648,000
3 2 1599 1964 house
3 2 987 1951 townhouse 790,000
1 1 530 2007 condo 122,000
4 2 1574 1964 house 835,000
4 2001 house 855,000
3 2.5 1472 2005 house
4 3.5 1714 2005 townhouse
2 2 1113 1999 condo
1 769 1999 condo 315,000
Bedrooms Bathrooms Surface (foot²) Year built Type Price ($)
3 1 860 1950 house 565,000
3 1 1012 1951 house
2 1.5 968 1976 townhouse 447,000
4 1315 1950 house 648,000
3 2 1599 1964 house
3 2 987 1951 townhouse 790,000
1 1 530 2007 condo 122,000
4 2 1574 1964 house 835,000
4 2001 house 855,000
3 2.5 1472 2005 house
4 3.5 1714 2005 townhouse
2 2 1113 1999 condo
1 769 1999 condo 315,000
Machine Learning
??
–McKinsey & Co.
“A significant constraint on realizing value
from big data will be a shortage of talent,
particularly of people with deep expertise
in statistics and machine learning.”
HTML / CSS / JavaScript
HTML / CSS / JavaScript
squarespace.com
The two phases of machine learning:
• TRAIN a model
• PREDICT with a model
The two methods of prediction APIs:
• TRAIN a model
• PREDICT with a model
The two methods of prediction APIs:
• model = create_model(dataset)!
• predicted_output

= create_prediction(model, new_input)
Talk Text Purchase
s
Data Age Churn?
148 72 0 33.6 50 TRUE
85 66 0 26.6 31 FALSE
183 64 0 23.3 32 TRUE
89 66 94 28.1 21 FALSE
115 0 0 35.3 29 FALSE
166 72 175 25.8 51 TRUE
100 0 0 30 32 TRUE
118 84 230 45.8 31 TRUE
171 110 240 45.4 54 TRUE
159 64 0 27.4 40 FALSE
click me
–Bret Victor
"Until machine learning is as accessible
and effortless as the word ‘learn,’ it will
never become widespread."
–Dr Kiri L. Wagstaff, Researcher at NASA
“If we can get usable, flexible, dependable
machine learning software into the hands
of domain experts, benefits to society are
bound to follow.”
!
• model = create_model(dataset)!
• predicted_output

= create_prediction(model, new_input)
www.louisdorard.com

More Related Content

Viewers also liked

Data Summit Brussels: Introduction
Data Summit Brussels: IntroductionData Summit Brussels: Introduction
Data Summit Brussels: IntroductionLouis Dorard
 
API, WhizzML and Apps
API, WhizzML and AppsAPI, WhizzML and Apps
API, WhizzML and AppsBigML, Inc
 
Future of AI-powered automation in business
Future of AI-powered automation in businessFuture of AI-powered automation in business
Future of AI-powered automation in businessLouis Dorard
 
Presentation yoann duriaux g21 ch
Presentation yoann duriaux g21 chPresentation yoann duriaux g21 ch
Presentation yoann duriaux g21 chYoann Duriaux
 
Open Data en France : Acteurs - Projets - Tendances
Open Data en France : Acteurs - Projets - TendancesOpen Data en France : Acteurs - Projets - Tendances
Open Data en France : Acteurs - Projets - TendancesGroupe Serda
 
Gestion et archivage des contenus audiovisuels : Enjeux, besoin et offre
Gestion et archivage des contenus audiovisuels : Enjeux, besoin et offreGestion et archivage des contenus audiovisuels : Enjeux, besoin et offre
Gestion et archivage des contenus audiovisuels : Enjeux, besoin et offreGroupe Serda
 
Alléger la Ville - Des stratégies de lieux partagés
Alléger la Ville - Des stratégies de lieux partagés Alléger la Ville - Des stratégies de lieux partagés
Alléger la Ville - Des stratégies de lieux partagés Fing
 
Toucan Toco: Les objets connectés au service de la création de valeur pour le...
Toucan Toco: Les objets connectés au service de la création de valeur pour le...Toucan Toco: Les objets connectés au service de la création de valeur pour le...
Toucan Toco: Les objets connectés au service de la création de valeur pour le...Toucan Toco
 

Viewers also liked (8)

Data Summit Brussels: Introduction
Data Summit Brussels: IntroductionData Summit Brussels: Introduction
Data Summit Brussels: Introduction
 
API, WhizzML and Apps
API, WhizzML and AppsAPI, WhizzML and Apps
API, WhizzML and Apps
 
Future of AI-powered automation in business
Future of AI-powered automation in businessFuture of AI-powered automation in business
Future of AI-powered automation in business
 
Presentation yoann duriaux g21 ch
Presentation yoann duriaux g21 chPresentation yoann duriaux g21 ch
Presentation yoann duriaux g21 ch
 
Open Data en France : Acteurs - Projets - Tendances
Open Data en France : Acteurs - Projets - TendancesOpen Data en France : Acteurs - Projets - Tendances
Open Data en France : Acteurs - Projets - Tendances
 
Gestion et archivage des contenus audiovisuels : Enjeux, besoin et offre
Gestion et archivage des contenus audiovisuels : Enjeux, besoin et offreGestion et archivage des contenus audiovisuels : Enjeux, besoin et offre
Gestion et archivage des contenus audiovisuels : Enjeux, besoin et offre
 
Alléger la Ville - Des stratégies de lieux partagés
Alléger la Ville - Des stratégies de lieux partagés Alléger la Ville - Des stratégies de lieux partagés
Alléger la Ville - Des stratégies de lieux partagés
 
Toucan Toco: Les objets connectés au service de la création de valeur pour le...
Toucan Toco: Les objets connectés au service de la création de valeur pour le...Toucan Toco: Les objets connectés au service de la création de valeur pour le...
Toucan Toco: Les objets connectés au service de la création de valeur pour le...
 

Similar to Big Data 2.0

Predictive apps for startups
Predictive apps for startupsPredictive apps for startups
Predictive apps for startupsLouis Dorard
 
Why And How To Leverage Predictive APIs In Any Application
Why And How To Leverage Predictive APIs In Any Application Why And How To Leverage Predictive APIs In Any Application
Why And How To Leverage Predictive APIs In Any Application ProgrammableWeb
 
Predictive APIs at APIdays Berlin
Predictive APIs at APIdays BerlinPredictive APIs at APIdays Berlin
Predictive APIs at APIdays BerlinLouis Dorard
 
Exponential Convergence 2018
Exponential Convergence 2018Exponential Convergence 2018
Exponential Convergence 2018Azeem Azhar
 
Advancing the web without breaking it - #btconf
Advancing the web without breaking it - #btconfAdvancing the web without breaking it - #btconf
Advancing the web without breaking it - #btconfChristian Heilmann
 
Targeting Your Audience: Data Visualization to Communicate Data Insights
Targeting Your Audience: Data Visualization to Communicate Data InsightsTargeting Your Audience: Data Visualization to Communicate Data Insights
Targeting Your Audience: Data Visualization to Communicate Data InsightsRandy Krum
 
DutchMLSchool 2022 - End-to-End ML
DutchMLSchool 2022 - End-to-End MLDutchMLSchool 2022 - End-to-End ML
DutchMLSchool 2022 - End-to-End MLBigML, Inc
 
Epic Win - Why Gaming is the Future of Learning
Epic Win - Why Gaming is the Future of LearningEpic Win - Why Gaming is the Future of Learning
Epic Win - Why Gaming is the Future of LearningJane McGonigal
 
Global metaverse the digitization of everything
Global metaverse the digitization of everythingGlobal metaverse the digitization of everything
Global metaverse the digitization of everything尹思哲
 
Mobile World Congress 2011 - MWC
Mobile World Congress 2011 - MWCMobile World Congress 2011 - MWC
Mobile World Congress 2011 - MWCStephen Gay
 
Demystifying blockchain Dec'18
Demystifying blockchain Dec'18Demystifying blockchain Dec'18
Demystifying blockchain Dec'18Mayank Jain
 
Clean Cube Pick Deck
Clean Cube Pick DeckClean Cube Pick Deck
Clean Cube Pick DeckRyan Agran
 
Schaffner Quantum Computing and Cryptography.pptx
Schaffner Quantum Computing and Cryptography.pptxSchaffner Quantum Computing and Cryptography.pptx
Schaffner Quantum Computing and Cryptography.pptxsanta142869
 
Sixt vision 2030 vincent everts presentation
Sixt vision 2030 vincent everts presentation  Sixt vision 2030 vincent everts presentation
Sixt vision 2030 vincent everts presentation Vincent Everts
 
Chris Bishop - Keynote - Texas STEAM Summit - Jan. 13, 2017
Chris Bishop - Keynote - Texas STEAM Summit - Jan. 13, 2017Chris Bishop - Keynote - Texas STEAM Summit - Jan. 13, 2017
Chris Bishop - Keynote - Texas STEAM Summit - Jan. 13, 2017Christopher Bishop
 

Similar to Big Data 2.0 (20)

Predictive apps for startups
Predictive apps for startupsPredictive apps for startups
Predictive apps for startups
 
Why And How To Leverage Predictive APIs In Any Application
Why And How To Leverage Predictive APIs In Any Application Why And How To Leverage Predictive APIs In Any Application
Why And How To Leverage Predictive APIs In Any Application
 
Predictive APIs at APIdays Berlin
Predictive APIs at APIdays BerlinPredictive APIs at APIdays Berlin
Predictive APIs at APIdays Berlin
 
Exponential Convergence 2018
Exponential Convergence 2018Exponential Convergence 2018
Exponential Convergence 2018
 
Advancing the web without breaking it - #btconf
Advancing the web without breaking it - #btconfAdvancing the web without breaking it - #btconf
Advancing the web without breaking it - #btconf
 
Targeting Your Audience: Data Visualization to Communicate Data Insights
Targeting Your Audience: Data Visualization to Communicate Data InsightsTargeting Your Audience: Data Visualization to Communicate Data Insights
Targeting Your Audience: Data Visualization to Communicate Data Insights
 
DutchMLSchool 2022 - End-to-End ML
DutchMLSchool 2022 - End-to-End MLDutchMLSchool 2022 - End-to-End ML
DutchMLSchool 2022 - End-to-End ML
 
AI Is a Two-Edged Sword
AI Is a Two-Edged SwordAI Is a Two-Edged Sword
AI Is a Two-Edged Sword
 
Epic Win - Why Gaming is the Future of Learning
Epic Win - Why Gaming is the Future of LearningEpic Win - Why Gaming is the Future of Learning
Epic Win - Why Gaming is the Future of Learning
 
Global metaverse the digitization of everything
Global metaverse the digitization of everythingGlobal metaverse the digitization of everything
Global metaverse the digitization of everything
 
Digital signatures
Digital signaturesDigital signatures
Digital signatures
 
Mobile World Congress 2011 - MWC
Mobile World Congress 2011 - MWCMobile World Congress 2011 - MWC
Mobile World Congress 2011 - MWC
 
Eap in 2028 – is employee assistance “tech proof”?
Eap in 2028 – is employee assistance “tech proof”?Eap in 2028 – is employee assistance “tech proof”?
Eap in 2028 – is employee assistance “tech proof”?
 
The Age of Data and AI - Derek Reedy, IBM HyperBlue Fund
The Age of Data and AI - Derek Reedy, IBM HyperBlue FundThe Age of Data and AI - Derek Reedy, IBM HyperBlue Fund
The Age of Data and AI - Derek Reedy, IBM HyperBlue Fund
 
AI and Blockchain
AI and BlockchainAI and Blockchain
AI and Blockchain
 
Demystifying blockchain Dec'18
Demystifying blockchain Dec'18Demystifying blockchain Dec'18
Demystifying blockchain Dec'18
 
Clean Cube Pick Deck
Clean Cube Pick DeckClean Cube Pick Deck
Clean Cube Pick Deck
 
Schaffner Quantum Computing and Cryptography.pptx
Schaffner Quantum Computing and Cryptography.pptxSchaffner Quantum Computing and Cryptography.pptx
Schaffner Quantum Computing and Cryptography.pptx
 
Sixt vision 2030 vincent everts presentation
Sixt vision 2030 vincent everts presentation  Sixt vision 2030 vincent everts presentation
Sixt vision 2030 vincent everts presentation
 
Chris Bishop - Keynote - Texas STEAM Summit - Jan. 13, 2017
Chris Bishop - Keynote - Texas STEAM Summit - Jan. 13, 2017Chris Bishop - Keynote - Texas STEAM Summit - Jan. 13, 2017
Chris Bishop - Keynote - Texas STEAM Summit - Jan. 13, 2017
 

Recently uploaded

Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfchwongval
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsVICTOR MAESTRE RAMIREZ
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceSapana Sha
 
Machine learning classification ppt.ppt
Machine learning classification  ppt.pptMachine learning classification  ppt.ppt
Machine learning classification ppt.pptamreenkhanum0307
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxBoston Institute of Analytics
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...Amil Baba Dawood bangali
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectBoston Institute of Analytics
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 

Recently uploaded (20)

Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdf
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts Service
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
Machine learning classification ppt.ppt
Machine learning classification  ppt.pptMachine learning classification  ppt.ppt
Machine learning classification ppt.ppt
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis Project
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 

Big Data 2.0

  • 1. # D A T A N I G H T
  • 2. –Eric Schmidt “The biggest disruptor that we’re sure about is the arrival of big data and machine intelligence everywhere.”
  • 3. –Someone posted this slide on Twitter “Big data is like teenage sex: everyone talks about it, nobody really knows how to do it, everyone thinks everyone else is doing it, so everyone claims they are doing it…”
  • 5. attribut 1 attribut 2 attribut 3 client 1 client 2 client 3
  • 6. attribut 1 attribut 2 attribut 3 … client 1 client 2 client 3 …
  • 8. nom latitude longitude station 1 station 2 station 3 …
  • 11.
  • 15.
  • 16. “Ce n’est pas la taille qui compte…”
  • 17.
  • 19. Minutes Textos Achats Data Age Churn? 148 72 0 33.6 50 TRUE 85 66 0 26.6 31 FALSE 183 64 0 23.3 32 TRUE 89 66 9.4 28.1 21 FALSE 115 0 0 35.3 29 FALSE 166 72 17.5 25.8 51 TRUE 100 0 0 30 32 TRUE 118 84 23 45.8 31 TRUE 171 110 24 45.4 54 TRUE 159 64 0 27.4 40 FALSE
  • 21. –Data Science for Business Once firms have become capable of processing massive data in a flexible fashion, they should begin asking: “What can I now do that I couldn’t do before, or do better than I could do before?”
  • 22. –Waqar Hasan, Apigee Insights “Predictive is the ‘killer app’ for big data.”
  • 23. –Mike Gualtieri, Principal Analyst at Forrester “Predictive apps are the next big thing in app development.”
  • 24.
  • 25. • “Quel est le sentiment de ce tweet?” • “Ce client va-t’il nous quitter dans le mois qui vient?” • “Cet email est-il du spam?” => classification
  • 26.
  • 27. • “Combien vaut cette maison?” => régression
  • 28. Bedrooms Bathrooms Surface (foot²) Year built Type Price ($) 3 1 860 1950 house 565,000 3 1 1012 1951 house 2 1.5 968 1976 townhouse 447,000 4 1315 1950 house 648,000 3 2 1599 1964 house 3 2 987 1951 townhouse 790,000 1 1 530 2007 condo 122,000 4 2 1574 1964 house 835,000 4 2001 house 855,000 3 2.5 1472 2005 house 4 3.5 1714 2005 townhouse 2 2 1113 1999 condo 1 769 1999 condo 315,000
  • 29. Bedrooms Bathrooms Surface (foot²) Year built Type Price ($) 3 1 860 1950 house 565,000 3 1 1012 1951 house 2 1.5 968 1976 townhouse 447,000 4 1315 1950 house 648,000 3 2 1599 1964 house 3 2 987 1951 townhouse 790,000 1 1 530 2007 condo 122,000 4 2 1574 1964 house 835,000 4 2001 house 855,000 3 2.5 1472 2005 house 4 3.5 1714 2005 townhouse 2 2 1113 1999 condo 1 769 1999 condo 315,000
  • 30.
  • 31. Bedrooms Bathrooms Surface (foot²) Year built Type Price ($) 3 1 860 1950 house 565,000 3 1 1012 1951 house 2 1.5 968 1976 townhouse 447,000 4 1315 1950 house 648,000 3 2 1599 1964 house 3 2 987 1951 townhouse 790,000 1 1 530 2007 condo 122,000 4 2 1574 1964 house 835,000 4 2001 house 855,000 3 2.5 1472 2005 house 4 3.5 1714 2005 townhouse 2 2 1113 1999 condo 1 769 1999 condo 315,000
  • 33.
  • 34. ??
  • 35. –McKinsey & Co. “A significant constraint on realizing value from big data will be a shortage of talent, particularly of people with deep expertise in statistics and machine learning.”
  • 36.
  • 37.
  • 38.
  • 39.
  • 40.
  • 41.
  • 42. HTML / CSS / JavaScript
  • 43. HTML / CSS / JavaScript
  • 45.
  • 46.
  • 47. The two phases of machine learning: • TRAIN a model • PREDICT with a model
  • 48. The two methods of prediction APIs: • TRAIN a model • PREDICT with a model
  • 49. The two methods of prediction APIs: • model = create_model(dataset)! • predicted_output
 = create_prediction(model, new_input)
  • 50.
  • 51. Talk Text Purchase s Data Age Churn? 148 72 0 33.6 50 TRUE 85 66 0 26.6 31 FALSE 183 64 0 23.3 32 TRUE 89 66 94 28.1 21 FALSE 115 0 0 35.3 29 FALSE 166 72 175 25.8 51 TRUE 100 0 0 30 32 TRUE 118 84 230 45.8 31 TRUE 171 110 240 45.4 54 TRUE 159 64 0 27.4 40 FALSE click me
  • 52.
  • 53. –Bret Victor "Until machine learning is as accessible and effortless as the word ‘learn,’ it will never become widespread."
  • 54. –Dr Kiri L. Wagstaff, Researcher at NASA “If we can get usable, flexible, dependable machine learning software into the hands of domain experts, benefits to society are bound to follow.”
  • 55.
  • 56.
  • 57. ! • model = create_model(dataset)! • predicted_output
 = create_prediction(model, new_input)