SlideShare une entreprise Scribd logo
1  sur  28
Télécharger pour lire hors ligne
Fantastic Problems
Daryl Weir_
Senior Data Scientist_
And Where to Find Them
Founded in 2000
400+ employees from 22 countries
8th year in a row profitable growth
YOY growth 30%
Helsinki
Tampere
Stockholm
Berlin
Munich
London
We help our customers
succeed in digital
business_
Daryl
Weir_
Senior Data
Scientist
PhD Computer Science 

University of Glasgow
Research Intern

Microsoft Finland
Postdoc Researcher

Aalto University
Data Scientist

Futurice
daryl.weir@futurice.com

@darylweir
2010-2014

2014

2014-2016

2016-Now
Machine Learning Hype_
AI, big data, machine learning… These have been
buzz words for years now
This talk tries to answer two key questions:
- What can machine learning actually do?
- Should I use machine learning to solve my
problem?
AI/Machine Learning is not Skynet
(yet…)
So what is it?_
Machine learning is really good at
answering narrow questions
It does this by learning from examples
The most common method is supervised
learning, where problems look like:
Given some input A, predict an output B
By choosing the right A and B, you can do
some amazing stuff
A—>B
For example_
Baby’s first self driving car
Narrow question: given the road looks
like this, how should I turn the steering
wheel to not crash?
Input: camera image of the road
Output: steering wheel angle
(CMU did this in 1995 - the car drove
3000 miles across the US)
But what kind of problem should we
look for?_
“If a typical person can do a
mental task with less than one
second of thought, we can
probably automate it using AI”
- Andrew Ng
A (somewhat trite) counterexample_
18
27
131
94
322
1061
Are these numbers odd or even?
You can (hopefully) judge this in under a
second
But many popular machine learning
algorithms are terrible at learning this
(from the numbers alone)
Not all problems that can be tackled with
ML should be
You might have a machine learning
problem if:
1 - writing down a set of rules is hard
BUT
2 - gathering examples is easy
My rule of thumb_
Is there a cat in this image?_
Writing rules for the presence or absence
of a cat is really, really hard
BUT
Finding cat pictures is really, really easy
Narrowing the focus_
What kind of things can we do?
Predict the future Uncover hidden structureRecommend content
Predicting the Future_
Scenario_
We have lots of historical
examples of system state,
each one labelled by its
outcome
Given a new example of the
state, we try to predict the
outcome
Such problems fall into two
main types: regression and
classification
Regression problems are
those where the outcome is a
number, for example:
— how many potatoes will my
supermarket sell next week?
— what’s the optimal price to
sell this stock?
— what speed should I drive
given the weather & traffic?
Classification problems are
those where the outcome is
one of a fixed set, for example:
— whose face is in this image?
— is this credit card
transaction fraudulent?
— do these scan results
indicate cancer?
Case: AirBnB_
AirBnB is a great example of a
company oriented around data
They blog regularly about data
science and open source some
of their tools
One problem they’ve talked
about: given the user’s search
term, should we show a given
listing in the results?
Basic distance based search
for San Francisco shows
many rentals in Oakland
Users were likely to find
these results irrelevant
Bayesian model estimates the
probability of booking each
listing given a specific search
This sped up the booking
process and significantly
increased conversion rate
Case: Railways_
Two buzz words for the price
of one: machine learning + IoT!
Modern trains and rails are
highly instrumented with
sensors (motion, sound,
pressure, etc.)
Using data from these, we can
classify whether or not a part
(e.g. a wheel or door) will fail
within a given time
Lets the operator move from
scheduled to on-demand
maintenance/replacement
VR Group estimates this cuts
amount of wheel maintenance
by around a third
Bonus inception: sometimes
the sensor themselves start
giving incorrect readings
Another classifier can detect
these faults before they
suggest phantom breakdowns
Recommending Content_
Scenario_
Many businesses depend on
content of some sort, e.g.:
— articles and videos
— products in a store
— ads
It is often valuable to
personalise the content shown
to your users
There are two main methods to
predict content users will like
Collaborative filters suggest
things other users have liked
The preferences of others are
weighted by how similar they
are to you
— “Customers also bought…”
— Spotify Discover Weekly
Content based recommenders
suggest, unsurprisingly, based
on the actual content
— these articles have similar
topics to your previous reads
— more videos from channels
you watched
But it’s not always easy/
possible to define similarity
between pieces of content
Case: Netflix_
Netflix is one of the most well
known users of recommender
systems
Like AirBnB, they blog
regularly and have some open
source tools available
Good recommendations
increase user satisfaction,
encourage continued use, and
drive users to the ‘long tail’
Famously offered a $1M prize
to anyone who could improve
their recommender by 10%
Winning solution was never
used!
Recently changed from star
based ratings to a thumbs up/
down system
Better UX, avoids differences
in users’ rating systems
Uncovering Hidden Structure_
Scenario_
What if we don’t have labels
for our historical states?
Another major ML application
area is automatically detecting
recurring patterns in data
This is called unsupervised
learning - it’s about learning
what’s interesting about some
given data
This lets us ask different sorts
of questions, like:
— how do players of my game
group together based on
play behaviour?
— is this spending behaviour
unusual given someone’s past
purchases?
— what are the common
topics in this collection of
documents?
Case: Google News_
Google has been very vocal
about its transformation into “a
machine learning company”
As of 2016, ~10% of their
engineers have ML experience
and internal training is growing
They have stated a goal of
systematically applying
machine learning in all their
business areas
The rankings of different
topics and outlets can be
tailored by the user explicitly
The service can also learn user
preferences from usage data
Google News automatically
groups versions of the same
story from different outlets
Stories are also grouped to
form news categories
Trying it out_
Starting out with ML_
Only for masochistsDesigned with data analysis in
mind
Amazing library and
community support
Not general purpose
Excellent tooling - try the
Anaconda distribution
Jupyter notebooks allow mixed
code and Markdown
Can be slow
The real third option_
Fast, powerful language with
excellent support for really big
datasets (via Spark, Hadoop)
Highly parallel
Not very beginner friendly
Designed with data analysis in
mind
Amazing library and
community support
Not general purpose
Excellent tooling - try the
Anaconda distribution
Jupyter notebooks allow mixed
code and Markdown
Can be slow
Resources_
“I want tutorials and interesting blog posts pitched
at beginners (and I don’t mind Python)”
github.com/hangtwenty/dive-into-machine-learning
“Python sucks, show me resources for machine
learning in <my language of choice>”
github.com/josephmisiti/awesome-machine-learning
“Coding sucks, I just want to read a book”
‘The Master Algorithm’ by Pedro Domingos
If you remember 3 things_
1) Machine learning answers narrow questions
2) Look for complex rules with plentiful examples
3) Don’t use Matlab
Thanks for listening!

daryl.weir@futurice.com

@darylweir

Contenu connexe

Tendances

7 Inspiring Examples of Augmented Reality in the Business World
7 Inspiring Examples of Augmented Reality in the Business World 7 Inspiring Examples of Augmented Reality in the Business World
7 Inspiring Examples of Augmented Reality in the Business World Pixel Crayons
 
Sentinel Report - Q2 2015
Sentinel Report - Q2 2015Sentinel Report - Q2 2015
Sentinel Report - Q2 2015Globant
 
Paybook Vol. IX presented by Verifone
Paybook Vol. IX presented by VerifonePaybook Vol. IX presented by Verifone
Paybook Vol. IX presented by VerifoneVerifone
 
DMI – World Gaming Executive Summit
DMI – World Gaming Executive SummitDMI – World Gaming Executive Summit
DMI – World Gaming Executive SummitDMI
 
Future of Payments for Debit, Mobile Payments, Prepaid & ATM Execs
Future of Payments for Debit, Mobile Payments, Prepaid & ATM ExecsFuture of Payments for Debit, Mobile Payments, Prepaid & ATM Execs
Future of Payments for Debit, Mobile Payments, Prepaid & ATM Execspaythink2015
 
Summary artificial intelligence in practice- part-4
Summary  artificial intelligence in practice- part-4Summary  artificial intelligence in practice- part-4
Summary artificial intelligence in practice- part-4GMR Group
 
Digital Disruption - Disrupt or be Disrupted
Digital Disruption - Disrupt or be DisruptedDigital Disruption - Disrupt or be Disrupted
Digital Disruption - Disrupt or be DisruptedMaria Schmidt
 
CES 2014 Review: 12 Principles & What Matters for Marketers
CES 2014 Review: 12 Principles & What Matters for MarketersCES 2014 Review: 12 Principles & What Matters for Marketers
CES 2014 Review: 12 Principles & What Matters for MarketersDavid Berkowitz
 
2015 05 NO INTERFACE
2015 05 NO INTERFACE2015 05 NO INTERFACE
2015 05 NO INTERFACETrendWatching
 
Summary artificial intelligence in practice- part-2
Summary  artificial intelligence in practice- part-2Summary  artificial intelligence in practice- part-2
Summary artificial intelligence in practice- part-2GMR Group
 
Augmented Humanity: Isobar Trends Report 2018
Augmented Humanity: Isobar Trends Report 2018Augmented Humanity: Isobar Trends Report 2018
Augmented Humanity: Isobar Trends Report 2018Isobar
 
Competing for the Future: Iteration vs. Innovation by Brian Solis
Competing for the Future: Iteration vs. Innovation by Brian SolisCompeting for the Future: Iteration vs. Innovation by Brian Solis
Competing for the Future: Iteration vs. Innovation by Brian SolisBrian Solis
 
ListenLogic Solution Overview
ListenLogic Solution OverviewListenLogic Solution Overview
ListenLogic Solution Overviewkglacken
 
Sentinel Report - Q3 2015
Sentinel Report  - Q3 2015Sentinel Report  - Q3 2015
Sentinel Report - Q3 2015Globant
 
Changing customer behaviour and how we as ABN AMRO are adapting - Jessica Nie...
Changing customer behaviour and how we as ABN AMRO are adapting - Jessica Nie...Changing customer behaviour and how we as ABN AMRO are adapting - Jessica Nie...
Changing customer behaviour and how we as ABN AMRO are adapting - Jessica Nie...ABN AMRO
 
Digital disruption how to transform banking
Digital disruption how to transform bankingDigital disruption how to transform banking
Digital disruption how to transform bankingJo Caudron
 
Fjord Trends 2017 - Highlights
Fjord Trends 2017 - HighlightsFjord Trends 2017 - Highlights
Fjord Trends 2017 - HighlightsKiran Kumar
 
J Schwan
J SchwanJ Schwan
J SchwandaveGBE
 

Tendances (20)

7 Inspiring Examples of Augmented Reality in the Business World
7 Inspiring Examples of Augmented Reality in the Business World 7 Inspiring Examples of Augmented Reality in the Business World
7 Inspiring Examples of Augmented Reality in the Business World
 
Sentinel Report - Q2 2015
Sentinel Report - Q2 2015Sentinel Report - Q2 2015
Sentinel Report - Q2 2015
 
Paybook Vol. IX presented by Verifone
Paybook Vol. IX presented by VerifonePaybook Vol. IX presented by Verifone
Paybook Vol. IX presented by Verifone
 
DMI – World Gaming Executive Summit
DMI – World Gaming Executive SummitDMI – World Gaming Executive Summit
DMI – World Gaming Executive Summit
 
Future of Payments for Debit, Mobile Payments, Prepaid & ATM Execs
Future of Payments for Debit, Mobile Payments, Prepaid & ATM ExecsFuture of Payments for Debit, Mobile Payments, Prepaid & ATM Execs
Future of Payments for Debit, Mobile Payments, Prepaid & ATM Execs
 
Cognitive Future In Customer Engagement
Cognitive Future In Customer EngagementCognitive Future In Customer Engagement
Cognitive Future In Customer Engagement
 
Summary artificial intelligence in practice- part-4
Summary  artificial intelligence in practice- part-4Summary  artificial intelligence in practice- part-4
Summary artificial intelligence in practice- part-4
 
Digital Disruption - Disrupt or be Disrupted
Digital Disruption - Disrupt or be DisruptedDigital Disruption - Disrupt or be Disrupted
Digital Disruption - Disrupt or be Disrupted
 
CES 2014 Review: 12 Principles & What Matters for Marketers
CES 2014 Review: 12 Principles & What Matters for MarketersCES 2014 Review: 12 Principles & What Matters for Marketers
CES 2014 Review: 12 Principles & What Matters for Marketers
 
2015 05 NO INTERFACE
2015 05 NO INTERFACE2015 05 NO INTERFACE
2015 05 NO INTERFACE
 
Radio shack final slide
Radio shack final slideRadio shack final slide
Radio shack final slide
 
Summary artificial intelligence in practice- part-2
Summary  artificial intelligence in practice- part-2Summary  artificial intelligence in practice- part-2
Summary artificial intelligence in practice- part-2
 
Augmented Humanity: Isobar Trends Report 2018
Augmented Humanity: Isobar Trends Report 2018Augmented Humanity: Isobar Trends Report 2018
Augmented Humanity: Isobar Trends Report 2018
 
Competing for the Future: Iteration vs. Innovation by Brian Solis
Competing for the Future: Iteration vs. Innovation by Brian SolisCompeting for the Future: Iteration vs. Innovation by Brian Solis
Competing for the Future: Iteration vs. Innovation by Brian Solis
 
ListenLogic Solution Overview
ListenLogic Solution OverviewListenLogic Solution Overview
ListenLogic Solution Overview
 
Sentinel Report - Q3 2015
Sentinel Report  - Q3 2015Sentinel Report  - Q3 2015
Sentinel Report - Q3 2015
 
Changing customer behaviour and how we as ABN AMRO are adapting - Jessica Nie...
Changing customer behaviour and how we as ABN AMRO are adapting - Jessica Nie...Changing customer behaviour and how we as ABN AMRO are adapting - Jessica Nie...
Changing customer behaviour and how we as ABN AMRO are adapting - Jessica Nie...
 
Digital disruption how to transform banking
Digital disruption how to transform bankingDigital disruption how to transform banking
Digital disruption how to transform banking
 
Fjord Trends 2017 - Highlights
Fjord Trends 2017 - HighlightsFjord Trends 2017 - Highlights
Fjord Trends 2017 - Highlights
 
J Schwan
J SchwanJ Schwan
J Schwan
 

En vedette

Pepperoni – A framework for spicing up your mobile apps with React Native
Pepperoni – A framework for spicing up your mobile apps with React NativePepperoni – A framework for spicing up your mobile apps with React Native
Pepperoni – A framework for spicing up your mobile apps with React NativeFuturice
 
A Retrospective on React Native - Lessons Learned
A Retrospective on React Native - Lessons LearnedA Retrospective on React Native - Lessons Learned
A Retrospective on React Native - Lessons LearnedMatthieu Vachon
 
Testing Without Waste - Automatic Testing
Testing Without Waste - Automatic TestingTesting Without Waste - Automatic Testing
Testing Without Waste - Automatic TestingFuturice
 
Reark : a Reference Architecture for Android using RxJava
Reark : a Reference Architecture for Android using RxJavaReark : a Reference Architecture for Android using RxJava
Reark : a Reference Architecture for Android using RxJavaFuturice
 
Project thinking and digital service business
Project thinking and digital service businessProject thinking and digital service business
Project thinking and digital service businessFuturice
 
Agile Software Security
Agile Software SecurityAgile Software Security
Agile Software SecurityFuturice
 
Markus Paasovaara: Face recognition
Markus Paasovaara: Face recognitionMarkus Paasovaara: Face recognition
Markus Paasovaara: Face recognitionFuturice
 

En vedette (7)

Pepperoni – A framework for spicing up your mobile apps with React Native
Pepperoni – A framework for spicing up your mobile apps with React NativePepperoni – A framework for spicing up your mobile apps with React Native
Pepperoni – A framework for spicing up your mobile apps with React Native
 
A Retrospective on React Native - Lessons Learned
A Retrospective on React Native - Lessons LearnedA Retrospective on React Native - Lessons Learned
A Retrospective on React Native - Lessons Learned
 
Testing Without Waste - Automatic Testing
Testing Without Waste - Automatic TestingTesting Without Waste - Automatic Testing
Testing Without Waste - Automatic Testing
 
Reark : a Reference Architecture for Android using RxJava
Reark : a Reference Architecture for Android using RxJavaReark : a Reference Architecture for Android using RxJava
Reark : a Reference Architecture for Android using RxJava
 
Project thinking and digital service business
Project thinking and digital service businessProject thinking and digital service business
Project thinking and digital service business
 
Agile Software Security
Agile Software SecurityAgile Software Security
Agile Software Security
 
Markus Paasovaara: Face recognition
Markus Paasovaara: Face recognitionMarkus Paasovaara: Face recognition
Markus Paasovaara: Face recognition
 

Similaire à Fantastic Problems and Where to Find Them: Daryl Weir

Machine Learning with Azure and Databricks Virtual Workshop
Machine Learning with Azure and Databricks Virtual WorkshopMachine Learning with Azure and Databricks Virtual Workshop
Machine Learning with Azure and Databricks Virtual WorkshopCCG
 
Machine learning at b.e.s.t. summer university
Machine learning  at b.e.s.t. summer universityMachine learning  at b.e.s.t. summer university
Machine learning at b.e.s.t. summer universityLászló Kovács
 
Business analytics Project.docx
Business analytics Project.docxBusiness analytics Project.docx
Business analytics Project.docxkushi62
 
Better Living Through Analytics - Strategies for Data Decisions
Better Living Through Analytics - Strategies for Data DecisionsBetter Living Through Analytics - Strategies for Data Decisions
Better Living Through Analytics - Strategies for Data DecisionsProduct School
 
data analytics lecture5.pptx
data analytics lecture5.pptxdata analytics lecture5.pptx
data analytics lecture5.pptxNamrataBhatt8
 
Predictive Analytics: An Executive Primer
Predictive Analytics: An Executive PrimerPredictive Analytics: An Executive Primer
Predictive Analytics: An Executive PrimerRyan Withop
 
Data model scorecard (Article 5 of 11)
Data model scorecard (Article 5 of 11)Data model scorecard (Article 5 of 11)
Data model scorecard (Article 5 of 11)Analytics8
 
Modelling for decisions
Modelling for decisionsModelling for decisions
Modelling for decisionscoppeliamla
 
Data science tutorial
Data science tutorialData science tutorial
Data science tutorialAakashdata
 
Big data analytics presented at meetup big data for decision makers
Big data analytics presented at meetup big data for decision makersBig data analytics presented at meetup big data for decision makers
Big data analytics presented at meetup big data for decision makersRuhollah Farchtchi
 
How To Start A Reflective Essay 8 Tips For Writing A
How To Start A Reflective Essay 8 Tips For Writing AHow To Start A Reflective Essay 8 Tips For Writing A
How To Start A Reflective Essay 8 Tips For Writing ALisa Martinez
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial IntelligenceEnes Bolfidan
 
Narrative Mind Week 6 H4D Stanford 2016
Narrative Mind Week 6 H4D Stanford 2016Narrative Mind Week 6 H4D Stanford 2016
Narrative Mind Week 6 H4D Stanford 2016Stanford University
 
Lightning Talks: An Innovation Showcase
Lightning Talks: An Innovation ShowcaseLightning Talks: An Innovation Showcase
Lightning Talks: An Innovation ShowcaseSomo
 
How to Use Data to Inform Your Design and Drive Your Business
How to Use Data to Inform Your Design and Drive Your BusinessHow to Use Data to Inform Your Design and Drive Your Business
How to Use Data to Inform Your Design and Drive Your BusinessKissmetrics on SlideShare
 
AI Orange Belt - Session 2
AI Orange Belt - Session 2AI Orange Belt - Session 2
AI Orange Belt - Session 2AI Black Belt
 
data analytics lecture2.pptx
data analytics lecture2.pptxdata analytics lecture2.pptx
data analytics lecture2.pptxNamrataBhatt8
 

Similaire à Fantastic Problems and Where to Find Them: Daryl Weir (20)

Machine Learning with Azure and Databricks Virtual Workshop
Machine Learning with Azure and Databricks Virtual WorkshopMachine Learning with Azure and Databricks Virtual Workshop
Machine Learning with Azure and Databricks Virtual Workshop
 
Machine learning
Machine learningMachine learning
Machine learning
 
Machine learning at b.e.s.t. summer university
Machine learning  at b.e.s.t. summer universityMachine learning  at b.e.s.t. summer university
Machine learning at b.e.s.t. summer university
 
Business analytics Project.docx
Business analytics Project.docxBusiness analytics Project.docx
Business analytics Project.docx
 
Better Living Through Analytics - Strategies for Data Decisions
Better Living Through Analytics - Strategies for Data DecisionsBetter Living Through Analytics - Strategies for Data Decisions
Better Living Through Analytics - Strategies for Data Decisions
 
data analytics lecture5.pptx
data analytics lecture5.pptxdata analytics lecture5.pptx
data analytics lecture5.pptx
 
Predictive Analytics: An Executive Primer
Predictive Analytics: An Executive PrimerPredictive Analytics: An Executive Primer
Predictive Analytics: An Executive Primer
 
Data model scorecard (Article 5 of 11)
Data model scorecard (Article 5 of 11)Data model scorecard (Article 5 of 11)
Data model scorecard (Article 5 of 11)
 
Modelling for decisions
Modelling for decisionsModelling for decisions
Modelling for decisions
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Data science tutorial
Data science tutorialData science tutorial
Data science tutorial
 
Big data analytics presented at meetup big data for decision makers
Big data analytics presented at meetup big data for decision makersBig data analytics presented at meetup big data for decision makers
Big data analytics presented at meetup big data for decision makers
 
How To Start A Reflective Essay 8 Tips For Writing A
How To Start A Reflective Essay 8 Tips For Writing AHow To Start A Reflective Essay 8 Tips For Writing A
How To Start A Reflective Essay 8 Tips For Writing A
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligence
 
Narrative Mind Week 6 H4D Stanford 2016
Narrative Mind Week 6 H4D Stanford 2016Narrative Mind Week 6 H4D Stanford 2016
Narrative Mind Week 6 H4D Stanford 2016
 
Lightning Talks: An Innovation Showcase
Lightning Talks: An Innovation ShowcaseLightning Talks: An Innovation Showcase
Lightning Talks: An Innovation Showcase
 
Intro big data.pdf
Intro big data.pdfIntro big data.pdf
Intro big data.pdf
 
How to Use Data to Inform Your Design and Drive Your Business
How to Use Data to Inform Your Design and Drive Your BusinessHow to Use Data to Inform Your Design and Drive Your Business
How to Use Data to Inform Your Design and Drive Your Business
 
AI Orange Belt - Session 2
AI Orange Belt - Session 2AI Orange Belt - Session 2
AI Orange Belt - Session 2
 
data analytics lecture2.pptx
data analytics lecture2.pptxdata analytics lecture2.pptx
data analytics lecture2.pptx
 

Dernier

BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023ymrp368
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxolyaivanovalion
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Delhi Call girls
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...amitlee9823
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...SUHANI PANDEY
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 

Dernier (20)

BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptx
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 

Fantastic Problems and Where to Find Them: Daryl Weir

  • 1. Fantastic Problems Daryl Weir_ Senior Data Scientist_ And Where to Find Them
  • 2. Founded in 2000 400+ employees from 22 countries 8th year in a row profitable growth YOY growth 30% Helsinki Tampere Stockholm Berlin Munich London We help our customers succeed in digital business_
  • 3. Daryl Weir_ Senior Data Scientist PhD Computer Science 
 University of Glasgow Research Intern
 Microsoft Finland Postdoc Researcher
 Aalto University Data Scientist
 Futurice daryl.weir@futurice.com
 @darylweir 2010-2014
 2014
 2014-2016
 2016-Now
  • 4. Machine Learning Hype_ AI, big data, machine learning… These have been buzz words for years now This talk tries to answer two key questions: - What can machine learning actually do? - Should I use machine learning to solve my problem?
  • 5. AI/Machine Learning is not Skynet (yet…)
  • 6. So what is it?_ Machine learning is really good at answering narrow questions It does this by learning from examples The most common method is supervised learning, where problems look like: Given some input A, predict an output B By choosing the right A and B, you can do some amazing stuff A—>B
  • 7. For example_ Baby’s first self driving car Narrow question: given the road looks like this, how should I turn the steering wheel to not crash? Input: camera image of the road Output: steering wheel angle (CMU did this in 1995 - the car drove 3000 miles across the US)
  • 8. But what kind of problem should we look for?_
  • 9. “If a typical person can do a mental task with less than one second of thought, we can probably automate it using AI” - Andrew Ng
  • 10. A (somewhat trite) counterexample_ 18 27 131 94 322 1061 Are these numbers odd or even? You can (hopefully) judge this in under a second But many popular machine learning algorithms are terrible at learning this (from the numbers alone) Not all problems that can be tackled with ML should be
  • 11. You might have a machine learning problem if: 1 - writing down a set of rules is hard BUT 2 - gathering examples is easy My rule of thumb_
  • 12. Is there a cat in this image?_ Writing rules for the presence or absence of a cat is really, really hard BUT Finding cat pictures is really, really easy
  • 13. Narrowing the focus_ What kind of things can we do? Predict the future Uncover hidden structureRecommend content
  • 15. Scenario_ We have lots of historical examples of system state, each one labelled by its outcome Given a new example of the state, we try to predict the outcome Such problems fall into two main types: regression and classification Regression problems are those where the outcome is a number, for example: — how many potatoes will my supermarket sell next week? — what’s the optimal price to sell this stock? — what speed should I drive given the weather & traffic? Classification problems are those where the outcome is one of a fixed set, for example: — whose face is in this image? — is this credit card transaction fraudulent? — do these scan results indicate cancer?
  • 16. Case: AirBnB_ AirBnB is a great example of a company oriented around data They blog regularly about data science and open source some of their tools One problem they’ve talked about: given the user’s search term, should we show a given listing in the results? Basic distance based search for San Francisco shows many rentals in Oakland Users were likely to find these results irrelevant Bayesian model estimates the probability of booking each listing given a specific search This sped up the booking process and significantly increased conversion rate
  • 17. Case: Railways_ Two buzz words for the price of one: machine learning + IoT! Modern trains and rails are highly instrumented with sensors (motion, sound, pressure, etc.) Using data from these, we can classify whether or not a part (e.g. a wheel or door) will fail within a given time Lets the operator move from scheduled to on-demand maintenance/replacement VR Group estimates this cuts amount of wheel maintenance by around a third Bonus inception: sometimes the sensor themselves start giving incorrect readings Another classifier can detect these faults before they suggest phantom breakdowns
  • 19. Scenario_ Many businesses depend on content of some sort, e.g.: — articles and videos — products in a store — ads It is often valuable to personalise the content shown to your users There are two main methods to predict content users will like Collaborative filters suggest things other users have liked The preferences of others are weighted by how similar they are to you — “Customers also bought…” — Spotify Discover Weekly Content based recommenders suggest, unsurprisingly, based on the actual content — these articles have similar topics to your previous reads — more videos from channels you watched But it’s not always easy/ possible to define similarity between pieces of content
  • 20. Case: Netflix_ Netflix is one of the most well known users of recommender systems Like AirBnB, they blog regularly and have some open source tools available Good recommendations increase user satisfaction, encourage continued use, and drive users to the ‘long tail’ Famously offered a $1M prize to anyone who could improve their recommender by 10% Winning solution was never used! Recently changed from star based ratings to a thumbs up/ down system Better UX, avoids differences in users’ rating systems
  • 22. Scenario_ What if we don’t have labels for our historical states? Another major ML application area is automatically detecting recurring patterns in data This is called unsupervised learning - it’s about learning what’s interesting about some given data This lets us ask different sorts of questions, like: — how do players of my game group together based on play behaviour? — is this spending behaviour unusual given someone’s past purchases? — what are the common topics in this collection of documents?
  • 23. Case: Google News_ Google has been very vocal about its transformation into “a machine learning company” As of 2016, ~10% of their engineers have ML experience and internal training is growing They have stated a goal of systematically applying machine learning in all their business areas The rankings of different topics and outlets can be tailored by the user explicitly The service can also learn user preferences from usage data Google News automatically groups versions of the same story from different outlets Stories are also grouped to form news categories
  • 25. Starting out with ML_ Only for masochistsDesigned with data analysis in mind Amazing library and community support Not general purpose Excellent tooling - try the Anaconda distribution Jupyter notebooks allow mixed code and Markdown Can be slow
  • 26. The real third option_ Fast, powerful language with excellent support for really big datasets (via Spark, Hadoop) Highly parallel Not very beginner friendly Designed with data analysis in mind Amazing library and community support Not general purpose Excellent tooling - try the Anaconda distribution Jupyter notebooks allow mixed code and Markdown Can be slow
  • 27. Resources_ “I want tutorials and interesting blog posts pitched at beginners (and I don’t mind Python)” github.com/hangtwenty/dive-into-machine-learning “Python sucks, show me resources for machine learning in <my language of choice>” github.com/josephmisiti/awesome-machine-learning “Coding sucks, I just want to read a book” ‘The Master Algorithm’ by Pedro Domingos
  • 28. If you remember 3 things_ 1) Machine learning answers narrow questions 2) Look for complex rules with plentiful examples 3) Don’t use Matlab Thanks for listening!
 daryl.weir@futurice.com
 @darylweir