bite-sized lecture
       @neal_lathia
            october 7, 2011
my research:
urban data mining
over half of us live
in cities; by 2050,
70% will
the oyster card
what tools can we design to help
travellers?
one example:
there is more to urban mobility
than just moving.
who are you?
where do you want to go?
how often?
how?
when?
how are you paying?
what route?
+ how do we travel? //
how do we spend?
+ do travellers make the
correct decisions? (no)
+ can we help them with
recommendations? (yes)
(%)     pay as you go purchases
49.8    < 5 GBP
24.2    5 – 10 GBP
15.5    10 – 20 GBP

(%)     travel card purchases
70.8    7-day travel card
15.8    1-month travel card
11.6    7-day bus/tram pass

[chart: Purchase Behaviour; % purchases by day of week (Mon to Sun), travel cards vs PAYG]
[charts: Purchase Geography (PAYG vs travel cards); Mobility Flow (zones 1 to 6, arrive / depart)]
high regularity – in movement,
purchases
small increments, short terms
is this ideal?
luckily,
computers are good at
counting. let them do it.
idea:
compare what you bought to
what you could have bought
(was it cheaper?).
repeat 300,000 times.
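
a minimal sketch of the counting idea, in python. the two fares, the product names and the three travellers are made-up assumptions for illustration; real oyster fares depend on zones, time of day and mode, and this is not the method used in the study.

```python
# Hypothetical fares in GBP (assumptions, not real Oyster prices).
PAYG_FARE = 2.0            # assumed flat cost of one pay-as-you-go trip
TRAVEL_CARD_7_DAY = 25.0   # assumed cost of a 7-day travel card


def cheapest_option(trips_in_week):
    """Return (name, cost) of the cheaper way to pay for a week of trips."""
    options = {
        "payg": trips_in_week * PAYG_FARE,
        "7-day travel card": TRAVEL_CARD_7_DAY,
    }
    name = min(options, key=options.get)
    return name, options[name]


def overspend(bought, trips_in_week):
    """How much more the traveller paid than the cheapest option."""
    if bought == "7-day travel card":
        paid = TRAVEL_CARD_7_DAY
    else:
        paid = trips_in_week * PAYG_FARE
    _, best = cheapest_option(trips_in_week)
    return paid - best


# Repeat for every traveller in the sample (three made-up travellers here):
# (what they bought, how many trips they made that week).
travellers = [("payg", 20), ("7-day travel card", 5), ("payg", 4)]
total = sum(overspend(bought, trips) for bought, trips in travellers)
print(f"total overspend: £{total:.2f}")
```

the same loop scales to any number of travellers; only the fare table and the list of purchases change.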
results for this data:
£2.5 million overspend
using this sample to estimate the entire
city means we overspend by:

£200 million per year
by making the wrong decisions.
£200 million per year
by making the wrong decision?

not understanding how we will
need public transport (but...)
failing to match fares with our
needs (but...)
pop quiz:
who has bought something on
amazon?
so you know what a
recommender system is?
recommender system:
data + machine learning for
personalised results
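
one way to picture "data + machine learning" is a nearest-neighbour recommender: look up the travellers most similar to you and suggest what worked best for them. the sketch below is illustrative only; the features (trips per week, share of peak-time trips), the six training rows and the choice of k are assumptions, and the features are left unscaled for brevity.

```python
from collections import Counter
import math

# Made-up history: (trips per week, share of trips at peak time) -> cheapest product.
HISTORY = [
    ((2, 0.0), "payg"),
    ((3, 0.5), "payg"),
    ((4, 0.2), "payg"),
    ((10, 0.9), "7-day travel card"),
    ((12, 0.8), "7-day travel card"),
    ((14, 1.0), "7-day travel card"),
]


def recommend(features, k=3):
    """Majority vote among the k past travellers closest to `features`."""
    nearest = sorted(HISTORY, key=lambda row: math.dist(row[0], features))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


print(recommend((11, 0.7)))  # a frequent, mostly peak-time traveller
print(recommend((3, 0.1)))   # an occasional traveller
```

with real data the features would need scaling, so that trip counts do not dominate the distance.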
we tested
recommender systems for oyster
purchases: they were 74-98% accurate.
              Accuracy (%)            Savings (GBP)
              Dataset 1   Dataset 2   Dataset 1    Dataset 2
Baseline      74.99       76.91       326,447.95   306,145.85
Naïve Bayes   77.46       80.71       393,585.81   369,232.24
k-NN (5)      96.74       97.09       465,822.17   426,375.85
C4.5          98.01       98.29       473,918.38   434,082.81
Oracle        100         100         479,583.91   438,923.30
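
to read the savings columns, it can help to express each method as a share of the oracle's savings. the sketch below only re-uses the figures from the table; the function name and the output layout are illustrative choices.

```python
# Savings (GBP) copied from the table: (dataset 1, dataset 2).
SAVINGS = {
    "Baseline": (326_447.95, 306_145.85),
    "Naïve Bayes": (393_585.81, 369_232.24),
    "k-NN (5)": (465_822.17, 426_375.85),
    "C4.5": (473_918.38, 434_082.81),
}
ORACLE = (479_583.91, 438_923.30)


def share_of_oracle(method):
    """Fraction of the oracle's savings captured by `method`, per dataset."""
    return tuple(s / o for s, o in zip(SAVINGS[method], ORACLE))


for method in SAVINGS:
    d1, d2 = share_of_oracle(method)
    print(f"{method:<12} {d1:6.1%} {d2:6.1%}")
```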
further reading:

N. Lathia, L. Capra. Mining Mobility Data to Minimise Travellers'
Spending on Public Transport. In ACM KDD 2011, San Diego, USA.

N. Lathia, J. Froehlich, L. Capra. Mining Public Transport Data for
Personalised Intelligent Transport Systems. In IEEE ICDM 2010,
Sydney, Australia.

N. Lathia and L. Capra. How Smart is Your Smart card? Measuring
Travel Behaviours, Perceptions, and Incentives. In ACM UbiComp
2011, Beijing, China.

Helping travelers make smarter transport decisions
