Transport for London (TfL) collects large amounts of data from its services that it analyzes to improve customer experience. TfL uses data from Oyster cards, bus journeys, emails and more to understand customer behavior and congestion patterns. It then tests targeted communications to influence passenger travel during peak times. A trial in Bethnal Green showed about a 5% shift in travel from the most congested period, indicating changes to passenger behavior. Going forward, TfL aims to integrate more data sources to better optimize transport networks and customer service.
Big Data Analytics for Improved Customer Experience
1. Big Data for a Better Customer Experience
Lauren Sager Weinstein
Head of Analytics
Transport for London
2. 2
Our purpose
‘Keep cities working and growing and make life better’
Plan ahead to meet the challenges
of a growing population
Unlock economic development
and growth
Meet the rising expectations of
our customers and users
3. 3
9.46 million oyster cards in
regular usage
4.725 million Tube journeys on 28
Nov 2014
10 million people in London by
2030
London’s Big Data
More than 50 million bus
journeys in week of 22 Sept 2014
Given our number of customers, by its very nature our data is Big...
4. Our work in Big Data
4
But we can do even more to turn this into ‘Big Data Analytics’ when we
combine it with other data sources and apply algorithms. ...
The first step is to understand the context of our data
5. Inferring destination for bus trips
A customer taps an Oyster card
on the reader, which records the
location and time
Can we infer the exit point?
Stop
B
Stop
A
Bus events are recorded in the iBus system and we
can match this with our Oyster data
5
6. Station Y
Stop X
Stop
B
Stop
A
Where is the next tap?
From the location of the
next tap (if there is one), we
can infer where a customer
alights
If next trip begins at stop X, the
current segment is inferred to
end at stop A
If next trip begins at station Y, the
current segment inferred to end at
stop B
6
8. Customer Emails
8
Database holds more than 4 million email addressees
85 million
emails
sent in
2013
Message content is often dynamically tailored
Key sources are:
Oyster data
Congestion Charge
Barclays Cycle Hire
Registered cyclists
Multi modal
message reaches
2m customers
every week
83% of customers rate the
CRM service update
emails as useful or very
useful
10. Customer Segmentation at Stations
10
Cluster Description Median Start
Time
Journeys /
Day
Stations / Day # of
Regular
Days
# of
Irregular
Days
Cluster
Sample Size
1
Regular Frequent
User
08:11 2.5 0.7 11.6 4.6 36%
2
Occasional User
(Resident)
12:53 1.9 1.3 2.5 3.6 30%
3
Irregular Frequent
User
11:50 2.6 0.9 4.1 12.1 18%
4
Occasional User
(Visitor)
11:01 3.4 3.1 1.9 2.0 16%
36%
30%
18%
16%
B aker S treet - C lus ter S izes
1
2
3
4
11. Behaviour Change Trial Communications
11
Whiteboard posterLeaflet Email
Leytonstone – to be
played from 0800 –
0830
“TfL is investing to
improve the capacity
and frequency of
Tube services but we
know that at certain
times and places the
network can be very
busy.
The busiest time
here is between
08:15 and 08:30.
If you are able to
travel outside this
time you could have
a more comfortable
journey.”
Announcement
12. Bethnal Green Results
Target Time Period
Proportionofdemandbetween7amand10amineachperiod
Proportionofdemandbetween7amand10am
The results suggests change in
passenger behaviour:
• Demand distribution over the peak
period consistent between pre trial and
September periods
• Approximate 5% shift in demand during
target time period of 08:15 to 08:45.
• Total Peak Demand over the trial period
relatively unchanged
12
13. What next? Our future aims
Many more topics and questions to explore!
• Integrating ticketing, bus, traffic congestion, and incident data for
better performance of the bus and road networks
• Integrating social media with our customer data for deeper
understanding
• Looking at weather data to see how it affects transport use
• Using new data mining tools and geo-spatial visualisations to bring
data to life