SlideShare a Scribd company logo
1 of 34
Download to read offline
A NEW PLATFORM FOR A NEW ERA
Driving the Future of Smart Cities

How to Beat the Traffic

Strata Santa Clara – February 13, 2013
Alexander Kagoshima, Data Scientist, @akagoshima
Noelle Sio, Senior Data Scientist, @noellesio
Ian Huston, Data Scientist, @ianhuston
@gopivotal
© Copyright 2014 Pivotal. All rights reserved.
2013

2
What Matters: Apps. Data. Analytics.
Apps power businesses, and
those apps generate data
Analytic insights from that data
drive new app functionality,
which in-turn drives new data
The faster you can move
around that cycle, the faster
you learn, innovate & pull
away from the competition
@noellesio, @gopivotal
© Copyright 2014 Pivotal. All rights reserved.

3
Pivotal’s Opportunity
Uniquely positioned to help
enterprises modernize each
facet of this cycle today
Comprehensive portfolio of
products spanning Apps, Data
& Analytics
Converging these technologies
into a coherent, next-gen
Enterprise PaaS platform
@noellesio, @gopivotal
© Copyright 2014 Pivotal. All rights reserved.

4
The Connected Car Drives Innovation
Telematics

Stolen vehicle
Remote
recovery
Behaviour
diagnosis
monitoring
Remote
Car2X
Driver
activation
assistance solutions
Floating
Real-time
eMobility
car data
parking info
solutions
Share my
trip

Traffic
updates
Car
search

Social Media

Responsive
Navigation
PoIs
Next gen
Navigation
Hybrid
Predictive
Navigation
traffic info Map/PoI
updates
Vehicle
Concierge
Fleet
Geo
Tracking
services
Management
fencing

Handsfree
telephony
Music
WiFi
streaming
hotspot
Pay as
Online
you drive Payment Web
games
solutions radio
Road
Environmental
Parking
tolls
browsing
VoD
space reservation
Car
sharing

eCommerce

City
toll

Rich media
comms
Car2X
comms

Communication

Entertainment

@noellesio, @gopivotal
© Copyright 2014 Pivotal. All rights reserved.

5
Possible Data Science Use-Cases
!  Predictive Car Maintenance

–  More accurately predict part failure
–  Optimize part repair and replacement schedule

!  Leveraging Driving Behavior

–  Useful to differentiate insurance pricing based on driving style
–  Optimize car design

!  Improving GPS Systems
–  Establish baseline for traffic congestion
–  Gain a detailed view on traffic
–  Create more meaningful metrics for routing

!  Predictive Power for Assistance Systems

–  Optimize fuel efficiency
–  Predict the future state of a car in the next 2 minutes
(starts, stops, emergency braking)

!  Traffic Light Assistance

–  Signal timing of traffic lights
–  Crowd sourcing of traffic signals

@noellesio, @gopivotal
© Copyright 2014 Pivotal. All rights reserved.

6
What does
traffic data
look like?

© Copyright 2014 Pivotal. All rights reserved.
2013

7
…like this?

@noellesio, @gopivotal
© Copyright 2014 Pivotal. All rights reserved.

8
How fast are vehicles moving?

@noellesio, @gopivotal
© Copyright 2014 Pivotal. All rights reserved.

9
How fast are vehicles moving?

0.015
0.000

0.005

0.010

density

0.020

0.025

0.030

Link 1000064869

0

50

100

km/h

150

200

@noellesio, @gopivotal
© Copyright 2014 Pivotal. All rights reserved.

10
When do disruptions happen?

@noellesio, @gopivotal
© Copyright 2014 Pivotal. All rights reserved.

11
[seconds]

When will the light change?

@noellesio, @gopivotal
© Copyright 2014 Pivotal. All rights reserved.

12
Taking Lessons From Other Disciplines
Change-Point Detection can
be used to uncover regimes in
wind-turbine data.
It can also be applied to uncover
regimes in traffic light
switching patterns.

@noellesio, @gopivotal
© Copyright 2014 Pivotal. All rights reserved.

13
Taking Lessons From Other Disciplines
Link 1000064869

Different Cell
Populations

0.015
0.010

density

0.020

0.025

0.030

Combined
Component 1
Component 2
Component 3
Component 4

0.000

0.005

Different Driving
Conditions

0

50

100

150

200

@noellesio, @gopivotal
© Copyright 2014 Pivotal. All rights reserved.

14
Understanding Traffic Flow
A dynamic, more detailed
understanding of traffic is now possible.
Can we answer both ‘What velocity?’
and ‘Why?’
Context
!  Current GPS systems are based on
average velocity over street segments
!  Real-time traffic information (e.g. Waze)
does not deliver detailed view nor
prediction
@akagoshima, @gopivotal
© Copyright 2014 Pivotal. All rights reserved.

15
Our Approach – Multi-step algorithm
From our experience, real-world data often requires multi-step procedures
Step 1: Answer ‘What velocity?’
First find distinct velocity
groups
Link 1000064869

Find influencing effects

?

?

?

?

?

?

0.000

0.005

0.010

density

0.015

0.020

0.025

0.030

Combined
Component 1
Component 2
Component 3
Component 4

Step 2: Answer ‘Why?’

0

50

100

150

200

km/h

@akagoshima, @gopivotal
© Copyright 2014 Pivotal. All rights reserved.

16
Find Velocity Groups
!  Velocity distributions can be fit well
with Gaussians
!  An ‘overlay’ of multiple Gaussians is
called Gaussian Mixture Model

0.030
0.025
0.020
density

0.015
0.010
0.005

!  Shapes and positions of Gaussians
determine velocity groups

Combined
Component 1
Component 2
Component 3
Component 4

0.000

!  GMM fitting of the velocity
distribution is done by ExpectationMaximization algorithm

Link 1000064869

0

50

100

150

200

km/h

@akagoshima, @gopivotal
© Copyright 2014 Pivotal. All rights reserved.

17
Gaussian Mixture Model
Link 1000064869

Link 1000064869
0.030

Combined
Component
Component
Component
Component

1
2
3
4

0.020
density

0.010
0.005
0.000
0

50

100

150

200

0

50

100

150

200

km/h

km/h

Link 1000064869

0.025

1
2
3
4

density

0.000

0.000

0.005

0.005

0.010

0.010

0.015

0.015

0.020

0.020

0.025

0.030

Combined
Component
Component
Component
Component

0.030

Link 1000064869

density

1
2
3
4

0.015

0.020
0.015
0.000

0.005

0.010

density

Combined
Component
Component
Component
Component

0.025

1
2
3
4

0.025

0.030

Combined
Component
Component
Component
Component

0

0

50

100

km/h

150

200

50

100

150

200

km/h

@akagoshima, @gopivotal
© Copyright 2014 Pivotal. All rights reserved.

18
0.04

Predict Gaussians

0.00

0.01

"  Classification task!

0.02

density

0.03

!  The second step seeks to explain/predict
which Gaussian a data point belongs to

Combined
Component 1
Component 2
Component 3

0

50

100

150

200

250

!  Features for classification:
– 
– 
– 
– 
– 

Time of day, day of week
Weather
Direction
Special Events
…

?

?

?

?…

@akagoshima, @gopivotal
© Copyright 2014 Pivotal. All rights reserved.

19
White and Black Boxes
!  Analyze correlations between features of a data point and its
assignment to a Gaussian
!  From a Machine Learning point of view, this is classification

!  Generate an interpretable model
description

!  Can capture more complex
correlations

" Explanation of behavior

" Prediction of Gaussian assignment

@akagoshima, @gopivotal
© Copyright 2014 Pivotal. All rights reserved.

20
0.04

Putting it all together…
Combined
Component 1
Component 2
Component 3

0.02

Average velocity = 85 km/h
• 
Weekdays after 8pm
• 
Weekdays before 2pm, exiting
Average velocity = 120 km/h
• 
Weekends
• 
Weekdays before 2pm, not exiting

Two-Step algorithm:
GMM + Classification
•  Identified multiple velocity
profiles for every road segment
•  Intuitive and easily
interpretable results
•  Highly scalable for more
features and data

0.00

0.01

density

0.03

Average velocity = 45 km/h
• 
Weekdays between 2 – 8pm

0

50

100

km/h

150

200

250

@akagoshima, @gopivotal
© Copyright 2014 Pivotal. All rights reserved.

21
Better Travel Time Prediction
!  Traffic profiles emerged from data
–  Without using metadata, we uncovered road
segment traffic patterns

!  Identified Bias Effects
–  Inferring the impact of turns and day of week on velocity
–  Able to predict rush hour by day and time by road segment

!  Traffic Light Patterns
–  Infer public transportation effects on traffic
–  Automatically determine different switching patterns
@akagoshima, @gopivotal
© Copyright 2014 Pivotal. All rights reserved.

22
London Road Traffic
Disruptions
Can we predict when
unexpected incidents will end?
Publicly available data:
!  Transport for London traffic feed
(refreshed every 5 minutes)

!  Weather Underground reports

Photo by James Blunt Photography on
Flickr (CC BY-ND 2.0)

@ianhuston, @gopivotal
© Copyright 2014 Pivotal. All rights reserved.

23
@ianhuston, @gopivotal
© Copyright 2014 Pivotal. All rights reserved.

24
Major storm hits UK (Photo: BBC)

@ianhuston, @gopivotal
© Copyright 2014 Pivotal. All rights reserved.

25
@ianhuston, @gopivotal
© Copyright 2014 Pivotal. All rights reserved.

26
@ianhuston, @gopivotal
© Copyright 2014 Pivotal. All rights reserved.

27
Durations are very
different for
different types of
incident.
Mean duration for
Surface Damage
incidents is 107
hours!

@ianhuston, @gopivotal
© Copyright 2014 Pivotal. All rights reserved.

28
Rain affects
duration in a
surprising way.
Incidents which start
when it is raining
finish faster than
others.

@ianhuston, @gopivotal
© Copyright 2014 Pivotal. All rights reserved.

29
Models
Linear Regression
•  Disruption reports &
weather features
Random Forests
•  Rounded categorical
•  Regression
Category MAP
•  Only use category of
incident
•  Maximum Likelihood
estimate
@ianhuston, @gopivotal
© Copyright 2014 Pivotal. All rights reserved.

30
Live Predictions
http://ds-demo-transport.cfapps.io
Using:

@ianhuston, @gopivotal
© Copyright 2014 Pivotal. All rights reserved.

31
Summary
! Making use of a vibrant ecosystem of traffic data
! Innovative approaches needed to generate value
from abundant and complex sources
! Connecting predictive models to traffic in the
physical world is the future of smart cities

© Copyright 2014 Pivotal. All rights reserved.

32
Thank You!
Check out more of our Data Science use-cases at
www.goPivotal.com

© Copyright 2014 Pivotal. All rights reserved.
2013

33
A NEW PLATFORM FOR A NEW ERA

More Related Content

Similar to Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

Smart City: Intelligent Traffic Management System
Smart City: Intelligent Traffic Management SystemSmart City: Intelligent Traffic Management System
Smart City: Intelligent Traffic Management SystemSuhas Motwani
 
Bruce Thompson on digital disruption and the environment
Bruce Thompson on digital disruption and the environment Bruce Thompson on digital disruption and the environment
Bruce Thompson on digital disruption and the environment OCESAdmin
 
Using Swarm Intelligence to Prepare for the Next Carmageddon
Using Swarm Intelligence to Prepare for the Next CarmageddonUsing Swarm Intelligence to Prepare for the Next Carmageddon
Using Swarm Intelligence to Prepare for the Next CarmageddonNess Digital Engineering
 
Internet of Things - Building a Smarter World
Internet of Things - Building a Smarter WorldInternet of Things - Building a Smarter World
Internet of Things - Building a Smarter WorldDr. Mazlan Abbas
 
Traffic Light Controller System using Optical Flow Estimation
Traffic Light Controller System using Optical Flow EstimationTraffic Light Controller System using Optical Flow Estimation
Traffic Light Controller System using Optical Flow EstimationEditor IJCATR
 
IOT based fuel monitoring for future vehicles.
IOT based fuel monitoring for  future vehicles.IOT based fuel monitoring for  future vehicles.
IOT based fuel monitoring for future vehicles.IRJET Journal
 
Combining Mobile Air Quality Sensors With Census Data
Combining Mobile Air Quality Sensors With Census DataCombining Mobile Air Quality Sensors With Census Data
Combining Mobile Air Quality Sensors With Census DataLiz Derr
 
Smart Road System to Ensure Road Accidents & Traffic Flow: an Overview
Smart Road System to Ensure Road Accidents & Traffic Flow: an OverviewSmart Road System to Ensure Road Accidents & Traffic Flow: an Overview
Smart Road System to Ensure Road Accidents & Traffic Flow: an OverviewIRJET Journal
 
Design of intelligent traffic light controller using gsm & embedded system
Design of intelligent traffic light controller using gsm & embedded systemDesign of intelligent traffic light controller using gsm & embedded system
Design of intelligent traffic light controller using gsm & embedded systemYakkali Kiran
 
The Dawn of Industry 4.0
The Dawn of Industry 4.0The Dawn of Industry 4.0
The Dawn of Industry 4.0CPqD
 
Spectrum auction failure in India, October 2016
Spectrum auction failure in India, October 2016Spectrum auction failure in India, October 2016
Spectrum auction failure in India, October 2016Coleago Consulting
 
Wireless Network Optimization (2010)
Wireless Network Optimization (2010)Wireless Network Optimization (2010)
Wireless Network Optimization (2010)Marc Jadoul
 

Similar to Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014) (20)

Smart City: Intelligent Traffic Management System
Smart City: Intelligent Traffic Management SystemSmart City: Intelligent Traffic Management System
Smart City: Intelligent Traffic Management System
 
Bruce Thompson on digital disruption and the environment
Bruce Thompson on digital disruption and the environment Bruce Thompson on digital disruption and the environment
Bruce Thompson on digital disruption and the environment
 
WIPAC Monthly - March 2023.pdf
WIPAC Monthly - March 2023.pdfWIPAC Monthly - March 2023.pdf
WIPAC Monthly - March 2023.pdf
 
Yu info 2015 final jg
Yu info 2015 final jgYu info 2015 final jg
Yu info 2015 final jg
 
Using Swarm Intelligence to Prepare for the Next Carmageddon
Using Swarm Intelligence to Prepare for the Next CarmageddonUsing Swarm Intelligence to Prepare for the Next Carmageddon
Using Swarm Intelligence to Prepare for the Next Carmageddon
 
Internet of Things - Building a Smarter World
Internet of Things - Building a Smarter WorldInternet of Things - Building a Smarter World
Internet of Things - Building a Smarter World
 
6 use cases
6  use cases6  use cases
6 use cases
 
6 iot cases
6 iot cases6 iot cases
6 iot cases
 
Traffic Light Controller System using Optical Flow Estimation
Traffic Light Controller System using Optical Flow EstimationTraffic Light Controller System using Optical Flow Estimation
Traffic Light Controller System using Optical Flow Estimation
 
IOT based fuel monitoring for future vehicles.
IOT based fuel monitoring for  future vehicles.IOT based fuel monitoring for  future vehicles.
IOT based fuel monitoring for future vehicles.
 
Combining Mobile Air Quality Sensors With Census Data
Combining Mobile Air Quality Sensors With Census DataCombining Mobile Air Quality Sensors With Census Data
Combining Mobile Air Quality Sensors With Census Data
 
Smart Road System to Ensure Road Accidents & Traffic Flow: an Overview
Smart Road System to Ensure Road Accidents & Traffic Flow: an OverviewSmart Road System to Ensure Road Accidents & Traffic Flow: an Overview
Smart Road System to Ensure Road Accidents & Traffic Flow: an Overview
 
WIPAC Monthly - June 2023.pdf
WIPAC Monthly - June 2023.pdfWIPAC Monthly - June 2023.pdf
WIPAC Monthly - June 2023.pdf
 
Wipac monthly 49th edition october 2015
Wipac monthly 49th edition  october 2015Wipac monthly 49th edition  october 2015
Wipac monthly 49th edition october 2015
 
Design of intelligent traffic light controller using gsm & embedded system
Design of intelligent traffic light controller using gsm & embedded systemDesign of intelligent traffic light controller using gsm & embedded system
Design of intelligent traffic light controller using gsm & embedded system
 
The Dawn of Industry 4.0
The Dawn of Industry 4.0The Dawn of Industry 4.0
The Dawn of Industry 4.0
 
Spectrum auction failure in India, October 2016
Spectrum auction failure in India, October 2016Spectrum auction failure in India, October 2016
Spectrum auction failure in India, October 2016
 
Mini-grids for Energy Access in Sub-Saharan Africa
Mini-grids for Energy Access in Sub-Saharan Africa Mini-grids for Energy Access in Sub-Saharan Africa
Mini-grids for Energy Access in Sub-Saharan Africa
 
Wireless Network Optimization (2010)
Wireless Network Optimization (2010)Wireless Network Optimization (2010)
Wireless Network Optimization (2010)
 
WIPAC Monthly - February 2023.pdf
WIPAC Monthly - February 2023.pdfWIPAC Monthly - February 2023.pdf
WIPAC Monthly - February 2023.pdf
 

More from Ian Huston

Cloud Foundry for Data Science
Cloud Foundry for Data ScienceCloud Foundry for Data Science
Cloud Foundry for Data ScienceIan Huston
 
Python on Cloud Foundry
Python on Cloud FoundryPython on Cloud Foundry
Python on Cloud FoundryIan Huston
 
Data Science Amsterdam - Massively Parallel Processing with Procedural Languages
Data Science Amsterdam - Massively Parallel Processing with Procedural LanguagesData Science Amsterdam - Massively Parallel Processing with Procedural Languages
Data Science Amsterdam - Massively Parallel Processing with Procedural LanguagesIan Huston
 
Massively Parallel Processing with Procedural Python (PyData London 2014)
Massively Parallel Processing with Procedural Python (PyData London 2014)Massively Parallel Processing with Procedural Python (PyData London 2014)
Massively Parallel Processing with Procedural Python (PyData London 2014)Ian Huston
 
Calculating Non-adiabatic Pressure Perturbations during Multi-field Inflation
Calculating Non-adiabatic Pressure Perturbations during Multi-field InflationCalculating Non-adiabatic Pressure Perturbations during Multi-field Inflation
Calculating Non-adiabatic Pressure Perturbations during Multi-field InflationIan Huston
 
Second Order Perturbations - National Astronomy Meeting 2011
Second Order Perturbations - National Astronomy Meeting 2011Second Order Perturbations - National Astronomy Meeting 2011
Second Order Perturbations - National Astronomy Meeting 2011Ian Huston
 
Second Order Perturbations During Inflation Beyond Slow-roll
Second Order Perturbations During Inflation Beyond Slow-rollSecond Order Perturbations During Inflation Beyond Slow-roll
Second Order Perturbations During Inflation Beyond Slow-rollIan Huston
 
Inflation as a solution to the problems of the Big Bang
Inflation as a solution to the problems of the Big BangInflation as a solution to the problems of the Big Bang
Inflation as a solution to the problems of the Big BangIan Huston
 
Cosmological Perturbations and Numerical Simulations
Cosmological Perturbations and Numerical SimulationsCosmological Perturbations and Numerical Simulations
Cosmological Perturbations and Numerical SimulationsIan Huston
 
Cosmo09 presentation
Cosmo09 presentationCosmo09 presentation
Cosmo09 presentationIan Huston
 

More from Ian Huston (10)

Cloud Foundry for Data Science
Cloud Foundry for Data ScienceCloud Foundry for Data Science
Cloud Foundry for Data Science
 
Python on Cloud Foundry
Python on Cloud FoundryPython on Cloud Foundry
Python on Cloud Foundry
 
Data Science Amsterdam - Massively Parallel Processing with Procedural Languages
Data Science Amsterdam - Massively Parallel Processing with Procedural LanguagesData Science Amsterdam - Massively Parallel Processing with Procedural Languages
Data Science Amsterdam - Massively Parallel Processing with Procedural Languages
 
Massively Parallel Processing with Procedural Python (PyData London 2014)
Massively Parallel Processing with Procedural Python (PyData London 2014)Massively Parallel Processing with Procedural Python (PyData London 2014)
Massively Parallel Processing with Procedural Python (PyData London 2014)
 
Calculating Non-adiabatic Pressure Perturbations during Multi-field Inflation
Calculating Non-adiabatic Pressure Perturbations during Multi-field InflationCalculating Non-adiabatic Pressure Perturbations during Multi-field Inflation
Calculating Non-adiabatic Pressure Perturbations during Multi-field Inflation
 
Second Order Perturbations - National Astronomy Meeting 2011
Second Order Perturbations - National Astronomy Meeting 2011Second Order Perturbations - National Astronomy Meeting 2011
Second Order Perturbations - National Astronomy Meeting 2011
 
Second Order Perturbations During Inflation Beyond Slow-roll
Second Order Perturbations During Inflation Beyond Slow-rollSecond Order Perturbations During Inflation Beyond Slow-roll
Second Order Perturbations During Inflation Beyond Slow-roll
 
Inflation as a solution to the problems of the Big Bang
Inflation as a solution to the problems of the Big BangInflation as a solution to the problems of the Big Bang
Inflation as a solution to the problems of the Big Bang
 
Cosmological Perturbations and Numerical Simulations
Cosmological Perturbations and Numerical SimulationsCosmological Perturbations and Numerical Simulations
Cosmological Perturbations and Numerical Simulations
 
Cosmo09 presentation
Cosmo09 presentationCosmo09 presentation
Cosmo09 presentation
 

Recently uploaded

GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 

Recently uploaded (20)

GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 

Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

  • 1. A NEW PLATFORM FOR A NEW ERA
  • 2. Driving the Future of Smart Cities How to Beat the Traffic Strata Santa Clara – February 13, 2013 Alexander Kagoshima, Data Scientist, @akagoshima Noelle Sio, Senior Data Scientist, @noellesio Ian Huston, Data Scientist, @ianhuston @gopivotal © Copyright 2014 Pivotal. All rights reserved. 2013 2
  • 3. What Matters: Apps. Data. Analytics. Apps power businesses, and those apps generate data Analytic insights from that data drive new app functionality, which in-turn drives new data The faster you can move around that cycle, the faster you learn, innovate & pull away from the competition @noellesio, @gopivotal © Copyright 2014 Pivotal. All rights reserved. 3
  • 4. Pivotal’s Opportunity Uniquely positioned to help enterprises modernize each facet of this cycle today Comprehensive portfolio of products spanning Apps, Data & Analytics Converging these technologies into a coherent, next-gen Enterprise PaaS platform @noellesio, @gopivotal © Copyright 2014 Pivotal. All rights reserved. 4
  • 5. The Connected Car Drives Innovation Telematics Stolen vehicle Remote recovery Behaviour diagnosis monitoring Remote Car2X Driver activation assistance solutions Floating Real-time eMobility car data parking info solutions Share my trip Traffic updates Car search Social Media Responsive Navigation PoIs Next gen Navigation Hybrid Predictive Navigation traffic info Map/PoI updates Vehicle Concierge Fleet Geo Tracking services Management fencing Handsfree telephony Music WiFi streaming hotspot Pay as Online you drive Payment Web games solutions radio Road Environmental Parking tolls browsing VoD space reservation Car sharing eCommerce City toll Rich media comms Car2X comms Communication Entertainment @noellesio, @gopivotal © Copyright 2014 Pivotal. All rights reserved. 5
  • 6. Possible Data Science Use-Cases !  Predictive Car Maintenance –  More accurately predict part failure –  Optimize part repair and replacement schedule !  Leveraging Driving Behavior –  Useful to differentiate insurance pricing based on driving style –  Optimize car design !  Improving GPS Systems –  Establish baseline for traffic congestion –  Gain a detailed view on traffic –  Create more meaningful metrics for routing !  Predictive Power for Assistance Systems –  Optimize fuel efficiency –  Predict the future state of a car in the next 2 minutes (starts, stops, emergency braking) !  Traffic Light Assistance –  Signal timing of traffic lights –  Crowd sourcing of traffic signals @noellesio, @gopivotal © Copyright 2014 Pivotal. All rights reserved. 6
  • 7. What does traffic data look like? © Copyright 2014 Pivotal. All rights reserved. 2013 7
  • 8. …like this? @noellesio, @gopivotal © Copyright 2014 Pivotal. All rights reserved. 8
  • 9. How fast are vehicles moving? @noellesio, @gopivotal © Copyright 2014 Pivotal. All rights reserved. 9
  • 10. How fast are vehicles moving? 0.015 0.000 0.005 0.010 density 0.020 0.025 0.030 Link 1000064869 0 50 100 km/h 150 200 @noellesio, @gopivotal © Copyright 2014 Pivotal. All rights reserved. 10
  • 11. When do disruptions happen? @noellesio, @gopivotal © Copyright 2014 Pivotal. All rights reserved. 11
  • 12. [seconds] When will the light change? @noellesio, @gopivotal © Copyright 2014 Pivotal. All rights reserved. 12
  • 13. Taking Lessons From Other Disciplines Change-Point Detection can be used to uncover regimes in wind-turbine data. It can also be applied to uncover regimes in traffic light switching patterns. @noellesio, @gopivotal © Copyright 2014 Pivotal. All rights reserved. 13
  • 14. Taking Lessons From Other Disciplines Link 1000064869 Different Cell Populations 0.015 0.010 density 0.020 0.025 0.030 Combined Component 1 Component 2 Component 3 Component 4 0.000 0.005 Different Driving Conditions 0 50 100 150 200 @noellesio, @gopivotal © Copyright 2014 Pivotal. All rights reserved. 14
  • 15. Understanding Traffic Flow A dynamic, more detailed understanding of traffic is now possible. Can we answer both ‘What velocity?’ and ‘Why?’ Context !  Current GPS systems are based on average velocity over street segments !  Real-time traffic information (e.g. Waze) does not deliver detailed view nor prediction @akagoshima, @gopivotal © Copyright 2014 Pivotal. All rights reserved. 15
  • 16. Our Approach – Multi-step algorithm From our experience, real-world data often requires multi-step procedures Step 1: Answer ‘What velocity?’ First find distinct velocity groups Link 1000064869 Find influencing effects ? ? ? ? ? ? 0.000 0.005 0.010 density 0.015 0.020 0.025 0.030 Combined Component 1 Component 2 Component 3 Component 4 Step 2: Answer ‘Why?’ 0 50 100 150 200 km/h @akagoshima, @gopivotal © Copyright 2014 Pivotal. All rights reserved. 16
  • 17. Find Velocity Groups !  Velocity distributions can be fit well with Gaussians !  An ‘overlay’ of multiple Gaussians is called Gaussian Mixture Model 0.030 0.025 0.020 density 0.015 0.010 0.005 !  Shapes and positions of Gaussians determine velocity groups Combined Component 1 Component 2 Component 3 Component 4 0.000 !  GMM fitting of the velocity distribution is done by ExpectationMaximization algorithm Link 1000064869 0 50 100 150 200 km/h @akagoshima, @gopivotal © Copyright 2014 Pivotal. All rights reserved. 17
  • 18. Gaussian Mixture Model Link 1000064869 Link 1000064869 0.030 Combined Component Component Component Component 1 2 3 4 0.020 density 0.010 0.005 0.000 0 50 100 150 200 0 50 100 150 200 km/h km/h Link 1000064869 0.025 1 2 3 4 density 0.000 0.000 0.005 0.005 0.010 0.010 0.015 0.015 0.020 0.020 0.025 0.030 Combined Component Component Component Component 0.030 Link 1000064869 density 1 2 3 4 0.015 0.020 0.015 0.000 0.005 0.010 density Combined Component Component Component Component 0.025 1 2 3 4 0.025 0.030 Combined Component Component Component Component 0 0 50 100 km/h 150 200 50 100 150 200 km/h @akagoshima, @gopivotal © Copyright 2014 Pivotal. All rights reserved. 18
  • 19. 0.04 Predict Gaussians 0.00 0.01 "  Classification task! 0.02 density 0.03 !  The second step seeks to explain/predict which Gaussian a data point belongs to Combined Component 1 Component 2 Component 3 0 50 100 150 200 250 !  Features for classification: –  –  –  –  –  Time of day, day of week Weather Direction Special Events … ? ? ? ?… @akagoshima, @gopivotal © Copyright 2014 Pivotal. All rights reserved. 19
  • 20. White and Black Boxes !  Analyze correlations between features of a data point and its assignment to a Gaussian !  From a Machine Learning point of view, this is classification !  Generate an interpretable model description !  Can capture more complex correlations " Explanation of behavior " Prediction of Gaussian assignment @akagoshima, @gopivotal © Copyright 2014 Pivotal. All rights reserved. 20
  • 21. 0.04 Putting it all together… Combined Component 1 Component 2 Component 3 0.02 Average velocity = 85 km/h •  Weekdays after 8pm •  Weekdays before 2pm, exiting Average velocity = 120 km/h •  Weekends •  Weekdays before 2pm, not exiting Two-Step algorithm: GMM + Classification •  Identified multiple velocity profiles for every road segment •  Intuitive and easily interpretable results •  Highly scalable for more features and data 0.00 0.01 density 0.03 Average velocity = 45 km/h •  Weekdays between 2 – 8pm 0 50 100 km/h 150 200 250 @akagoshima, @gopivotal © Copyright 2014 Pivotal. All rights reserved. 21
  • 22. Better Travel Time Prediction !  Traffic profiles emerged from data –  Without using metadata, we uncovered road segment traffic patterns !  Identified Bias Effects –  Inferring the impact of turns and day of week on velocity –  Able to predict rush hour by day and time by road segment !  Traffic Light Patterns –  Infer public transportation effects on traffic –  Automatically determine different switching patterns @akagoshima, @gopivotal © Copyright 2014 Pivotal. All rights reserved. 22
  • 23. London Road Traffic Disruptions Can we predict when unexpected incidents will end? Publicly available data: !  Transport for London traffic feed (refreshed every 5 minutes) !  Weather Underground reports Photo by James Blunt Photography on Flickr (CC BY-ND 2.0) @ianhuston, @gopivotal © Copyright 2014 Pivotal. All rights reserved. 23
  • 24. @ianhuston, @gopivotal © Copyright 2014 Pivotal. All rights reserved. 24
  • 25. Major storm hits UK (Photo: BBC) @ianhuston, @gopivotal © Copyright 2014 Pivotal. All rights reserved. 25
  • 26. @ianhuston, @gopivotal © Copyright 2014 Pivotal. All rights reserved. 26
  • 27. @ianhuston, @gopivotal © Copyright 2014 Pivotal. All rights reserved. 27
  • 28. Durations are very different for different types of incident. Mean duration for Surface Damage incidents is 107 hours! @ianhuston, @gopivotal © Copyright 2014 Pivotal. All rights reserved. 28
  • 29. Rain affects duration in a surprising way. Incidents which start when it is raining finish faster than others. @ianhuston, @gopivotal © Copyright 2014 Pivotal. All rights reserved. 29
  • 30. Models Linear Regression •  Disruption reports & weather features Random Forests •  Rounded categorical •  Regression Category MAP •  Only use category of incident •  Maximum Likelihood estimate @ianhuston, @gopivotal © Copyright 2014 Pivotal. All rights reserved. 30
  • 32. Summary ! Making use of a vibrant ecosystem of traffic data ! Innovative approaches needed to generate value from abundant and complex sources ! Connecting predictive models to traffic in the physical world is the future of smart cities © Copyright 2014 Pivotal. All rights reserved. 32
  • 33. Thank You! Check out more of our Data Science use-cases at www.goPivotal.com © Copyright 2014 Pivotal. All rights reserved. 2013 33
  • 34. A NEW PLATFORM FOR A NEW ERA