SlideShare une entreprise Scribd logo
1  sur  39
How Many Volunteers Does It Take To Map An Area Well?  Dr Muki Haklay  Department of Civil, Environmental and Geomatic Engineering, UCL  m.haklay@ucl.ac.uk AamerAther (M.Eng 2009), Sofia Basiouka (MSc GIS 2009)  and NaureenZulfiqar (M.Eng 2008) Ordnance Survey data was kindly provided by the Ordnance Survey research unit.   OSM data was provided by GeoFabrik & CloudMade
Outline A bit about quality of geographical information Evaluation of OSM with Meridian data set Evaluation of OSM with MasterMap Linus’ low –more users: higher quality?
The quality issue How good it the data?  First question: good for what? Subjective quality – fitness for purpose/use Second question: how to measure? Objective quality – but need to evaluate it in light of the first question
The quality issue How good it the data?  Positional accuracy – the position of features or geographic objects in either two or three dimensions Temporal accuracy – how up to date is the data? Does it presents the existing situation and when will it be updated?  Thematic/attribute accuracy – for quantitative attributes (width) and qualitative attributes (geographic names) Completeness – The presence and absence of objects in a dataset at a particular point in time Logical consistency –adherence to the logical rules of the data structure, attribution and relationships
The ‘problem’ “We know little about the people that collect it, their skills, knowledge or patterns of data collection” “Loose coordination and no top-down quality assurance processes – can’t produce good data” “It is not complete and comprehensive – there are white areas”
Coverage and completeness
Coverage and completeness
Completeness – difference by user?
Patterns of collaboration
Users Limited ‘on the ground’ collaboration. Important as this can be the main source of quality assurance - ‘Given enough eyeballs, all bugs are shallow’ (Raymond, 2001)  Translate to VGI it might mean:“The more users there are per area, the better is the positional and attribute quality” But does Linus’ law apply to OSM (and to VGI)?!?
Accuracy and Completeness- Study I   Comparing OSM to OS Meridian 2 roads layer Maridian 2 -Motorways, major and minor roads are... Complex junctions are collapsed to single nodes and multi-carriageways to single links... some minor roads and cul-de-sacs less than 200m are not represented... Private roads and tracks are not included... Nodes are derived from 1:1,250-1:2,500 mapping, with 20m filter around centre line generalisation
Positional Accuracy Meridian 2 and OSM – Motorway comparison
Goodchild and Hunter (1997), Hunter (1999) method Assuming that one dataset is of higher quality Create buffer around the dataset with known width  Calculate the percentage of the evaluated dataset that falls within the buffer
Motorway comparison Buffer of 20m Average of 80% - ranging from 59.81% to 88.80%
Comparison II – Ordnance Survey Master Map Data used for comparison: OS MasterMap Integrated Transport Network (ITN) layer ITN consists of road network information The most accurate and up-to-date geographic reference for Great Britain’s road structure  Any major real world changes are updated within 6 months  Used for numerous applications e.g. Transport management systems, road routing, emergency planning...
Four test locations chosen: TQ28se TQ38se TQ17ne TQ37sw
Buffer analysis – again based on Goodchild and Hunter  (1997) buffer comparison technique: Buffer width (X): X ITN OSM Comparison methodology
Buffer overlap results: 109 roads examined covering over 328 km Results of Master Map comparison
TQ38se (East London) TQ28se (North/Central London)
TQ37sw (South London) TQ17ne (West London)
Quality not linked to length
Completeness – bulk method Assumption: as Meridian 2 is generalised, so for each sq km: If Total length(OSM roads)>Total length(Meridian 2 roads) Than OSM is more complete than Meridian 2 The comparison can also includes attributes, by testing for the number of objects with complete set of values
Methodology 1 3 2 4 5
Change in completeness Mar 2008 – Mar 2010
England – March 2008
England – March 2009
England – October 2009
England – March 2010
Completeness with attributes The test for completeness with attributes checks that roads and streets names have been completed Until the release of Ordnance Survey data in 1st April 2010, this was a good indication for ground survey of an area
England – March 2008
England – March 2009
England – October 2009
England – March 2010
Linus’ law and OSM
Conclusions OSM quality is high – and it is assumed that the quality is coming from aerial imagery  Linus’ Law does not seem to apply in a straight forward manner – at least not from 5 and above More research is required for lower numbers or participants and different quality of imagery
Further reading  Haklay, M., 2008, How good is OpenStreetMap information? A comparative study of OpenStreetMap and Ordnance Survey datasets for London and the rest of England, submitted to Environment and Planning B. Haklay, M. And Weber, P., 2008, OpenStreetMap – User Generated Street Map, IEEE Pervasive Computing. Haklay, M., Singleton, A., and Parker, C., 2008, Web mapping 2.0: the Neogeography of the Geoweb, Geography Compass Haklay, M., 2008, Open Knowledge – learning from environmental information, presented at the Open Knowledge Conference (OKCon) 2008, London, 15 March.  Haklay, M., 2007, OSM and the public - what barriers need to be crossed?presented at State of the Map conference, Manchester, UK, 14-15 July. To get a copy, write to m.haklay@ucl.ac.uk , or get them on povesham.wordpress.com

Contenu connexe

Tendances

Beyond Level of Service – Towards a relative measurement of congestion in pla...
Beyond Level of Service – Towards a relative measurement of congestion in pla...Beyond Level of Service – Towards a relative measurement of congestion in pla...
Beyond Level of Service – Towards a relative measurement of congestion in pla...JumpingJaq
 
Public Transport Accessibility Index for Thiruvananthapuram Urban Area
Public Transport Accessibility Index for Thiruvananthapuram Urban AreaPublic Transport Accessibility Index for Thiruvananthapuram Urban Area
Public Transport Accessibility Index for Thiruvananthapuram Urban AreaIOSR Journals
 
Road2Vec: Measuring Traffic Interactions in Urban Road System from Massive Tr...
Road2Vec: Measuring Traffic Interactions in Urban Road System from Massive Tr...Road2Vec: Measuring Traffic Interactions in Urban Road System from Massive Tr...
Road2Vec: Measuring Traffic Interactions in Urban Road System from Massive Tr...Song Gao
 
Traffic simulation
Traffic simulationTraffic simulation
Traffic simulationPraful -
 
Network analysis in gis
Network analysis in gisNetwork analysis in gis
Network analysis in gisstudent
 
Synopsis defense-version01-hamid
Synopsis defense-version01-hamidSynopsis defense-version01-hamid
Synopsis defense-version01-hamidSafdar Wattu
 
Application of a Markov chain traffic model to the Greater Philadelphia Region
Application of a Markov chain traffic model to the Greater Philadelphia RegionApplication of a Markov chain traffic model to the Greater Philadelphia Region
Application of a Markov chain traffic model to the Greater Philadelphia RegionJoseph Reiter
 
Network analysis for shortest optimum path
Network analysis for shortest optimum pathNetwork analysis for shortest optimum path
Network analysis for shortest optimum pathSourabh Jain
 
Gis based method to analyse vulnerability of transportation infrastructure
Gis based method to analyse vulnerability of transportation infrastructureGis based method to analyse vulnerability of transportation infrastructure
Gis based method to analyse vulnerability of transportation infrastructureHAO YE
 

Tendances (14)

Beyond Level of Service – Towards a relative measurement of congestion in pla...
Beyond Level of Service – Towards a relative measurement of congestion in pla...Beyond Level of Service – Towards a relative measurement of congestion in pla...
Beyond Level of Service – Towards a relative measurement of congestion in pla...
 
Public Transport Accessibility Index for Thiruvananthapuram Urban Area
Public Transport Accessibility Index for Thiruvananthapuram Urban AreaPublic Transport Accessibility Index for Thiruvananthapuram Urban Area
Public Transport Accessibility Index for Thiruvananthapuram Urban Area
 
Road2Vec: Measuring Traffic Interactions in Urban Road System from Massive Tr...
Road2Vec: Measuring Traffic Interactions in Urban Road System from Massive Tr...Road2Vec: Measuring Traffic Interactions in Urban Road System from Massive Tr...
Road2Vec: Measuring Traffic Interactions in Urban Road System from Massive Tr...
 
Traffic simulation
Traffic simulationTraffic simulation
Traffic simulation
 
Traffic simulation and modelling
Traffic simulation and modellingTraffic simulation and modelling
Traffic simulation and modelling
 
Traffic model
Traffic modelTraffic model
Traffic model
 
Session 38 Oded Cats
Session 38 Oded CatsSession 38 Oded Cats
Session 38 Oded Cats
 
Network analysis in gis
Network analysis in gisNetwork analysis in gis
Network analysis in gis
 
A2114065115
A2114065115A2114065115
A2114065115
 
Synopsis defense-version01-hamid
Synopsis defense-version01-hamidSynopsis defense-version01-hamid
Synopsis defense-version01-hamid
 
Application of a Markov chain traffic model to the Greater Philadelphia Region
Application of a Markov chain traffic model to the Greater Philadelphia RegionApplication of a Markov chain traffic model to the Greater Philadelphia Region
Application of a Markov chain traffic model to the Greater Philadelphia Region
 
Network analysis for shortest optimum path
Network analysis for shortest optimum pathNetwork analysis for shortest optimum path
Network analysis for shortest optimum path
 
Network analysis
Network analysisNetwork analysis
Network analysis
 
Gis based method to analyse vulnerability of transportation infrastructure
Gis based method to analyse vulnerability of transportation infrastructureGis based method to analyse vulnerability of transportation infrastructure
Gis based method to analyse vulnerability of transportation infrastructure
 

En vedette

En vedette (10)

Presentación2ticb2222
Presentación2ticb2222Presentación2ticb2222
Presentación2ticb2222
 
CV DOCUMENT
CV DOCUMENTCV DOCUMENT
CV DOCUMENT
 
Coursera Usable Security
Coursera Usable SecurityCoursera Usable Security
Coursera Usable Security
 
Tagging tracking & traffic sources 2016
Tagging tracking & traffic sources 2016Tagging tracking & traffic sources 2016
Tagging tracking & traffic sources 2016
 
7A_4_Gps data collection setting for pedestrian activity modelling
7A_4_Gps data collection setting for pedestrian activity modelling7A_4_Gps data collection setting for pedestrian activity modelling
7A_4_Gps data collection setting for pedestrian activity modelling
 
FarSight Pitch Deck
FarSight Pitch DeckFarSight Pitch Deck
FarSight Pitch Deck
 
Roadmap To Microsoft Training Resources
Roadmap To Microsoft Training ResourcesRoadmap To Microsoft Training Resources
Roadmap To Microsoft Training Resources
 
4. gear manufacturing
4. gear manufacturing4. gear manufacturing
4. gear manufacturing
 
Child labour final
Child labour finalChild labour final
Child labour final
 
Aya Alah Emad CV
Aya Alah Emad CVAya Alah Emad CV
Aya Alah Emad CV
 

Similaire à 4B_1_How many volunteers does it take to map an area well

Osm Quality Assessment 2008
Osm Quality Assessment 2008Osm Quality Assessment 2008
Osm Quality Assessment 2008Muki Haklay
 
OpenStreetMap - The Quality Issue
OpenStreetMap - The Quality IssueOpenStreetMap - The Quality Issue
OpenStreetMap - The Quality IssueCASA, UCL
 
Progress of land ecosystem studies with geo information and space technology ...
Progress of land ecosystem studies with geo information and space technology ...Progress of land ecosystem studies with geo information and space technology ...
Progress of land ecosystem studies with geo information and space technology ...Institute of Space Knowledge Development
 
AGILE_FinalDay_RobinFrew
AGILE_FinalDay_RobinFrewAGILE_FinalDay_RobinFrew
AGILE_FinalDay_RobinFrewRobin Frew
 
Improving transport in Malta using GIS and LBS
Improving transport in Malta using GIS and LBSImproving transport in Malta using GIS and LBS
Improving transport in Malta using GIS and LBSMatthew Pulis
 
M. Coombes, 21st Century Labour Market Areas: concept, methods, applications
M. Coombes, 21st Century Labour Market Areas: concept, methods, applicationsM. Coombes, 21st Century Labour Market Areas: concept, methods, applications
M. Coombes, 21st Century Labour Market Areas: concept, methods, applicationsIstituto nazionale di statistica
 
Qualità dei dati OpenStreetMap: sperimentazioni sulla città di Milano e risul...
Qualità dei dati OpenStreetMap: sperimentazioni sulla città di Milano e risul...Qualità dei dati OpenStreetMap: sperimentazioni sulla città di Milano e risul...
Qualità dei dati OpenStreetMap: sperimentazioni sulla città di Milano e risul...Marco Minghini
 
Transportation service innovation through big data
Transportation service innovation through big data Transportation service innovation through big data
Transportation service innovation through big data cityprojectdevelopment
 
Using spatial presentation of bus GPS data to identify and justify bus priori...
Using spatial presentation of bus GPS data to identify and justify bus priori...Using spatial presentation of bus GPS data to identify and justify bus priori...
Using spatial presentation of bus GPS data to identify and justify bus priori...JumpingJaq
 
Graph Centric Analysis of Road Network Patterns for CBD’s of Metropolitan Cit...
Graph Centric Analysis of Road Network Patterns for CBD’s of Metropolitan Cit...Graph Centric Analysis of Road Network Patterns for CBD’s of Metropolitan Cit...
Graph Centric Analysis of Road Network Patterns for CBD’s of Metropolitan Cit...Punit Sharnagat
 
Applications of GIS to Logistics and Transportation
Applications of GIS to Logistics and TransportationApplications of GIS to Logistics and Transportation
Applications of GIS to Logistics and Transportationsorbi
 
European Location Framework
European Location FrameworkEuropean Location Framework
European Location FrameworkAntti Jakobsson
 
European Location Framework
European Location FrameworkEuropean Location Framework
European Location FrameworkAntti Jakobsson
 
European Location Framework
European Location FrameworkEuropean Location Framework
European Location FrameworkAntti Jakobsson
 
Kiichiro Hatoyama. Toward Congestion Free Metropolis
Kiichiro Hatoyama. Toward Congestion Free MetropolisKiichiro Hatoyama. Toward Congestion Free Metropolis
Kiichiro Hatoyama. Toward Congestion Free MetropolisЮлия Егорова
 
Integrating spatial and thematic data: the CRISOLA case for Malta and the Eur...
Integrating spatial and thematic data: the CRISOLA case for Malta and the Eur...Integrating spatial and thematic data: the CRISOLA case for Malta and the Eur...
Integrating spatial and thematic data: the CRISOLA case for Malta and the Eur...Beniamino Murgante
 
The Distributional Impacts of Transportation Networks in China
The Distributional Impacts of Transportation Networks in ChinaThe Distributional Impacts of Transportation Networks in China
The Distributional Impacts of Transportation Networks in Chinamalin84
 
National Highway Alignment from Namakkal to Erode Using GIS
National Highway Alignment from Namakkal to Erode Using GISNational Highway Alignment from Namakkal to Erode Using GIS
National Highway Alignment from Namakkal to Erode Using GISIJERA Editor
 
APPLICABILITY OF CROWD SOURCING TO DETERMINE THE BEST TRANSPORTATION METHOD B...
APPLICABILITY OF CROWD SOURCING TO DETERMINE THE BEST TRANSPORTATION METHOD B...APPLICABILITY OF CROWD SOURCING TO DETERMINE THE BEST TRANSPORTATION METHOD B...
APPLICABILITY OF CROWD SOURCING TO DETERMINE THE BEST TRANSPORTATION METHOD B...IJDKP
 

Similaire à 4B_1_How many volunteers does it take to map an area well (20)

Osm Quality Assessment 2008
Osm Quality Assessment 2008Osm Quality Assessment 2008
Osm Quality Assessment 2008
 
OpenStreetMap - The Quality Issue
OpenStreetMap - The Quality IssueOpenStreetMap - The Quality Issue
OpenStreetMap - The Quality Issue
 
Progress of land ecosystem studies with geo information and space technology ...
Progress of land ecosystem studies with geo information and space technology ...Progress of land ecosystem studies with geo information and space technology ...
Progress of land ecosystem studies with geo information and space technology ...
 
AGILE_FinalDay_RobinFrew
AGILE_FinalDay_RobinFrewAGILE_FinalDay_RobinFrew
AGILE_FinalDay_RobinFrew
 
Improving transport in Malta using GIS and LBS
Improving transport in Malta using GIS and LBSImproving transport in Malta using GIS and LBS
Improving transport in Malta using GIS and LBS
 
M. Coombes, 21st Century Labour Market Areas: concept, methods, applications
M. Coombes, 21st Century Labour Market Areas: concept, methods, applicationsM. Coombes, 21st Century Labour Market Areas: concept, methods, applications
M. Coombes, 21st Century Labour Market Areas: concept, methods, applications
 
Qualità dei dati OpenStreetMap: sperimentazioni sulla città di Milano e risul...
Qualità dei dati OpenStreetMap: sperimentazioni sulla città di Milano e risul...Qualità dei dati OpenStreetMap: sperimentazioni sulla città di Milano e risul...
Qualità dei dati OpenStreetMap: sperimentazioni sulla città di Milano e risul...
 
Transportation service innovation through big data
Transportation service innovation through big data Transportation service innovation through big data
Transportation service innovation through big data
 
Using spatial presentation of bus GPS data to identify and justify bus priori...
Using spatial presentation of bus GPS data to identify and justify bus priori...Using spatial presentation of bus GPS data to identify and justify bus priori...
Using spatial presentation of bus GPS data to identify and justify bus priori...
 
Strategic Visitor Flows (SVF) analysis using mobile data
Strategic Visitor Flows (SVF) analysis using mobile dataStrategic Visitor Flows (SVF) analysis using mobile data
Strategic Visitor Flows (SVF) analysis using mobile data
 
Graph Centric Analysis of Road Network Patterns for CBD’s of Metropolitan Cit...
Graph Centric Analysis of Road Network Patterns for CBD’s of Metropolitan Cit...Graph Centric Analysis of Road Network Patterns for CBD’s of Metropolitan Cit...
Graph Centric Analysis of Road Network Patterns for CBD’s of Metropolitan Cit...
 
Applications of GIS to Logistics and Transportation
Applications of GIS to Logistics and TransportationApplications of GIS to Logistics and Transportation
Applications of GIS to Logistics and Transportation
 
European Location Framework
European Location FrameworkEuropean Location Framework
European Location Framework
 
European Location Framework
European Location FrameworkEuropean Location Framework
European Location Framework
 
European Location Framework
European Location FrameworkEuropean Location Framework
European Location Framework
 
Kiichiro Hatoyama. Toward Congestion Free Metropolis
Kiichiro Hatoyama. Toward Congestion Free MetropolisKiichiro Hatoyama. Toward Congestion Free Metropolis
Kiichiro Hatoyama. Toward Congestion Free Metropolis
 
Integrating spatial and thematic data: the CRISOLA case for Malta and the Eur...
Integrating spatial and thematic data: the CRISOLA case for Malta and the Eur...Integrating spatial and thematic data: the CRISOLA case for Malta and the Eur...
Integrating spatial and thematic data: the CRISOLA case for Malta and the Eur...
 
The Distributional Impacts of Transportation Networks in China
The Distributional Impacts of Transportation Networks in ChinaThe Distributional Impacts of Transportation Networks in China
The Distributional Impacts of Transportation Networks in China
 
National Highway Alignment from Namakkal to Erode Using GIS
National Highway Alignment from Namakkal to Erode Using GISNational Highway Alignment from Namakkal to Erode Using GIS
National Highway Alignment from Namakkal to Erode Using GIS
 
APPLICABILITY OF CROWD SOURCING TO DETERMINE THE BEST TRANSPORTATION METHOD B...
APPLICABILITY OF CROWD SOURCING TO DETERMINE THE BEST TRANSPORTATION METHOD B...APPLICABILITY OF CROWD SOURCING TO DETERMINE THE BEST TRANSPORTATION METHOD B...
APPLICABILITY OF CROWD SOURCING TO DETERMINE THE BEST TRANSPORTATION METHOD B...
 

Plus de GISRUK conference

7B_3_Matterhorn on the horizon
7B_3_Matterhorn on the horizon7B_3_Matterhorn on the horizon
7B_3_Matterhorn on the horizonGISRUK conference
 
7B_2_Topological consistent generalization of openstreetmap
7B_2_Topological consistent generalization of openstreetmap7B_2_Topological consistent generalization of openstreetmap
7B_2_Topological consistent generalization of openstreetmapGISRUK conference
 
5A_3_GIS based spatial modelling for improving the sustainability of aggregat...
5A_3_GIS based spatial modelling for improving the sustainability of aggregat...5A_3_GIS based spatial modelling for improving the sustainability of aggregat...
5A_3_GIS based spatial modelling for improving the sustainability of aggregat...GISRUK conference
 
5A_1_Land evaluation techniques comparing fuzzy ahp with ideal point methods
5A_1_Land evaluation techniques comparing fuzzy ahp with ideal point methods5A_1_Land evaluation techniques comparing fuzzy ahp with ideal point methods
5A_1_Land evaluation techniques comparing fuzzy ahp with ideal point methodsGISRUK conference
 
4B_3_Automatically generating keywods for georeferenced imaged
4B_3_Automatically generating keywods for georeferenced imaged4B_3_Automatically generating keywods for georeferenced imaged
4B_3_Automatically generating keywods for georeferenced imagedGISRUK conference
 
4A_1_Uncertainty in the 2001 output area classification for the census of eng...
4A_1_Uncertainty in the 2001 output area classification for the census of eng...4A_1_Uncertainty in the 2001 output area classification for the census of eng...
4A_1_Uncertainty in the 2001 output area classification for the census of eng...GISRUK conference
 
3A_4_Applying network analysis to quantify accessibility to urban greenspace ...
3A_4_Applying network analysis to quantify accessibility to urban greenspace ...3A_4_Applying network analysis to quantify accessibility to urban greenspace ...
3A_4_Applying network analysis to quantify accessibility to urban greenspace ...GISRUK conference
 
3A_2_Modelling health-harming behaviours in a socially ranked geographic space
3A_2_Modelling health-harming behaviours in a socially ranked geographic space3A_2_Modelling health-harming behaviours in a socially ranked geographic space
3A_2_Modelling health-harming behaviours in a socially ranked geographic spaceGISRUK conference
 
1A_3_A geodemographic classification of london primary schools
1A_3_A geodemographic classification of london primary schools1A_3_A geodemographic classification of london primary schools
1A_3_A geodemographic classification of london primary schoolsGISRUK conference
 
UK Map Challenge Aidan Slingsby
UK Map Challenge   Aidan SlingsbyUK Map Challenge   Aidan Slingsby
UK Map Challenge Aidan SlingsbyGISRUK conference
 
SP_4 Supporting spatial negotiations in land use planning
SP_4 Supporting spatial negotiations in land use planningSP_4 Supporting spatial negotiations in land use planning
SP_4 Supporting spatial negotiations in land use planningGISRUK conference
 
SP_3 Automatic identification of high streets and classification of urban lan...
SP_3 Automatic identification of high streets and classification of urban lan...SP_3 Automatic identification of high streets and classification of urban lan...
SP_3 Automatic identification of high streets and classification of urban lan...GISRUK conference
 
9B_1_Trust in web gis a preliminary investigation of the environment agencys ...
9B_1_Trust in web gis a preliminary investigation of the environment agencys ...9B_1_Trust in web gis a preliminary investigation of the environment agencys ...
9B_1_Trust in web gis a preliminary investigation of the environment agencys ...GISRUK conference
 
9A_2_Automatic classification of retail spaces from a large scale topographc ...
9A_2_Automatic classification of retail spaces from a large scale topographc ...9A_2_Automatic classification of retail spaces from a large scale topographc ...
9A_2_Automatic classification of retail spaces from a large scale topographc ...GISRUK conference
 
9A_1_On automatic mapping of environmental data using adaptive general regres...
9A_1_On automatic mapping of environmental data using adaptive general regres...9A_1_On automatic mapping of environmental data using adaptive general regres...
9A_1_On automatic mapping of environmental data using adaptive general regres...GISRUK conference
 
8B_4_Exploring the usability of geographic information
8B_4_Exploring the usability of geographic information8B_4_Exploring the usability of geographic information
8B_4_Exploring the usability of geographic informationGISRUK conference
 
8B_2_Using sound to represent uncertainty in address locations
8B_2_Using sound to represent uncertainty in address locations8B_2_Using sound to represent uncertainty in address locations
8B_2_Using sound to represent uncertainty in address locationsGISRUK conference
 
8B_1_A map to hear - use of sound in enhancing the map use experience
8B_1_A map to hear - use of sound in enhancing the map use experience8B_1_A map to hear - use of sound in enhancing the map use experience
8B_1_A map to hear - use of sound in enhancing the map use experienceGISRUK conference
 
8A_2_A containment-first search algorithm for higher-order analysis of urban ...
8A_2_A containment-first search algorithm for higher-order analysis of urban ...8A_2_A containment-first search algorithm for higher-order analysis of urban ...
8A_2_A containment-first search algorithm for higher-order analysis of urban ...GISRUK conference
 

Plus de GISRUK conference (20)

8A_1_To vote or not to vote
8A_1_To vote or not to vote8A_1_To vote or not to vote
8A_1_To vote or not to vote
 
7B_3_Matterhorn on the horizon
7B_3_Matterhorn on the horizon7B_3_Matterhorn on the horizon
7B_3_Matterhorn on the horizon
 
7B_2_Topological consistent generalization of openstreetmap
7B_2_Topological consistent generalization of openstreetmap7B_2_Topological consistent generalization of openstreetmap
7B_2_Topological consistent generalization of openstreetmap
 
5A_3_GIS based spatial modelling for improving the sustainability of aggregat...
5A_3_GIS based spatial modelling for improving the sustainability of aggregat...5A_3_GIS based spatial modelling for improving the sustainability of aggregat...
5A_3_GIS based spatial modelling for improving the sustainability of aggregat...
 
5A_1_Land evaluation techniques comparing fuzzy ahp with ideal point methods
5A_1_Land evaluation techniques comparing fuzzy ahp with ideal point methods5A_1_Land evaluation techniques comparing fuzzy ahp with ideal point methods
5A_1_Land evaluation techniques comparing fuzzy ahp with ideal point methods
 
4B_3_Automatically generating keywods for georeferenced imaged
4B_3_Automatically generating keywods for georeferenced imaged4B_3_Automatically generating keywods for georeferenced imaged
4B_3_Automatically generating keywods for georeferenced imaged
 
4A_1_Uncertainty in the 2001 output area classification for the census of eng...
4A_1_Uncertainty in the 2001 output area classification for the census of eng...4A_1_Uncertainty in the 2001 output area classification for the census of eng...
4A_1_Uncertainty in the 2001 output area classification for the census of eng...
 
3A_4_Applying network analysis to quantify accessibility to urban greenspace ...
3A_4_Applying network analysis to quantify accessibility to urban greenspace ...3A_4_Applying network analysis to quantify accessibility to urban greenspace ...
3A_4_Applying network analysis to quantify accessibility to urban greenspace ...
 
3A_2_Modelling health-harming behaviours in a socially ranked geographic space
3A_2_Modelling health-harming behaviours in a socially ranked geographic space3A_2_Modelling health-harming behaviours in a socially ranked geographic space
3A_2_Modelling health-harming behaviours in a socially ranked geographic space
 
1A_3_A geodemographic classification of london primary schools
1A_3_A geodemographic classification of london primary schools1A_3_A geodemographic classification of london primary schools
1A_3_A geodemographic classification of london primary schools
 
UK Map Challenge Aidan Slingsby
UK Map Challenge   Aidan SlingsbyUK Map Challenge   Aidan Slingsby
UK Map Challenge Aidan Slingsby
 
SP_4 Supporting spatial negotiations in land use planning
SP_4 Supporting spatial negotiations in land use planningSP_4 Supporting spatial negotiations in land use planning
SP_4 Supporting spatial negotiations in land use planning
 
SP_3 Automatic identification of high streets and classification of urban lan...
SP_3 Automatic identification of high streets and classification of urban lan...SP_3 Automatic identification of high streets and classification of urban lan...
SP_3 Automatic identification of high streets and classification of urban lan...
 
9B_1_Trust in web gis a preliminary investigation of the environment agencys ...
9B_1_Trust in web gis a preliminary investigation of the environment agencys ...9B_1_Trust in web gis a preliminary investigation of the environment agencys ...
9B_1_Trust in web gis a preliminary investigation of the environment agencys ...
 
9A_2_Automatic classification of retail spaces from a large scale topographc ...
9A_2_Automatic classification of retail spaces from a large scale topographc ...9A_2_Automatic classification of retail spaces from a large scale topographc ...
9A_2_Automatic classification of retail spaces from a large scale topographc ...
 
9A_1_On automatic mapping of environmental data using adaptive general regres...
9A_1_On automatic mapping of environmental data using adaptive general regres...9A_1_On automatic mapping of environmental data using adaptive general regres...
9A_1_On automatic mapping of environmental data using adaptive general regres...
 
8B_4_Exploring the usability of geographic information
8B_4_Exploring the usability of geographic information8B_4_Exploring the usability of geographic information
8B_4_Exploring the usability of geographic information
 
8B_2_Using sound to represent uncertainty in address locations
8B_2_Using sound to represent uncertainty in address locations8B_2_Using sound to represent uncertainty in address locations
8B_2_Using sound to represent uncertainty in address locations
 
8B_1_A map to hear - use of sound in enhancing the map use experience
8B_1_A map to hear - use of sound in enhancing the map use experience8B_1_A map to hear - use of sound in enhancing the map use experience
8B_1_A map to hear - use of sound in enhancing the map use experience
 
8A_2_A containment-first search algorithm for higher-order analysis of urban ...
8A_2_A containment-first search algorithm for higher-order analysis of urban ...8A_2_A containment-first search algorithm for higher-order analysis of urban ...
8A_2_A containment-first search algorithm for higher-order analysis of urban ...
 

4B_1_How many volunteers does it take to map an area well

  • 1. How Many Volunteers Does It Take To Map An Area Well? Dr Muki Haklay Department of Civil, Environmental and Geomatic Engineering, UCL m.haklay@ucl.ac.uk AamerAther (M.Eng 2009), Sofia Basiouka (MSc GIS 2009) and NaureenZulfiqar (M.Eng 2008) Ordnance Survey data was kindly provided by the Ordnance Survey research unit. OSM data was provided by GeoFabrik & CloudMade
  • 2. Outline A bit about quality of geographical information Evaluation of OSM with Meridian data set Evaluation of OSM with MasterMap Linus’ low –more users: higher quality?
  • 3. The quality issue How good it the data? First question: good for what? Subjective quality – fitness for purpose/use Second question: how to measure? Objective quality – but need to evaluate it in light of the first question
  • 4. The quality issue How good it the data? Positional accuracy – the position of features or geographic objects in either two or three dimensions Temporal accuracy – how up to date is the data? Does it presents the existing situation and when will it be updated? Thematic/attribute accuracy – for quantitative attributes (width) and qualitative attributes (geographic names) Completeness – The presence and absence of objects in a dataset at a particular point in time Logical consistency –adherence to the logical rules of the data structure, attribution and relationships
  • 5. The ‘problem’ “We know little about the people that collect it, their skills, knowledge or patterns of data collection” “Loose coordination and no top-down quality assurance processes – can’t produce good data” “It is not complete and comprehensive – there are white areas”
  • 10.
  • 11. Users Limited ‘on the ground’ collaboration. Important as this can be the main source of quality assurance - ‘Given enough eyeballs, all bugs are shallow’ (Raymond, 2001) Translate to VGI it might mean:“The more users there are per area, the better is the positional and attribute quality” But does Linus’ law apply to OSM (and to VGI)?!?
  • 12. Accuracy and Completeness- Study I Comparing OSM to OS Meridian 2 roads layer Maridian 2 -Motorways, major and minor roads are... Complex junctions are collapsed to single nodes and multi-carriageways to single links... some minor roads and cul-de-sacs less than 200m are not represented... Private roads and tracks are not included... Nodes are derived from 1:1,250-1:2,500 mapping, with 20m filter around centre line generalisation
  • 13. Positional Accuracy Meridian 2 and OSM – Motorway comparison
  • 14. Goodchild and Hunter (1997), Hunter (1999) method Assuming that one dataset is of higher quality Create buffer around the dataset with known width Calculate the percentage of the evaluated dataset that falls within the buffer
  • 15. Motorway comparison Buffer of 20m Average of 80% - ranging from 59.81% to 88.80%
  • 16. Comparison II – Ordnance Survey Master Map Data used for comparison: OS MasterMap Integrated Transport Network (ITN) layer ITN consists of road network information The most accurate and up-to-date geographic reference for Great Britain’s road structure Any major real world changes are updated within 6 months Used for numerous applications e.g. Transport management systems, road routing, emergency planning...
  • 17. Four test locations chosen: TQ28se TQ38se TQ17ne TQ37sw
  • 18. Buffer analysis – again based on Goodchild and Hunter (1997) buffer comparison technique: Buffer width (X): X ITN OSM Comparison methodology
  • 19. Buffer overlap results: 109 roads examined covering over 328 km Results of Master Map comparison
  • 20. TQ38se (East London) TQ28se (North/Central London)
  • 21. TQ37sw (South London) TQ17ne (West London)
  • 22.
  • 23. Quality not linked to length
  • 24. Completeness – bulk method Assumption: as Meridian 2 is generalised, so for each sq km: If Total length(OSM roads)>Total length(Meridian 2 roads) Than OSM is more complete than Meridian 2 The comparison can also includes attributes, by testing for the number of objects with complete set of values
  • 26. Change in completeness Mar 2008 – Mar 2010
  • 31. Completeness with attributes The test for completeness with attributes checks that roads and streets names have been completed Until the release of Ordnance Survey data in 1st April 2010, this was a good indication for ground survey of an area
  • 36.
  • 38. Conclusions OSM quality is high – and it is assumed that the quality is coming from aerial imagery Linus’ Law does not seem to apply in a straight forward manner – at least not from 5 and above More research is required for lower numbers or participants and different quality of imagery
  • 39. Further reading Haklay, M., 2008, How good is OpenStreetMap information? A comparative study of OpenStreetMap and Ordnance Survey datasets for London and the rest of England, submitted to Environment and Planning B. Haklay, M. And Weber, P., 2008, OpenStreetMap – User Generated Street Map, IEEE Pervasive Computing. Haklay, M., Singleton, A., and Parker, C., 2008, Web mapping 2.0: the Neogeography of the Geoweb, Geography Compass Haklay, M., 2008, Open Knowledge – learning from environmental information, presented at the Open Knowledge Conference (OKCon) 2008, London, 15 March. Haklay, M., 2007, OSM and the public - what barriers need to be crossed?presented at State of the Map conference, Manchester, UK, 14-15 July. To get a copy, write to m.haklay@ucl.ac.uk , or get them on povesham.wordpress.com

Notes de l'éditeur

  1. The higher level dataset to be used was OS MasterMap. The Ordnance Survey’s MasterMap is a database that records every fixed feature of Great Britain larger than a few metres in one continuous digital map – It is a framework on which future OS products will be based The ITN layer is one of four layers, and the layer used for this project. ITN obtained from Edina, OSM for CloudeMadeITN data obtained in GML format, which was then converted to shp file format, and the required OSM downloaded as shp file.
  2. - 25km square grid , based on OS 1:10000 grid tilesThe tiles were chosen to cover a range of London, North, South, East & West, two close and two further out from the centre, so that a good sample was chosen
  3. Requires the use of a reference source (ITN) and a test source (OSM)Here we have an example of an ITN and corresponding OSM road feature.ITN is the higher level dataset, and therefore the ITN road feature is considered to be the actual centreline of the road. A buffer of width x is created around the ITN featureThe proportion of the OSM road that lies that lies within the buffer is then calculatedThis analysis carried out for every A-road, B-road and motorway feature in the datasets
  4. The results are very high, indicating that the positional accuracy of OSM is very good.In particular, the South London tile had an average of 92.62% overlap for A-roads, and North/Central London tile had 81.46% overlap for B-roads.Only one motorway segment (very high)
  5. - Results maps for each of the four test regions- The distribution of results in the histogram is clearly illustrated by the results maps. Anything above 90% overlap in red, between 80-90% in orange, and between 70-80% in yellow.