SlideShare une entreprise Scribd logo
1  sur  40
A step towards the improvement of spatial data quality of Web 2.0 geo-applications The case of OpenStreetMap Vyron Antoniou, Muki Haklay, Jeremy Morley Department of Civil, Environmental  and Geomatic Engineering
A fundamental GIS problem Information System Real World http://www.bing.com/maps Google Earth
 
OSM Map Features
Wiki Democracy +
OSM Data Geometry Attributes (Tags) +
OSM’s Geometry Haklay et al. Antoniou et al. Completeness Positional Accuracy
Tags?
 
Unique Tags vs Total Tags for each OSM Feature Category (GB) Sum: 2.25M tags
How many tags do we have for each entity?
Residential (2826) Primary (623) How often there is a new Tag introduced?
How often there is a new Tag introduced?
Unique Tags vs Popular Tags (95% of population)
From OSM wiki-pages to XML Schema XML Schema = OSM Specification
From OSM wiki-pages to XML Schema
[object Object],[object Object],[object Object],[object Object]
[object Object],[object Object],[object Object],[object Object]
Merkaartor Potlatch JOSM Freedom, Formalization and Quality Standards?
Freedom, Formalization and Quality Standards?
 
 
Final Points ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Thank   you
Thank   you!
User Discrepancies Start and Stop Digitizing?
Name: X Road One-way: Yes  Name: Y Road One-way: No  User Discrepancies
OSM Quality Assurance Openstreetbugs Keep Right
The purpose of this specification  (566 pages!) is to: •  provide a list of the RWOs present in OS MasterMap data; •  define the RWOs present in OS MasterMap data; •  list the  features  and  attributes  of each of the RWOs in OS MasterMap data; •  clarify the representation of the RWOs in OS MasterMap data when needed;  a nd •  provide a document that will accommodate change as OS MasterMap data is enhanced.
OSM Growth Source: http://www.openstreetmap.org
 
 
OSM Data Quality Tests
 
 
Video
Tags: 2.25M tags in 18 OSM Feature Categories   (UK)
Unique Tags per OSM Feature Category
Unique Tags vs Total Tags for each OSM Feature Category
What is really new with OSM? Free  Raster maps…..Google Maps, Bing Maps, Yahoo! Maps? Licence terms User generated Content Geometry….. Vector data Attributes….. Tags

Contenu connexe

En vedette

#FuturePub - Citizen Science, Open Science & scientific publications
#FuturePub - Citizen Science, Open Science & scientific publications#FuturePub - Citizen Science, Open Science & scientific publications
#FuturePub - Citizen Science, Open Science & scientific publicationsMuki Haklay
 
EEA EPA Network - Frameworks for Citizen Science - 2015
EEA EPA Network - Frameworks for Citizen Science - 2015EEA EPA Network - Frameworks for Citizen Science - 2015
EEA EPA Network - Frameworks for Citizen Science - 2015Muki Haklay
 
INSPIRE 2014 conference
INSPIRE 2014 conferenceINSPIRE 2014 conference
INSPIRE 2014 conferenceMuki Haklay
 
Extreme Citizen Science: Current Development
Extreme Citizen Science: Current Development Extreme Citizen Science: Current Development
Extreme Citizen Science: Current Development Muki Haklay
 
Oxford Martin School talk - May 2014
Oxford Martin School talk - May 2014Oxford Martin School talk - May 2014
Oxford Martin School talk - May 2014Muki Haklay
 
Citizen Observatories: Mapping for Change air quality studies
Citizen Observatories:  Mapping for Change air quality studiesCitizen Observatories:  Mapping for Change air quality studies
Citizen Observatories: Mapping for Change air quality studiesMuki Haklay
 
Beyond good enough? Spatial Data Quality and OpenStreetMap data
Beyond good enough? Spatial Data Quality and OpenStreetMap dataBeyond good enough? Spatial Data Quality and OpenStreetMap data
Beyond good enough? Spatial Data Quality and OpenStreetMap dataMuki Haklay
 
Overview of Citizen Science - Zurich November 2015
Overview of Citizen Science - Zurich November 2015Overview of Citizen Science - Zurich November 2015
Overview of Citizen Science - Zurich November 2015Muki Haklay
 
Eye on Earth Summit - Data Revolution plenary
Eye on Earth Summit - Data Revolution plenary Eye on Earth Summit - Data Revolution plenary
Eye on Earth Summit - Data Revolution plenary Muki Haklay
 
Citizen Science & Geographical Technologies: creativity, learning, and engage...
Citizen Science & Geographical Technologies: creativity, learning, and engage...Citizen Science & Geographical Technologies: creativity, learning, and engage...
Citizen Science & Geographical Technologies: creativity, learning, and engage...Muki Haklay
 
Citizen science - theory, practice & policy workshop
Citizen science - theory, practice & policy workshopCitizen science - theory, practice & policy workshop
Citizen science - theory, practice & policy workshopMuki Haklay
 
Data and the City workshop 2015
Data and the City workshop 2015Data and the City workshop 2015
Data and the City workshop 2015Muki Haklay
 
Haw GIScience lost its interdisciplinary mojo?
Haw GIScience lost its interdisciplinary mojo?Haw GIScience lost its interdisciplinary mojo?
Haw GIScience lost its interdisciplinary mojo?Muki Haklay
 
Participatory [Citizen] Science
Participatory [Citizen] ScienceParticipatory [Citizen] Science
Participatory [Citizen] ScienceMuki Haklay
 
Algorithmic governance in environmental information (or how technophilia shap...
Algorithmic governance in environmental information (or how technophilia shap...Algorithmic governance in environmental information (or how technophilia shap...
Algorithmic governance in environmental information (or how technophilia shap...Muki Haklay
 
Volunteered Geographic Information and OpenStreetMap
Volunteered Geographic Information and OpenStreetMapVolunteered Geographic Information and OpenStreetMap
Volunteered Geographic Information and OpenStreetMapchippy
 
EEO/AGI-Scotland 2015: Citizen Science and GIScience - background and common ...
EEO/AGI-Scotland 2015: Citizen Science and GIScience - background and common ...EEO/AGI-Scotland 2015: Citizen Science and GIScience - background and common ...
EEO/AGI-Scotland 2015: Citizen Science and GIScience - background and common ...Muki Haklay
 
OpenStreetMap - The Quality Issue
OpenStreetMap - The Quality IssueOpenStreetMap - The Quality Issue
OpenStreetMap - The Quality IssueCASA, UCL
 
AAG 2015 - COST Energic, VGI, Citizen Science and OpenStreetMap
AAG 2015 - COST Energic, VGI, Citizen Science and OpenStreetMap AAG 2015 - COST Energic, VGI, Citizen Science and OpenStreetMap
AAG 2015 - COST Energic, VGI, Citizen Science and OpenStreetMap Muki Haklay
 

En vedette (20)

#FuturePub - Citizen Science, Open Science & scientific publications
#FuturePub - Citizen Science, Open Science & scientific publications#FuturePub - Citizen Science, Open Science & scientific publications
#FuturePub - Citizen Science, Open Science & scientific publications
 
AlfredoConetta_EGM712_GIS_Project
AlfredoConetta_EGM712_GIS_ProjectAlfredoConetta_EGM712_GIS_Project
AlfredoConetta_EGM712_GIS_Project
 
EEA EPA Network - Frameworks for Citizen Science - 2015
EEA EPA Network - Frameworks for Citizen Science - 2015EEA EPA Network - Frameworks for Citizen Science - 2015
EEA EPA Network - Frameworks for Citizen Science - 2015
 
INSPIRE 2014 conference
INSPIRE 2014 conferenceINSPIRE 2014 conference
INSPIRE 2014 conference
 
Extreme Citizen Science: Current Development
Extreme Citizen Science: Current Development Extreme Citizen Science: Current Development
Extreme Citizen Science: Current Development
 
Oxford Martin School talk - May 2014
Oxford Martin School talk - May 2014Oxford Martin School talk - May 2014
Oxford Martin School talk - May 2014
 
Citizen Observatories: Mapping for Change air quality studies
Citizen Observatories:  Mapping for Change air quality studiesCitizen Observatories:  Mapping for Change air quality studies
Citizen Observatories: Mapping for Change air quality studies
 
Beyond good enough? Spatial Data Quality and OpenStreetMap data
Beyond good enough? Spatial Data Quality and OpenStreetMap dataBeyond good enough? Spatial Data Quality and OpenStreetMap data
Beyond good enough? Spatial Data Quality and OpenStreetMap data
 
Overview of Citizen Science - Zurich November 2015
Overview of Citizen Science - Zurich November 2015Overview of Citizen Science - Zurich November 2015
Overview of Citizen Science - Zurich November 2015
 
Eye on Earth Summit - Data Revolution plenary
Eye on Earth Summit - Data Revolution plenary Eye on Earth Summit - Data Revolution plenary
Eye on Earth Summit - Data Revolution plenary
 
Citizen Science & Geographical Technologies: creativity, learning, and engage...
Citizen Science & Geographical Technologies: creativity, learning, and engage...Citizen Science & Geographical Technologies: creativity, learning, and engage...
Citizen Science & Geographical Technologies: creativity, learning, and engage...
 
Citizen science - theory, practice & policy workshop
Citizen science - theory, practice & policy workshopCitizen science - theory, practice & policy workshop
Citizen science - theory, practice & policy workshop
 
Data and the City workshop 2015
Data and the City workshop 2015Data and the City workshop 2015
Data and the City workshop 2015
 
Haw GIScience lost its interdisciplinary mojo?
Haw GIScience lost its interdisciplinary mojo?Haw GIScience lost its interdisciplinary mojo?
Haw GIScience lost its interdisciplinary mojo?
 
Participatory [Citizen] Science
Participatory [Citizen] ScienceParticipatory [Citizen] Science
Participatory [Citizen] Science
 
Algorithmic governance in environmental information (or how technophilia shap...
Algorithmic governance in environmental information (or how technophilia shap...Algorithmic governance in environmental information (or how technophilia shap...
Algorithmic governance in environmental information (or how technophilia shap...
 
Volunteered Geographic Information and OpenStreetMap
Volunteered Geographic Information and OpenStreetMapVolunteered Geographic Information and OpenStreetMap
Volunteered Geographic Information and OpenStreetMap
 
EEO/AGI-Scotland 2015: Citizen Science and GIScience - background and common ...
EEO/AGI-Scotland 2015: Citizen Science and GIScience - background and common ...EEO/AGI-Scotland 2015: Citizen Science and GIScience - background and common ...
EEO/AGI-Scotland 2015: Citizen Science and GIScience - background and common ...
 
OpenStreetMap - The Quality Issue
OpenStreetMap - The Quality IssueOpenStreetMap - The Quality Issue
OpenStreetMap - The Quality Issue
 
AAG 2015 - COST Energic, VGI, Citizen Science and OpenStreetMap
AAG 2015 - COST Energic, VGI, Citizen Science and OpenStreetMap AAG 2015 - COST Energic, VGI, Citizen Science and OpenStreetMap
AAG 2015 - COST Energic, VGI, Citizen Science and OpenStreetMap
 

Similaire à 4B_2_A step towards the improvement of spatial quality of web 2.0 geo-applications

What's New in ArcGIS 10.1 Data Interoperability Extension
What's New in ArcGIS 10.1 Data Interoperability ExtensionWhat's New in ArcGIS 10.1 Data Interoperability Extension
What's New in ArcGIS 10.1 Data Interoperability ExtensionSafe Software
 
Comprehensive Overview of the Geoweb
Comprehensive Overview of the GeowebComprehensive Overview of the Geoweb
Comprehensive Overview of the GeowebGovernment/CU Denver
 
How to Automatically Create Beautiful Cartographic Labels Using FME’s New Map...
How to Automatically Create Beautiful Cartographic Labels Using FME’s New Map...How to Automatically Create Beautiful Cartographic Labels Using FME’s New Map...
How to Automatically Create Beautiful Cartographic Labels Using FME’s New Map...Safe Software
 
Big data use cases in the cloud presentation
Big data use cases in the cloud presentationBig data use cases in the cloud presentation
Big data use cases in the cloud presentationTUSHAR GARG
 
Big data tutorial_part4
Big data tutorial_part4Big data tutorial_part4
Big data tutorial_part4GV prasad
 
Big data tutorial_part4
Big data tutorial_part4Big data tutorial_part4
Big data tutorial_part4Pragati Singh
 
Big data tutorial_part4
Big data tutorial_part4Big data tutorial_part4
Big data tutorial_part4heyramzz
 
OpenStreetMap Past, Present and Future
OpenStreetMap Past, Present and FutureOpenStreetMap Past, Present and Future
OpenStreetMap Past, Present and FuturePeter Batty
 
JACIC
JACICJACIC
JACICDMacP
 
Change Detection Research for the National Map
Change Detection Research for the National MapChange Detection Research for the National Map
Change Detection Research for the National MapGeCo in the Rockies
 
Maps4Finland 28.8.2012, Jari Reini
Maps4Finland 28.8.2012, Jari ReiniMaps4Finland 28.8.2012, Jari Reini
Maps4Finland 28.8.2012, Jari ReiniApps4Finland
 
Maps4 finland 28.8.2012, jari reini
Maps4 finland 28.8.2012, jari reiniMaps4 finland 28.8.2012, jari reini
Maps4 finland 28.8.2012, jari reiniOlli Rinne
 
Tnmc mc andrew_sotmus13_rev2
Tnmc mc andrew_sotmus13_rev2Tnmc mc andrew_sotmus13_rev2
Tnmc mc andrew_sotmus13_rev2JimmyRocks
 
Sound cloud - User & Partner Conference - AT Internet
Sound cloud - User & Partner Conference - AT InternetSound cloud - User & Partner Conference - AT Internet
Sound cloud - User & Partner Conference - AT InternetAT Internet
 
2010 Gist
2010 Gist2010 Gist
2010 Gistfjudson
 
Open Source GIS
Open Source GISOpen Source GIS
Open Source GISJoe Larson
 

Similaire à 4B_2_A step towards the improvement of spatial quality of web 2.0 geo-applications (20)

What's New in ArcGIS 10.1 Data Interoperability Extension
What's New in ArcGIS 10.1 Data Interoperability ExtensionWhat's New in ArcGIS 10.1 Data Interoperability Extension
What's New in ArcGIS 10.1 Data Interoperability Extension
 
Comprehensive Overview of the Geoweb
Comprehensive Overview of the GeowebComprehensive Overview of the Geoweb
Comprehensive Overview of the Geoweb
 
How to Automatically Create Beautiful Cartographic Labels Using FME’s New Map...
How to Automatically Create Beautiful Cartographic Labels Using FME’s New Map...How to Automatically Create Beautiful Cartographic Labels Using FME’s New Map...
How to Automatically Create Beautiful Cartographic Labels Using FME’s New Map...
 
Lenguajes y plataformas de desarrollo
Lenguajes y plataformas de desarrolloLenguajes y plataformas de desarrollo
Lenguajes y plataformas de desarrollo
 
Big Data
Big DataBig Data
Big Data
 
Big data use cases in the cloud presentation
Big data use cases in the cloud presentationBig data use cases in the cloud presentation
Big data use cases in the cloud presentation
 
Big data tutorial_part4
Big data tutorial_part4Big data tutorial_part4
Big data tutorial_part4
 
Big data tutorial_part4
Big data tutorial_part4Big data tutorial_part4
Big data tutorial_part4
 
Big data tutorial
Big data tutorialBig data tutorial
Big data tutorial
 
Big data tutorial_part4
Big data tutorial_part4Big data tutorial_part4
Big data tutorial_part4
 
Big data tutorial_part4
Big data tutorial_part4Big data tutorial_part4
Big data tutorial_part4
 
OpenStreetMap Past, Present and Future
OpenStreetMap Past, Present and FutureOpenStreetMap Past, Present and Future
OpenStreetMap Past, Present and Future
 
JACIC
JACICJACIC
JACIC
 
Change Detection Research for the National Map
Change Detection Research for the National MapChange Detection Research for the National Map
Change Detection Research for the National Map
 
Maps4Finland 28.8.2012, Jari Reini
Maps4Finland 28.8.2012, Jari ReiniMaps4Finland 28.8.2012, Jari Reini
Maps4Finland 28.8.2012, Jari Reini
 
Maps4 finland 28.8.2012, jari reini
Maps4 finland 28.8.2012, jari reiniMaps4 finland 28.8.2012, jari reini
Maps4 finland 28.8.2012, jari reini
 
Tnmc mc andrew_sotmus13_rev2
Tnmc mc andrew_sotmus13_rev2Tnmc mc andrew_sotmus13_rev2
Tnmc mc andrew_sotmus13_rev2
 
Sound cloud - User & Partner Conference - AT Internet
Sound cloud - User & Partner Conference - AT InternetSound cloud - User & Partner Conference - AT Internet
Sound cloud - User & Partner Conference - AT Internet
 
2010 Gist
2010 Gist2010 Gist
2010 Gist
 
Open Source GIS
Open Source GISOpen Source GIS
Open Source GIS
 

Plus de GISRUK conference

7B_3_Matterhorn on the horizon
7B_3_Matterhorn on the horizon7B_3_Matterhorn on the horizon
7B_3_Matterhorn on the horizonGISRUK conference
 
7B_2_Topological consistent generalization of openstreetmap
7B_2_Topological consistent generalization of openstreetmap7B_2_Topological consistent generalization of openstreetmap
7B_2_Topological consistent generalization of openstreetmapGISRUK conference
 
7A_4_Gps data collection setting for pedestrian activity modelling
7A_4_Gps data collection setting for pedestrian activity modelling7A_4_Gps data collection setting for pedestrian activity modelling
7A_4_Gps data collection setting for pedestrian activity modellingGISRUK conference
 
5A_3_GIS based spatial modelling for improving the sustainability of aggregat...
5A_3_GIS based spatial modelling for improving the sustainability of aggregat...5A_3_GIS based spatial modelling for improving the sustainability of aggregat...
5A_3_GIS based spatial modelling for improving the sustainability of aggregat...GISRUK conference
 
5A_1_Land evaluation techniques comparing fuzzy ahp with ideal point methods
5A_1_Land evaluation techniques comparing fuzzy ahp with ideal point methods5A_1_Land evaluation techniques comparing fuzzy ahp with ideal point methods
5A_1_Land evaluation techniques comparing fuzzy ahp with ideal point methodsGISRUK conference
 
4B_3_Automatically generating keywods for georeferenced imaged
4B_3_Automatically generating keywods for georeferenced imaged4B_3_Automatically generating keywods for georeferenced imaged
4B_3_Automatically generating keywods for georeferenced imagedGISRUK conference
 
4B_1_How many volunteers does it take to map an area well
4B_1_How many volunteers does it take to map an area well4B_1_How many volunteers does it take to map an area well
4B_1_How many volunteers does it take to map an area wellGISRUK conference
 
4A_1_Uncertainty in the 2001 output area classification for the census of eng...
4A_1_Uncertainty in the 2001 output area classification for the census of eng...4A_1_Uncertainty in the 2001 output area classification for the census of eng...
4A_1_Uncertainty in the 2001 output area classification for the census of eng...GISRUK conference
 
3A_4_Applying network analysis to quantify accessibility to urban greenspace ...
3A_4_Applying network analysis to quantify accessibility to urban greenspace ...3A_4_Applying network analysis to quantify accessibility to urban greenspace ...
3A_4_Applying network analysis to quantify accessibility to urban greenspace ...GISRUK conference
 
3A_2_Modelling health-harming behaviours in a socially ranked geographic space
3A_2_Modelling health-harming behaviours in a socially ranked geographic space3A_2_Modelling health-harming behaviours in a socially ranked geographic space
3A_2_Modelling health-harming behaviours in a socially ranked geographic spaceGISRUK conference
 
1A_3_A geodemographic classification of london primary schools
1A_3_A geodemographic classification of london primary schools1A_3_A geodemographic classification of london primary schools
1A_3_A geodemographic classification of london primary schoolsGISRUK conference
 
UK Map Challenge Aidan Slingsby
UK Map Challenge   Aidan SlingsbyUK Map Challenge   Aidan Slingsby
UK Map Challenge Aidan SlingsbyGISRUK conference
 
SP_4 Supporting spatial negotiations in land use planning
SP_4 Supporting spatial negotiations in land use planningSP_4 Supporting spatial negotiations in land use planning
SP_4 Supporting spatial negotiations in land use planningGISRUK conference
 
SP_3 Automatic identification of high streets and classification of urban lan...
SP_3 Automatic identification of high streets and classification of urban lan...SP_3 Automatic identification of high streets and classification of urban lan...
SP_3 Automatic identification of high streets and classification of urban lan...GISRUK conference
 
9B_1_Trust in web gis a preliminary investigation of the environment agencys ...
9B_1_Trust in web gis a preliminary investigation of the environment agencys ...9B_1_Trust in web gis a preliminary investigation of the environment agencys ...
9B_1_Trust in web gis a preliminary investigation of the environment agencys ...GISRUK conference
 
9A_2_Automatic classification of retail spaces from a large scale topographc ...
9A_2_Automatic classification of retail spaces from a large scale topographc ...9A_2_Automatic classification of retail spaces from a large scale topographc ...
9A_2_Automatic classification of retail spaces from a large scale topographc ...GISRUK conference
 
9A_1_On automatic mapping of environmental data using adaptive general regres...
9A_1_On automatic mapping of environmental data using adaptive general regres...9A_1_On automatic mapping of environmental data using adaptive general regres...
9A_1_On automatic mapping of environmental data using adaptive general regres...GISRUK conference
 
8B_4_Exploring the usability of geographic information
8B_4_Exploring the usability of geographic information8B_4_Exploring the usability of geographic information
8B_4_Exploring the usability of geographic informationGISRUK conference
 
8B_2_Using sound to represent uncertainty in address locations
8B_2_Using sound to represent uncertainty in address locations8B_2_Using sound to represent uncertainty in address locations
8B_2_Using sound to represent uncertainty in address locationsGISRUK conference
 

Plus de GISRUK conference (20)

8A_1_To vote or not to vote
8A_1_To vote or not to vote8A_1_To vote or not to vote
8A_1_To vote or not to vote
 
7B_3_Matterhorn on the horizon
7B_3_Matterhorn on the horizon7B_3_Matterhorn on the horizon
7B_3_Matterhorn on the horizon
 
7B_2_Topological consistent generalization of openstreetmap
7B_2_Topological consistent generalization of openstreetmap7B_2_Topological consistent generalization of openstreetmap
7B_2_Topological consistent generalization of openstreetmap
 
7A_4_Gps data collection setting for pedestrian activity modelling
7A_4_Gps data collection setting for pedestrian activity modelling7A_4_Gps data collection setting for pedestrian activity modelling
7A_4_Gps data collection setting for pedestrian activity modelling
 
5A_3_GIS based spatial modelling for improving the sustainability of aggregat...
5A_3_GIS based spatial modelling for improving the sustainability of aggregat...5A_3_GIS based spatial modelling for improving the sustainability of aggregat...
5A_3_GIS based spatial modelling for improving the sustainability of aggregat...
 
5A_1_Land evaluation techniques comparing fuzzy ahp with ideal point methods
5A_1_Land evaluation techniques comparing fuzzy ahp with ideal point methods5A_1_Land evaluation techniques comparing fuzzy ahp with ideal point methods
5A_1_Land evaluation techniques comparing fuzzy ahp with ideal point methods
 
4B_3_Automatically generating keywods for georeferenced imaged
4B_3_Automatically generating keywods for georeferenced imaged4B_3_Automatically generating keywods for georeferenced imaged
4B_3_Automatically generating keywods for georeferenced imaged
 
4B_1_How many volunteers does it take to map an area well
4B_1_How many volunteers does it take to map an area well4B_1_How many volunteers does it take to map an area well
4B_1_How many volunteers does it take to map an area well
 
4A_1_Uncertainty in the 2001 output area classification for the census of eng...
4A_1_Uncertainty in the 2001 output area classification for the census of eng...4A_1_Uncertainty in the 2001 output area classification for the census of eng...
4A_1_Uncertainty in the 2001 output area classification for the census of eng...
 
3A_4_Applying network analysis to quantify accessibility to urban greenspace ...
3A_4_Applying network analysis to quantify accessibility to urban greenspace ...3A_4_Applying network analysis to quantify accessibility to urban greenspace ...
3A_4_Applying network analysis to quantify accessibility to urban greenspace ...
 
3A_2_Modelling health-harming behaviours in a socially ranked geographic space
3A_2_Modelling health-harming behaviours in a socially ranked geographic space3A_2_Modelling health-harming behaviours in a socially ranked geographic space
3A_2_Modelling health-harming behaviours in a socially ranked geographic space
 
1A_3_A geodemographic classification of london primary schools
1A_3_A geodemographic classification of london primary schools1A_3_A geodemographic classification of london primary schools
1A_3_A geodemographic classification of london primary schools
 
UK Map Challenge Aidan Slingsby
UK Map Challenge   Aidan SlingsbyUK Map Challenge   Aidan Slingsby
UK Map Challenge Aidan Slingsby
 
SP_4 Supporting spatial negotiations in land use planning
SP_4 Supporting spatial negotiations in land use planningSP_4 Supporting spatial negotiations in land use planning
SP_4 Supporting spatial negotiations in land use planning
 
SP_3 Automatic identification of high streets and classification of urban lan...
SP_3 Automatic identification of high streets and classification of urban lan...SP_3 Automatic identification of high streets and classification of urban lan...
SP_3 Automatic identification of high streets and classification of urban lan...
 
9B_1_Trust in web gis a preliminary investigation of the environment agencys ...
9B_1_Trust in web gis a preliminary investigation of the environment agencys ...9B_1_Trust in web gis a preliminary investigation of the environment agencys ...
9B_1_Trust in web gis a preliminary investigation of the environment agencys ...
 
9A_2_Automatic classification of retail spaces from a large scale topographc ...
9A_2_Automatic classification of retail spaces from a large scale topographc ...9A_2_Automatic classification of retail spaces from a large scale topographc ...
9A_2_Automatic classification of retail spaces from a large scale topographc ...
 
9A_1_On automatic mapping of environmental data using adaptive general regres...
9A_1_On automatic mapping of environmental data using adaptive general regres...9A_1_On automatic mapping of environmental data using adaptive general regres...
9A_1_On automatic mapping of environmental data using adaptive general regres...
 
8B_4_Exploring the usability of geographic information
8B_4_Exploring the usability of geographic information8B_4_Exploring the usability of geographic information
8B_4_Exploring the usability of geographic information
 
8B_2_Using sound to represent uncertainty in address locations
8B_2_Using sound to represent uncertainty in address locations8B_2_Using sound to represent uncertainty in address locations
8B_2_Using sound to represent uncertainty in address locations
 

4B_2_A step towards the improvement of spatial quality of web 2.0 geo-applications

Notes de l'éditeur

  1. The subject of the presentation is the improvement of SDQ of web 2.0 geo-applications by examining in particular the case of OSM
  2. Well, maybe the most fundamental problem of GIS is how we can put the real world into an information system. How we can model the reality in such a way in order to fit into a GIS.
  3. To deal with that problem the Ordnance Survey has published a catalogue that contains the real-world objects which actually serves as a specification for their OS MasterMap porduct. The scope of this catalogue, that has 566 pages, is to provide a list of the RWOs of the product and a list of features and attributes of each of the RWOs
  4. In fact OSM has something similar to that. Well, it is not a catalogue per se rather a wiki page, but it serves the same purpose to provide a list of entities and possible attributes (or tags) that the users can assign to these entities
  5. The thing is though that this list has not just been published but it has been created through democratic procedures with the help of the wiki technology. In brief, OSM users through a voting system can suggest which entities or tags need to be deleted, altered or added at the map feature page.
  6. So in fact when we speak for OSM Data we actually speak for the geometry and the attributes or tags that users have assigned to the real world entities.
  7. Now, regarding the quality of the OSM Geometry there has been some research either to examine completeness or positional accuracy against the OS Meridian2 dataset. But what we haven’t seen up to now is the quality of the tags
  8. So the question is what is going on with the tags in OSM?
  9. After all, tags is what actually transforms spaghetti-like digitized data into a proper map
  10. So, we looked into what is going on in the OSM tags for GB. This graph shows 2 things. The first thing is the number of tags recorded for each of these 18 categories for GB So, we see that the population of tags ranges from just few thousands for motorway_links up to 900k tags for the residential roads category. In total these 18 categories have more than 2.2 million tags. The second thing shown here is the number of unique tags recorded for each category. The interesting thing here in this line graph is that we obviously don’t need more than 300 unique tags to describe a residential road, well, not even 50 unique tags to describe a motorway_link.
  11. Now, the thing is that, despite the huge amount of tags generated by the OSM contributors the average number of tags per recorded entity is quite small, with the majority of the categories having between 1 and 3 tags per feature. This really gives us an indication about the OSM completeness in terms of entity attribution and certainly indicates that population of tags will keep getting bigger and bigger both because new entities will be digitized but also because the average number of tags per entity will grow.
  12. Now, what we wanted to see is how often a new tag is introduced. Well, the answer is that this depends in the total tag population of each category. So, for example, for the residential roads, in average, we have a new tag for almost every 3000 tags where as for primary roads we get a new tag for every 600 tags. So, the question now is…. Is this good or bad? Well, in order to answer that question we translated this figures into percentage of growth
  13. So, this graph shows what the growth of the tags population has to be for each category in order to have a new tag introduced for that category The interesting thing here is that after a threshold of about 40.000 tags, an increase of 0.3%-0.5% percent creates a new tag in each category.
  14. Now, the next question is ….ok we do not need that many tags per feature category but how many tags are actually enough? We see here that just a small fraction of tags covers the 95% of the tag population in each case. So, actually we need only the tip of the iceberg to correctly model the real world and not the whole iceberg itself. So, is there something we can do about that?
  15. Now, our initial aim was to examine the quality of OSM tag…. But examine the quality against what? OSM is a product that literally has no specification and it captures reality in much more detail than any other product. So what we wanted was first to create an XML Schema that will work as the OSM specification We did that by both manually gathering information both from the OSM wiki pages and by examining the tags that were included in the tip of the iceberg that showed you earlier that I showed you earlier. So just to give you an example of what the schema looks like
  16. So, when we finished some fragments of that Schema we start performing all shorts of comparisons and with the actual data. Here is some of the interesting stuff we found. When we examined the entities of some of the OSM feature categories we found that the % of Schema violation was really high We noticed that the majority of the entities violated the schema because they had the ‘create_by’ tag that had been deprecated. When we didn’t take the create_by tag into account we saw that the % of feature violation was considerably smaller.
  17. We performed the same evaluation in larger categories including all OSM Highways, Nodes and Places but this time we just examined the specific Schema principle that says that tags should not have the created_by key both for all the entities and for entities created after the 30 of April last year when this rule was adopted. The interesting thing here is that while new guidelines are announced for the OSM dataset users don’t implement them immediately rather they continue providing data as they used to.
  18. So, what we suggest is in order to improve the quality of OSM and why not of other VGI geo-applications some sort of formalization should exist under the hood. Putting the XML Schema as a layer between the editors that users use and the database we can have Freedom, Formalization and preserve the Quality Standards of the dataset
  19. This schema could also be used again under the hood with the voting system in that way that any changes decided would be automatically propagated to the schema and finally to the data
  20. Before answering that question, lets see another parameter of the problem: In OSM data, since users are free to digitize or capture with a GPS whatever they want, there are numerous cases that the data capture has stopped without actually recording the entire entity. Where is the problem with that?
  21. Suppose that while the geometry of the two adjacent features is pretty much ok, there might be discrepancies in the tags contributed to each part of that entity That kind of discrepancies actually diminishes the value of the entity.
  22. Now, the OSM community has set up some of applications to deal with errors of the OSM dataset. These application, while they are quite impressive, to my view deal with the problem in a patchy way without providing a standard way of solving problems. In other word, while these apps spot some of the problems and invite users to correct them, they provide no guaranty that while correcting one error the user is not introducing another one.
  23. To deal with that problem the Ordnance Survey has published a catalogue that contains the real-world objects which actually serves as a specification for their OS MasterMap porduct. The scope of this catalogue, that has 566 pages, is to provide a list of the RWOs of the product and a list of features and attributes of each of the RWOs
  24. So, the next time someone shows you that really exciting graph of the OSM growth just remember that for every tiny fraction of that growth the errors are growing as well.