SlideShare une entreprise Scribd logo
1  sur  35
Managing Data Quality in
         OpenStreetMap


TOOLS FOR AN ACTIVE
MAPPING COMMUNITY

NC GIS CONFERENCE 2013



    This document licensed in entirety by Creative Commons CC-by-SA. For specific terms of license, see:
    http://creativecommons.org/licenses/by-sa/3.0/
Overview
                            2

 The Short History of the OpenStreetMap
   Revolution

 Assessing Open Source Data Quality


 Overview of Tools


 Creating Tools that Matter


NC GIS Conference 2013                     23 February 2013
Overview: Key Questions
                                    3

 How can crowd-sourced projects manage data
   quality effectively?

 What tools exist for monitoring data quality in
   OpenStreetMap?

 What conclusions can be drawn about existing tools?


 What is the future of data quality in crowd-sourced
   projects?
NC GIS Conference 2013                             23 February 2013
OpenStreetMap is…
                                 4




 A freely-editable map of the world
   unconstrained by proprietary ownership

 “Wikipedia for maps”




NC GIS Conference 2013                       23 February 2013
The Origins of OpenStreetMap
                              5



 OpenStreetMap.org domain registered by Steve
  Coast in 2004
 Project originated in the United Kingdom, where…
   Crown copyright on geospatial data

   Little, or no public domain data

 Simple goal to create a free, publicly-available
  database of street centerlines


NC GIS Conference 2013                      23 February 2013
OpenStreetMap is…
                                 6




 A freely-editable map of the world
   unconstrained by proprietary ownership

 “Wikipedia for maps”




NC GIS Conference 2013                       23 February 2013
Looks like…a wiki
                                 7




NC GIS Conference 2013                       23 February 2013
Wiki-based Documentation!
                         8




NC GIS Conference 2013

                                         23 February 2013
Milestones in OpenStreetMap History
                             9

 2004 - OpenStreetMap.org registered by Steve Coast
 2005 – Map Limehouse, 1st OpenStreetMap mapping
    party
   2005 – 1000 registered OpenStreetMap users
   2006 – OpenStreetMap Foundation established
   2007 – 5 million ways in OSM database
   2007 – 10,000 registered OpenStreetMap users
   2008 - TIGER data import for the US completed
   2009 - 100,000 registered OpenStreetMap users
   2010 - 200,000 registered OpenStreetMap users
   2012 – ~670,000 registered OpenStreetMap users

NC GIS Conference 2013                          23 February 2013
OpenStreetMap User Growth
                                          10
One million registered users worldwide!




 NC GIS Conference 2013                         23 February 2013
OpenStreetMap Growth in User Edits
                         11




NC GIS Conference 2013                 23 February 2013
OpenStreetMap Database Growth
                           12




NC GIS Conference 2013                  23 February 2013
Data Quality in Crowd-sourced Projects
                                                            13

 Goodchild & Li: Identified three mechanisms for
   Quality Assurance

       Crowd-sourcing

       Social

       Geographic


Goodchild, Michael F., and Linna Li. "Assuring the quality of volunteered geographic information."
Spatial Statistics 1 (2012): 110-120.


NC GIS Conference 2013                                                                               23 February 2013
Crowd-sourced Approach to Data Quality
                                                        14

 Based on Surowiecki’s “Wisdom of the Crowd”
   Multiple users converge around consensus solutions that
    might escape an individual
   Many independent observations reinforce the validity of a
    single observation
   Concurrence on observed features (e.g. “It’s a bridge.”)

   Convergence on the truth



      The group validates observations & corrects errors



   Surowiecki, J., 2005. The Wisdom of Crowds. Anchor, New York.

NC GIS Conference 2013                                             23 February 2013
Social Approach to Data Quality
                             15

 Through practices, users acquire reputations
 Users with good reputations are trusted
 Trust and reputation are indicators of stewardship
 As the project evolves, social leadership becomes
   more formalized.

 The Data Working Group of OpenStreetMap fullfills
  this function
 Email lists supplement social stewardship


NC GIS Conference 2013                        23 February 2013
Geographic Tools for Data Quality
                                   16

 Geographic approach draws on formal geographic
   theory:
      Spatial neighbors & auto-correlation (Moran statistics)
      Christaller’s Central Place Theory
      Descriptive Statistics
      Inferential Statistics & Analysis of Variance (ANOVA)
      Richardson plots of linear measurements
      Cluster analysis, e.g. k-means
 These approaches have not been widely adopted for
   use in the OpenStreetMap project…yet

NC GIS Conference 2013                                     23 February 2013
A Quick Survey of Data Quality Tools
                               17

 Two types of tools are in widespread use:


      Error Detection Tools

      Monitoring Tools




NC GIS Conference 2013                        23 February 2013
Error Detection Tools: Keep Right
                             18




NC GIS Conference 2013                      23 February 2013
Error Detection Tools: Map Dust
                             19




NC GIS Conference 2013                     23 February 2013
Error Detection Tools: OpenStreetBugs




NC GIS Conference 2013                 23 February 2013
Error Detection Tools: No Name
                             21




NC GIS Conference 2013                     23 February 2013
Error Detection Tools: MapRoulette
                           22




NC GIS Conference 2013                    23 February 2013
Monitoring Tools
                                23




NC GIS Conference 2013                      23 February 2013
Monitoring Tools: OpenStreetMap Watch List
                  (OWL)
                         24




NC GIS Conference 2013            23 February 2013
Monitoring Tools: GeoFabrik Map Compare
                         25




NC GIS Conference 2013           23 February 2013
Monitoring Tools: Who Did It
                               26




NC GIS Conference 2013                           23 February 2013
Monitoring Tools: ITO TIGER Reviewed
                         27




NC GIS Conference 2013              23 February 2013
Monitoring Tools: ITO TIGER Reviewed
                         28




NC GIS Conference 2013              23 February 2013
Monitoring Tools: Green Means Go
                          29




NC GIS Conference 2013                  23 February 2013
Monitoring Tools: Who’s Around Me
                          30




NC GIS Conference 2013                  23 February 2013
Social Controls
                                31

 OpenStreetMap - Data Working Group (DWG)
   Resolving disputes between users

   Processes & protocols for data imports

   Investigates copyright infringement

   Deals with issues of vandalism and fraud

   Suspends or closes user accounts (in case of abuse)

   IP blocking (in case of abuse)




NC GIS Conference 2013                              23 February 2013
How do Social Methods Treat Vandalism?
                                32

 OpenStreetMap is not immune from malicious intent
   Copyright infringement (e.g. copying from Google Maps)

   Graffiti

   Disputes & “Edit Wars” (e.g. Kashmir region, Palestine)

   Spam

 Tools for Managing Vandalism
   Detect using daily diffs

   UserActivity – batch comparison of two versions of the
    database
   Revert – undo changeset to previous version

   Virtual Ban


NC GIS Conference 2013                                 23 February 2013
Summary Review
                                 33

 Three methods for data quality control
   Crowd-sourced

   Social

   Geographic

 OpenStreetMap has crowd-sourced and social tools
   for managing data quality
      Error & Monitoring tools
      Data Working Group - Social
 Geographic methods are experimental at this time
 Increasingly complete geographic features will lead
   to better tools
NC GIS Conference 2013                        23 February 2013
Lessons Learned about OSM Data Quality
                                                       34

 Successive editing by multiple users can improve
   accuracy…up to a point
      Haklay suggests that few improvements are made beyond the
       13th edit
      Semantic differences are not easy to resolve – “Tag wars”
      Obscure edits do not always get corrected if there are no local
       mappers that take ownership
 Social approaches will acquire more authority
   Are part-time, volunteer staffers enough to guarantee data
    quality?
   What are appropriate metrics for trust and reputation?

     Haklay, M. 2010. How Good is volunteered geographical information? a comparative study of OpenStreetMap and
     Ordnance Survey Datasets. Environment & Planning B: Planning and Design 37 (4), 682-703g
NC GIS Conference 2013                                                                           23 February 2013
Thank You
                                                                   35

 Questions?




 Steven Johnson
   (e) stevejohnson@deloitte.com

   (t) @geomantic




             This document licensed in entirety by Creative Commons CC-by-SA. For specific terms of license, see:
             http://creativecommons.org/licenses/by-sa/3.0/




NC GIS Conference 2013                                                                                              23 February 2013

Contenu connexe

Tendances

Movement Data in GIS - Geobeer Lightning Talk, 2021-03-08
Movement Data in GIS - Geobeer Lightning Talk, 2021-03-08Movement Data in GIS - Geobeer Lightning Talk, 2021-03-08
Movement Data in GIS - Geobeer Lightning Talk, 2021-03-08Anita Graser
 
Geographic information system
Geographic information systemGeographic information system
Geographic information systemDhaval Jalalpara
 
The Application of GIS in Urban Planning
The Application of GIS in Urban PlanningThe Application of GIS in Urban Planning
The Application of GIS in Urban Planningagungwah
 
Geodatabase with GIS & RS
Geodatabase with GIS & RSGeodatabase with GIS & RS
Geodatabase with GIS & RSMohammed_82
 
Future of GIS, Moving to the Enterprise Platform
Future of GIS, Moving to the Enterprise PlatformFuture of GIS, Moving to the Enterprise Platform
Future of GIS, Moving to the Enterprise PlatformSSP Innovations
 
Geographic information system
Geographic information systemGeographic information system
Geographic information systemSumanta Das
 
GIS and Petroleum Land Management
GIS and Petroleum Land ManagementGIS and Petroleum Land Management
GIS and Petroleum Land Managementwlgardnerjr
 
Geographical information system in transportation planning
Geographical information system in transportation planning Geographical information system in transportation planning
Geographical information system in transportation planning shayiqRashid
 
Open Source GIS
Open Source GISOpen Source GIS
Open Source GISJoe Larson
 
Gis powerpoint
Gis powerpointGis powerpoint
Gis powerpointkaushdave
 
Open source health gis presentation final
Open source health gis  presentation finalOpen source health gis  presentation final
Open source health gis presentation finalJISC GECO
 
Why Does GIS Matter
Why Does GIS MatterWhy Does GIS Matter
Why Does GIS MatterSong Gao
 
Geographic Information Systems in the Oil & Gas Industry
Geographic Information Systems in the Oil & Gas IndustryGeographic Information Systems in the Oil & Gas Industry
Geographic Information Systems in the Oil & Gas IndustryFrancois Viljoen
 
A Study of the Development and Distribution of Open Geospatial Data in Japane...
A Study of the Development and Distribution of Open Geospatial Data in Japane...A Study of the Development and Distribution of Open Geospatial Data in Japane...
A Study of the Development and Distribution of Open Geospatial Data in Japane...Toshikazu Seto
 
MODERN trends of GIS
MODERN trends of GISMODERN trends of GIS
MODERN trends of GISVAISHALI JAIN
 
CKANへの空間情報機能拡張実装の試み
CKANへの空間情報機能拡張実装の試みCKANへの空間情報機能拡張実装の試み
CKANへの空間情報機能拡張実装の試みYoichi Kayama
 

Tendances (20)

Movement Data in GIS - Geobeer Lightning Talk, 2021-03-08
Movement Data in GIS - Geobeer Lightning Talk, 2021-03-08Movement Data in GIS - Geobeer Lightning Talk, 2021-03-08
Movement Data in GIS - Geobeer Lightning Talk, 2021-03-08
 
Geographic information system
Geographic information systemGeographic information system
Geographic information system
 
The Application of GIS in Urban Planning
The Application of GIS in Urban PlanningThe Application of GIS in Urban Planning
The Application of GIS in Urban Planning
 
Geodatabase with GIS & RS
Geodatabase with GIS & RSGeodatabase with GIS & RS
Geodatabase with GIS & RS
 
Future of GIS, Moving to the Enterprise Platform
Future of GIS, Moving to the Enterprise PlatformFuture of GIS, Moving to the Enterprise Platform
Future of GIS, Moving to the Enterprise Platform
 
Geographic information system
Geographic information systemGeographic information system
Geographic information system
 
GIS and Petroleum Land Management
GIS and Petroleum Land ManagementGIS and Petroleum Land Management
GIS and Petroleum Land Management
 
Introduction To GIS
Introduction To GISIntroduction To GIS
Introduction To GIS
 
Geographical information system in transportation planning
Geographical information system in transportation planning Geographical information system in transportation planning
Geographical information system in transportation planning
 
Open Source GIS
Open Source GISOpen Source GIS
Open Source GIS
 
Gis powerpoint
Gis powerpointGis powerpoint
Gis powerpoint
 
Open source health gis presentation final
Open source health gis  presentation finalOpen source health gis  presentation final
Open source health gis presentation final
 
Gis
GisGis
Gis
 
Why Does GIS Matter
Why Does GIS MatterWhy Does GIS Matter
Why Does GIS Matter
 
survey paper 2
survey paper 2survey paper 2
survey paper 2
 
Geographic Information Systems in the Oil & Gas Industry
Geographic Information Systems in the Oil & Gas IndustryGeographic Information Systems in the Oil & Gas Industry
Geographic Information Systems in the Oil & Gas Industry
 
Get Big Geo Data
Get Big Geo DataGet Big Geo Data
Get Big Geo Data
 
A Study of the Development and Distribution of Open Geospatial Data in Japane...
A Study of the Development and Distribution of Open Geospatial Data in Japane...A Study of the Development and Distribution of Open Geospatial Data in Japane...
A Study of the Development and Distribution of Open Geospatial Data in Japane...
 
MODERN trends of GIS
MODERN trends of GISMODERN trends of GIS
MODERN trends of GIS
 
CKANへの空間情報機能拡張実装の試み
CKANへの空間情報機能拡張実装の試みCKANへの空間情報機能拡張実装の試み
CKANへの空間情報機能拡張実装の試み
 

Similaire à OpenStreetMap Data Quality

Exploratory analysis of OpenStreetMap for land use classification
Exploratory analysis of OpenStreetMap for land use classificationExploratory analysis of OpenStreetMap for land use classification
Exploratory analysis of OpenStreetMap for land use classificationJacinto Estima
 
GIS for geophysics.pptx
GIS for geophysics.pptxGIS for geophysics.pptx
GIS for geophysics.pptxThomasHundasa1
 
Land information system in Nepal
Land information system in NepalLand information system in Nepal
Land information system in NepalQust04
 
Thesispresentatie maart
Thesispresentatie maartThesispresentatie maart
Thesispresentatie maartRobin De Croon
 
oWE-QGIS_Training-March2022
oWE-QGIS_Training-March2022oWE-QGIS_Training-March2022
oWE-QGIS_Training-March2022AbdilbasitHamid
 
MoWE-QGIS_Training-March2022-Day1_AM.pptx
MoWE-QGIS_Training-March2022-Day1_AM.pptxMoWE-QGIS_Training-March2022-Day1_AM.pptx
MoWE-QGIS_Training-March2022-Day1_AM.pptxAbdilbasitHamid
 
Converting Relational to Graph Databases
Converting Relational to Graph DatabasesConverting Relational to Graph Databases
Converting Relational to Graph DatabasesAntonio Maccioni
 
Gis Day Presentation 2010 - ACCC - Expanded Version
Gis Day Presentation 2010 - ACCC - Expanded VersionGis Day Presentation 2010 - ACCC - Expanded Version
Gis Day Presentation 2010 - ACCC - Expanded Versionpdcaris
 
Arc news - Fall-2015
Arc news - Fall-2015Arc news - Fall-2015
Arc news - Fall-2015what3words
 
New way for GIS Development(Gaia3D)
New way for  GIS Development(Gaia3D)New way for  GIS Development(Gaia3D)
New way for GIS Development(Gaia3D)slhead1
 
Introduction to Geographic Information System (GIS)
Introduction to Geographic Information System (GIS)Introduction to Geographic Information System (GIS)
Introduction to Geographic Information System (GIS)Shashank Singh
 
Quantitative Methods II (#SOC2031). Seminar #11: Secondary analysis. Big data...
Quantitative Methods II (#SOC2031). Seminar #11: Secondary analysis. Big data...Quantitative Methods II (#SOC2031). Seminar #11: Secondary analysis. Big data...
Quantitative Methods II (#SOC2031). Seminar #11: Secondary analysis. Big data...David Rozas
 
Open Source based GIS devlopment cases by Gaia3D_20150417
Open Source based GIS devlopment cases by Gaia3D_20150417Open Source based GIS devlopment cases by Gaia3D_20150417
Open Source based GIS devlopment cases by Gaia3D_20150417BJ Jang
 
Coerced Geographic Information: The Not-so-voluntary Side of User-generated G...
Coerced Geographic Information: The Not-so-voluntary Side of User-generated G...Coerced Geographic Information: The Not-so-voluntary Side of User-generated G...
Coerced Geographic Information: The Not-so-voluntary Side of User-generated G...Grant McKenzie
 
Spatial Analysis and Geomatics
Spatial Analysis and GeomaticsSpatial Analysis and Geomatics
Spatial Analysis and GeomaticsRich Heimann
 
COST Actions: ENERGIC, Mapping and the citizen sensor.
COST Actions: ENERGIC,  Mapping and the citizen sensor.COST Actions: ENERGIC,  Mapping and the citizen sensor.
COST Actions: ENERGIC, Mapping and the citizen sensor.Vyron
 

Similaire à OpenStreetMap Data Quality (20)

Understanding the Volunteer in VGI
Understanding the Volunteer in VGIUnderstanding the Volunteer in VGI
Understanding the Volunteer in VGI
 
Exploratory analysis of OpenStreetMap for land use classification
Exploratory analysis of OpenStreetMap for land use classificationExploratory analysis of OpenStreetMap for land use classification
Exploratory analysis of OpenStreetMap for land use classification
 
GIS for geophysics.pptx
GIS for geophysics.pptxGIS for geophysics.pptx
GIS for geophysics.pptx
 
Land information system in Nepal
Land information system in NepalLand information system in Nepal
Land information system in Nepal
 
Thesispresentatie maart
Thesispresentatie maartThesispresentatie maart
Thesispresentatie maart
 
oWE-QGIS_Training-March2022
oWE-QGIS_Training-March2022oWE-QGIS_Training-March2022
oWE-QGIS_Training-March2022
 
MoWE-QGIS_Training-March2022-Day1_AM.pptx
MoWE-QGIS_Training-March2022-Day1_AM.pptxMoWE-QGIS_Training-March2022-Day1_AM.pptx
MoWE-QGIS_Training-March2022-Day1_AM.pptx
 
Converting Relational to Graph Databases
Converting Relational to Graph DatabasesConverting Relational to Graph Databases
Converting Relational to Graph Databases
 
Gis Day Presentation 2010 - ACCC - Expanded Version
Gis Day Presentation 2010 - ACCC - Expanded VersionGis Day Presentation 2010 - ACCC - Expanded Version
Gis Day Presentation 2010 - ACCC - Expanded Version
 
Arc news - Fall-2015
Arc news - Fall-2015Arc news - Fall-2015
Arc news - Fall-2015
 
New way for GIS Development(Gaia3D)
New way for  GIS Development(Gaia3D)New way for  GIS Development(Gaia3D)
New way for GIS Development(Gaia3D)
 
AAG panel discussion on great lakes africa
AAG panel discussion on great lakes africaAAG panel discussion on great lakes africa
AAG panel discussion on great lakes africa
 
Introduction to Geographic Information System (GIS)
Introduction to Geographic Information System (GIS)Introduction to Geographic Information System (GIS)
Introduction to Geographic Information System (GIS)
 
Quantitative Methods II (#SOC2031). Seminar #11: Secondary analysis. Big data...
Quantitative Methods II (#SOC2031). Seminar #11: Secondary analysis. Big data...Quantitative Methods II (#SOC2031). Seminar #11: Secondary analysis. Big data...
Quantitative Methods II (#SOC2031). Seminar #11: Secondary analysis. Big data...
 
Open Source based GIS devlopment cases by Gaia3D_20150417
Open Source based GIS devlopment cases by Gaia3D_20150417Open Source based GIS devlopment cases by Gaia3D_20150417
Open Source based GIS devlopment cases by Gaia3D_20150417
 
Coerced Geographic Information: The Not-so-voluntary Side of User-generated G...
Coerced Geographic Information: The Not-so-voluntary Side of User-generated G...Coerced Geographic Information: The Not-so-voluntary Side of User-generated G...
Coerced Geographic Information: The Not-so-voluntary Side of User-generated G...
 
Smart Citizens
Smart CitizensSmart Citizens
Smart Citizens
 
Smart Citizens
Smart CitizensSmart Citizens
Smart Citizens
 
Spatial Analysis and Geomatics
Spatial Analysis and GeomaticsSpatial Analysis and Geomatics
Spatial Analysis and Geomatics
 
COST Actions: ENERGIC, Mapping and the citizen sensor.
COST Actions: ENERGIC,  Mapping and the citizen sensor.COST Actions: ENERGIC,  Mapping and the citizen sensor.
COST Actions: ENERGIC, Mapping and the citizen sensor.
 

Dernier

[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGSujit Pal
 

Dernier (20)

[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAG
 

OpenStreetMap Data Quality

  • 1. Managing Data Quality in OpenStreetMap TOOLS FOR AN ACTIVE MAPPING COMMUNITY NC GIS CONFERENCE 2013 This document licensed in entirety by Creative Commons CC-by-SA. For specific terms of license, see: http://creativecommons.org/licenses/by-sa/3.0/
  • 2. Overview 2  The Short History of the OpenStreetMap Revolution  Assessing Open Source Data Quality  Overview of Tools  Creating Tools that Matter NC GIS Conference 2013 23 February 2013
  • 3. Overview: Key Questions 3  How can crowd-sourced projects manage data quality effectively?  What tools exist for monitoring data quality in OpenStreetMap?  What conclusions can be drawn about existing tools?  What is the future of data quality in crowd-sourced projects? NC GIS Conference 2013 23 February 2013
  • 4. OpenStreetMap is… 4  A freely-editable map of the world unconstrained by proprietary ownership  “Wikipedia for maps” NC GIS Conference 2013 23 February 2013
  • 5. The Origins of OpenStreetMap 5  OpenStreetMap.org domain registered by Steve Coast in 2004  Project originated in the United Kingdom, where…  Crown copyright on geospatial data  Little, or no public domain data  Simple goal to create a free, publicly-available database of street centerlines NC GIS Conference 2013 23 February 2013
  • 6. OpenStreetMap is… 6  A freely-editable map of the world unconstrained by proprietary ownership  “Wikipedia for maps” NC GIS Conference 2013 23 February 2013
  • 7. Looks like…a wiki 7 NC GIS Conference 2013 23 February 2013
  • 8. Wiki-based Documentation! 8 NC GIS Conference 2013 23 February 2013
  • 9. Milestones in OpenStreetMap History 9  2004 - OpenStreetMap.org registered by Steve Coast  2005 – Map Limehouse, 1st OpenStreetMap mapping party  2005 – 1000 registered OpenStreetMap users  2006 – OpenStreetMap Foundation established  2007 – 5 million ways in OSM database  2007 – 10,000 registered OpenStreetMap users  2008 - TIGER data import for the US completed  2009 - 100,000 registered OpenStreetMap users  2010 - 200,000 registered OpenStreetMap users  2012 – ~670,000 registered OpenStreetMap users NC GIS Conference 2013 23 February 2013
  • 10. OpenStreetMap User Growth 10 One million registered users worldwide! NC GIS Conference 2013 23 February 2013
  • 11. OpenStreetMap Growth in User Edits 11 NC GIS Conference 2013 23 February 2013
  • 12. OpenStreetMap Database Growth 12 NC GIS Conference 2013 23 February 2013
  • 13. Data Quality in Crowd-sourced Projects 13  Goodchild & Li: Identified three mechanisms for Quality Assurance  Crowd-sourcing  Social  Geographic Goodchild, Michael F., and Linna Li. "Assuring the quality of volunteered geographic information." Spatial Statistics 1 (2012): 110-120. NC GIS Conference 2013 23 February 2013
  • 14. Crowd-sourced Approach to Data Quality 14  Based on Surowiecki’s “Wisdom of the Crowd”  Multiple users converge around consensus solutions that might escape an individual  Many independent observations reinforce the validity of a single observation  Concurrence on observed features (e.g. “It’s a bridge.”)  Convergence on the truth  The group validates observations & corrects errors Surowiecki, J., 2005. The Wisdom of Crowds. Anchor, New York. NC GIS Conference 2013 23 February 2013
  • 15. Social Approach to Data Quality 15  Through practices, users acquire reputations  Users with good reputations are trusted  Trust and reputation are indicators of stewardship  As the project evolves, social leadership becomes more formalized.  The Data Working Group of OpenStreetMap fullfills this function  Email lists supplement social stewardship NC GIS Conference 2013 23 February 2013
  • 16. Geographic Tools for Data Quality 16  Geographic approach draws on formal geographic theory:  Spatial neighbors & auto-correlation (Moran statistics)  Christaller’s Central Place Theory  Descriptive Statistics  Inferential Statistics & Analysis of Variance (ANOVA)  Richardson plots of linear measurements  Cluster analysis, e.g. k-means  These approaches have not been widely adopted for use in the OpenStreetMap project…yet NC GIS Conference 2013 23 February 2013
  • 17. A Quick Survey of Data Quality Tools 17  Two types of tools are in widespread use:  Error Detection Tools  Monitoring Tools NC GIS Conference 2013 23 February 2013
  • 18. Error Detection Tools: Keep Right 18 NC GIS Conference 2013 23 February 2013
  • 19. Error Detection Tools: Map Dust 19 NC GIS Conference 2013 23 February 2013
  • 20. Error Detection Tools: OpenStreetBugs NC GIS Conference 2013 23 February 2013
  • 21. Error Detection Tools: No Name 21 NC GIS Conference 2013 23 February 2013
  • 22. Error Detection Tools: MapRoulette 22 NC GIS Conference 2013 23 February 2013
  • 23. Monitoring Tools 23 NC GIS Conference 2013 23 February 2013
  • 24. Monitoring Tools: OpenStreetMap Watch List (OWL) 24 NC GIS Conference 2013 23 February 2013
  • 25. Monitoring Tools: GeoFabrik Map Compare 25 NC GIS Conference 2013 23 February 2013
  • 26. Monitoring Tools: Who Did It 26 NC GIS Conference 2013 23 February 2013
  • 27. Monitoring Tools: ITO TIGER Reviewed 27 NC GIS Conference 2013 23 February 2013
  • 28. Monitoring Tools: ITO TIGER Reviewed 28 NC GIS Conference 2013 23 February 2013
  • 29. Monitoring Tools: Green Means Go 29 NC GIS Conference 2013 23 February 2013
  • 30. Monitoring Tools: Who’s Around Me 30 NC GIS Conference 2013 23 February 2013
  • 31. Social Controls 31  OpenStreetMap - Data Working Group (DWG)  Resolving disputes between users  Processes & protocols for data imports  Investigates copyright infringement  Deals with issues of vandalism and fraud  Suspends or closes user accounts (in case of abuse)  IP blocking (in case of abuse) NC GIS Conference 2013 23 February 2013
  • 32. How do Social Methods Treat Vandalism? 32  OpenStreetMap is not immune from malicious intent  Copyright infringement (e.g. copying from Google Maps)  Graffiti  Disputes & “Edit Wars” (e.g. Kashmir region, Palestine)  Spam  Tools for Managing Vandalism  Detect using daily diffs  UserActivity – batch comparison of two versions of the database  Revert – undo changeset to previous version  Virtual Ban NC GIS Conference 2013 23 February 2013
  • 33. Summary Review 33  Three methods for data quality control  Crowd-sourced  Social  Geographic  OpenStreetMap has crowd-sourced and social tools for managing data quality  Error & Monitoring tools  Data Working Group - Social  Geographic methods are experimental at this time  Increasingly complete geographic features will lead to better tools NC GIS Conference 2013 23 February 2013
  • 34. Lessons Learned about OSM Data Quality 34  Successive editing by multiple users can improve accuracy…up to a point  Haklay suggests that few improvements are made beyond the 13th edit  Semantic differences are not easy to resolve – “Tag wars”  Obscure edits do not always get corrected if there are no local mappers that take ownership  Social approaches will acquire more authority  Are part-time, volunteer staffers enough to guarantee data quality?  What are appropriate metrics for trust and reputation? Haklay, M. 2010. How Good is volunteered geographical information? a comparative study of OpenStreetMap and Ordnance Survey Datasets. Environment & Planning B: Planning and Design 37 (4), 682-703g NC GIS Conference 2013 23 February 2013
  • 35. Thank You 35  Questions?  Steven Johnson  (e) stevejohnson@deloitte.com  (t) @geomantic This document licensed in entirety by Creative Commons CC-by-SA. For specific terms of license, see: http://creativecommons.org/licenses/by-sa/3.0/ NC GIS Conference 2013 23 February 2013