SlideShare a Scribd company logo
1 of 27
OpenStreetMap history
    challenges – methods – reasons
Relevance of Historical Data

    What did the map look
     like two years ago?

      Who was involved in
     editing object X / region Y?

        What exactly happened
            in changeset Z?
Relevance of Historical Data


   Historical visualization


       Retracing vandalism


             Data analysis
OpenStreetMap consists of   • nodes
                            • ways
                            • relations
• nodes
• ways
• relations
              each of those have a version and a
              timestamp, among other things
each of those have a version and a
• nodes       timestamp, among other things
• ways
• relations
              So in theory it’s possible to recreate
              the full history of all objects in
              OpenStreetMap
in practice
this is more
complicated
in practice
this is more
complicated    ways and
               relations
               only have
               references
               to node IDs
in practice
this is more
complicated
ways and
relations      this poses two
only have      problems.
references
to node IDs
1. Moving a node does not
 cause a new way version
1. Moving a node does not
 cause a new way version




     2. References in ways and
    relations do not point to a
          specific version
way version 1


1

v1                               3
                2
                                 v1
                v1




     1. Moving a node does not
      cause a new way version
way version 1

                   2

                   v2
1

v1                               3

                                 v1




     1. Moving a node does not
      cause a new way version
2. References in ways and
relations do not point to a
      specific version
2. References in ways and
relations do not point to a
      specific version
2. References in ways and
relations do not point to a
      specific version
2. References in ways and
relations do not point to a
      specific version
Also, the data is not readily accessible
Also, the data is not readily accessible
        Full history dump is 451GB of XML,
      containing 1.7 billion node versions, 170
               million way versions, and
             4.5 million relation versions.
What is needed?
What is needed?


        • A suitable DB schema
        • PBF format support
        • importer (support)
What is needed?


         • A suitable DB schema
         • PBF format support
         • importer (support)


   .....and ideally, a history API
• A suitable DB schema
• PBF format support
• importer support (osmosis, imposm)
• A suitable DB schema
• PBF format support
• importer support (osmosis, imposm)


       History Hack Weekend
       http://bit.ly/osmhistoryhack
History Hack Weekend
http://bit.ly/osmhistoryhack


        June 10-12
        LinuxHotel
      Essen, Germany
thanks

  OSM > mvexel
twitter > @mvexel
Martijn van Exel


all photos from flickr commons

More Related Content

More from Martijn van Exel (6)

MapRoulette Cooperative Challenges
MapRoulette Cooperative ChallengesMapRoulette Cooperative Challenges
MapRoulette Cooperative Challenges
 
The Golden Age of GIS - Keynote for GIS day 2019
The Golden Age of GIS - Keynote for GIS day 2019The Golden Age of GIS - Keynote for GIS day 2019
The Golden Age of GIS - Keynote for GIS day 2019
 
OpenStreetMap
OpenStreetMapOpenStreetMap
OpenStreetMap
 
Open Places Platform
Open Places PlatformOpen Places Platform
Open Places Platform
 
OpenStreetMap voor IMG100k
OpenStreetMap voor IMG100kOpenStreetMap voor IMG100k
OpenStreetMap voor IMG100k
 
OpenStreetMap Netherlands Since AND import
OpenStreetMap Netherlands Since AND importOpenStreetMap Netherlands Since AND import
OpenStreetMap Netherlands Since AND import
 

Recently uploaded

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Recently uploaded (20)

TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 

Dealing with historical data in OpenStreetMap

  • 1. OpenStreetMap history challenges – methods – reasons
  • 2. Relevance of Historical Data What did the map look like two years ago? Who was involved in editing object X / region Y? What exactly happened in changeset Z?
  • 3. Relevance of Historical Data Historical visualization Retracing vandalism Data analysis
  • 4. OpenStreetMap consists of • nodes • ways • relations
  • 5. • nodes • ways • relations each of those have a version and a timestamp, among other things
  • 6. each of those have a version and a • nodes timestamp, among other things • ways • relations So in theory it’s possible to recreate the full history of all objects in OpenStreetMap
  • 7. in practice this is more complicated
  • 8. in practice this is more complicated ways and relations only have references to node IDs
  • 9. in practice this is more complicated ways and relations this poses two only have problems. references to node IDs
  • 10. 1. Moving a node does not cause a new way version
  • 11. 1. Moving a node does not cause a new way version 2. References in ways and relations do not point to a specific version
  • 12. way version 1 1 v1 3 2 v1 v1 1. Moving a node does not cause a new way version
  • 13. way version 1 2 v2 1 v1 3 v1 1. Moving a node does not cause a new way version
  • 14. 2. References in ways and relations do not point to a specific version
  • 15. 2. References in ways and relations do not point to a specific version
  • 16. 2. References in ways and relations do not point to a specific version
  • 17. 2. References in ways and relations do not point to a specific version
  • 18. Also, the data is not readily accessible
  • 19. Also, the data is not readily accessible Full history dump is 451GB of XML, containing 1.7 billion node versions, 170 million way versions, and 4.5 million relation versions.
  • 21. What is needed? • A suitable DB schema • PBF format support • importer (support)
  • 22. What is needed? • A suitable DB schema • PBF format support • importer (support) .....and ideally, a history API
  • 23. • A suitable DB schema • PBF format support • importer support (osmosis, imposm)
  • 24. • A suitable DB schema • PBF format support • importer support (osmosis, imposm) History Hack Weekend http://bit.ly/osmhistoryhack
  • 25.
  • 26. History Hack Weekend http://bit.ly/osmhistoryhack June 10-12 LinuxHotel Essen, Germany
  • 27. thanks OSM > mvexel twitter > @mvexel Martijn van Exel all photos from flickr commons

Editor's Notes

  1. \n
  2. \n
  3. \n
  4. \n
  5. \n
  6. \n
  7. \n
  8. \n
  9. \n
  10. \n
  11. \n
  12. \n
  13. \n
  14. \n
  15. \n
  16. \n
  17. \n
  18. \n
  19. \n
  20. \n
  21. \n
  22. \n
  23. \n
  24. \n