SlideShare une entreprise Scribd logo
1  sur  24
Télécharger pour lire hors ligne
OpenStreetMap address base:
ready for prime time?
Maxim Dubinin
sim@gis-lab.info
SotM Baltics 2013
2 из 24
3 из 24
Can OpenStreetMap address database be
used to create complete geographic
datasets?
The Question
4 из 24
● Creation of data layers for different
features
● Large areas (Russia)
● Thousands objects
● Practical applications
Setup
5 из 24
1. How good is OSM address database and
fully automatic geocoding?
2. How much does postprocessing help?
3. How is completeness distributed across
Russia?
4. How good is the quality compared to
other geocoders?
You will find answers here for...
6 из 24
● What are the mistakes of geocoding and
how it can be improved?
● What is the right scheme for addressing?
● When will OSM take over the world?
...but, nothing about...
7 из 24
Result — correct lat/long for an address
Result ~ data preparation + geocoding +
postprocessing
● Data prep — make well structured address
● Geocoding — find lat/long for it with osm.org.ru
● Postprocessing — fix it manually if wrong
Some definitions
8 из 24
● OpenPolice — where are the local cops in
Moscow
● Elections — where are the voting stations in
Moscow
● Orphanages — where are the children
orphanages in Russia
Examples
9 из 24
1.How good is OSM address database
and fully automatic geocoding?
2.How much does postprocessing help?
3.How is quality distributed across Russia?
4.How good is the quality compared to
other geocoders?
Question 1
10 из 24
OpenPolice
● Extract all addresses from 112.ru
● Geocode them
● Relate them to buildings in Moscow to get
areas of responsibility
http://gis-lab.info/qa/openpolice.html
11 из 24
Results
● Total: ~41000 addresses in Moscow
12 из 24
1.How good is OSM address database and
fully automatic geocoding?
2.How much does postprocessing
help?
3.How is quality distributed across Russia?
4.How good is the quality compared to
other geocoders?
Question 2
13 из 24
Voting comissions
● Extract all addresses from public database
● Geocode them
● Crowdsource post-processing
http://uikgeo.gis-lab.info
14 из 24
Results
● Total: ~3500 addresses in Moscow
● Before post-processing VS after
post-processing
15 из 24
1.How good is OSM address database and
fully automatic geocoding?
2.How much does postprocessing help?
3.How is completeness distributed
across Russia?
4.How good is quality compared to other
geocoders?
Question 3
16 из 24
Orphanages
● Extract all addresses from public database
● Geocode and post-process them
● All regions of Russia, ~5000 orphanages total,
mean 50 per region
http://gis-lab.info/qa/detdom.html
17 из 24
Orphanages
● Buildings before and after post-proc, % total
18 из 24
Orphanages
● Buildings and streets before and after post-proc,
% total
19 из 24
1.How good is OSM address database
and fully automatic geocoding?
2.How much does postprocessing
help?
3.How is quality distributed across
Russia?
4.How good is the quality compared to
other geocoders?
Question 4
20 из 24
No project, just comparison
● Take few hundreds of addresses in different
parts of Russia
● Geocode them with OSM and Yandex
● For each point, assign score: Building = 3,
street = 2, settlement = 1
● Sum the scores up
● Compare
21 из 24
OSM vs Yandex
● Summed scores for geocoding accuracy
22 из 24
Yandex
● Yandex People's map contribution to total score
23 из 24
● Map more ;)
● Improve automatic geocoding
● Create positive feedback loop with
geocoding projects
How to get better?
24 из 24
https://github.com/simgislab/osmaddress-sotmbaltics13
Sources for this presentation

Contenu connexe

En vedette

Язык R: анализ и визуализация данных
Язык R: анализ и визуализация данныхЯзык R: анализ и визуализация данных
Язык R: анализ и визуализация данныхMatrunich Consulting
 
Dubinin opengov geodata
Dubinin opengov geodataDubinin opengov geodata
Dubinin opengov geodataMaxim Dubinin
 
Место Postgres/PostGIS в экосистеме открытого ПО
Место Postgres/PostGIS в экосистеме  открытого ПОМесто Postgres/PostGIS в экосистеме  открытого ПО
Место Postgres/PostGIS в экосистеме открытого ПОMaxim Dubinin
 
1 spss общие сведения
1 spss общие сведения1 spss общие сведения
1 spss общие сведенияGalina Balashova
 
Качество открытых государственных геоданных
Качество открытых государственных геоданныхКачество открытых государственных геоданных
Качество открытых государственных геоданныхMaxim Dubinin
 
R - как инструмент обработки и анализа пространственных данных
R - как инструмент обработки и анализа пространственных данныхR - как инструмент обработки и анализа пространственных данных
R - как инструмент обработки и анализа пространственных данныхMaxim Dubinin
 
Открытые ГИС: понемногу обо всём
Открытые ГИС: понемногу обо всёмОткрытые ГИС: понемногу обо всём
Открытые ГИС: понемногу обо всёмMaxim Dubinin
 
В,Л. Волохонский "Исследования в промышленном масштабе: сбор данных, обработк...
В,Л. Волохонский "Исследования в промышленном масштабе: сбор данных, обработк...В,Л. Волохонский "Исследования в промышленном масштабе: сбор данных, обработк...
В,Л. Волохонский "Исследования в промышленном масштабе: сбор данных, обработк...ЗПШ СПбГУ
 
ГИС с открытым исходным кодом: авантюра или бизнес? Взгляд со стороны разрабо...
ГИС с открытым исходным кодом: авантюра или бизнес? Взгляд со стороны разрабо...ГИС с открытым исходным кодом: авантюра или бизнес? Взгляд со стороны разрабо...
ГИС с открытым исходным кодом: авантюра или бизнес? Взгляд со стороны разрабо...Maxim Dubinin
 
Совзонд. Опыт использования open source
Совзонд. Опыт использования open sourceСовзонд. Опыт использования open source
Совзонд. Опыт использования open sourceMaxim Dubinin
 
Визуализация данных на географических картах - 2016
Визуализация данных на географических картах - 2016Визуализация данных на географических картах - 2016
Визуализация данных на географических картах - 2016Gleb Zakhodiakin
 
Прогнозирование - Лекция 7. Oracle Demand Management
Прогнозирование - Лекция 7. Oracle Demand ManagementПрогнозирование - Лекция 7. Oracle Demand Management
Прогнозирование - Лекция 7. Oracle Demand ManagementGleb Zakhodiakin
 
Прогнозирование - Лекция 5. Методология Бокса-Дженкинса (модели ARIMA)
Прогнозирование - Лекция 5. Методология Бокса-Дженкинса (модели ARIMA)Прогнозирование - Лекция 5. Методология Бокса-Дженкинса (модели ARIMA)
Прогнозирование - Лекция 5. Методология Бокса-Дженкинса (модели ARIMA)Gleb Zakhodiakin
 
Прогнозирование - Лекция 1. Компьютерные инструменты прогнозирования
Прогнозирование - Лекция 1. Компьютерные инструменты прогнозированияПрогнозирование - Лекция 1. Компьютерные инструменты прогнозирования
Прогнозирование - Лекция 1. Компьютерные инструменты прогнозированияGleb Zakhodiakin
 
Трансформация данных в Deductor Studio
Трансформация данных в Deductor StudioТрансформация данных в Deductor Studio
Трансформация данных в Deductor StudioGleb Zakhodiakin
 
Прогнозирование - Лекция 6. Использование инструментов подготовки данных и пр...
Прогнозирование - Лекция 6. Использование инструментов подготовки данных и пр...Прогнозирование - Лекция 6. Использование инструментов подготовки данных и пр...
Прогнозирование - Лекция 6. Использование инструментов подготовки данных и пр...Gleb Zakhodiakin
 
Прогнозирование - Лекция 4. Регрессионные модели временных рядов
Прогнозирование - Лекция 4. Регрессионные модели временных рядовПрогнозирование - Лекция 4. Регрессионные модели временных рядов
Прогнозирование - Лекция 4. Регрессионные модели временных рядовGleb Zakhodiakin
 
Визуализация данных. Аналитическая платформа Tableau.
Визуализация данных. Аналитическая платформа Tableau.Визуализация данных. Аналитическая платформа Tableau.
Визуализация данных. Аналитическая платформа Tableau.Gleb Zakhodiakin
 
Прогнозирование - Лекция 2. Корреляционный анализ и простая линейная регрессия
Прогнозирование - Лекция 2. Корреляционный анализ и простая линейная регрессияПрогнозирование - Лекция 2. Корреляционный анализ и простая линейная регрессия
Прогнозирование - Лекция 2. Корреляционный анализ и простая линейная регрессияGleb Zakhodiakin
 

En vedette (20)

Язык R: анализ и визуализация данных
Язык R: анализ и визуализация данныхЯзык R: анализ и визуализация данных
Язык R: анализ и визуализация данных
 
Dubinin opengov geodata
Dubinin opengov geodataDubinin opengov geodata
Dubinin opengov geodata
 
Место Postgres/PostGIS в экосистеме открытого ПО
Место Postgres/PostGIS в экосистеме  открытого ПОМесто Postgres/PostGIS в экосистеме  открытого ПО
Место Postgres/PostGIS в экосистеме открытого ПО
 
Esri и opensource
Esri и opensourceEsri и opensource
Esri и opensource
 
1 spss общие сведения
1 spss общие сведения1 spss общие сведения
1 spss общие сведения
 
Качество открытых государственных геоданных
Качество открытых государственных геоданныхКачество открытых государственных геоданных
Качество открытых государственных геоданных
 
R - как инструмент обработки и анализа пространственных данных
R - как инструмент обработки и анализа пространственных данныхR - как инструмент обработки и анализа пространственных данных
R - как инструмент обработки и анализа пространственных данных
 
Открытые ГИС: понемногу обо всём
Открытые ГИС: понемногу обо всёмОткрытые ГИС: понемногу обо всём
Открытые ГИС: понемногу обо всём
 
В,Л. Волохонский "Исследования в промышленном масштабе: сбор данных, обработк...
В,Л. Волохонский "Исследования в промышленном масштабе: сбор данных, обработк...В,Л. Волохонский "Исследования в промышленном масштабе: сбор данных, обработк...
В,Л. Волохонский "Исследования в промышленном масштабе: сбор данных, обработк...
 
ГИС с открытым исходным кодом: авантюра или бизнес? Взгляд со стороны разрабо...
ГИС с открытым исходным кодом: авантюра или бизнес? Взгляд со стороны разрабо...ГИС с открытым исходным кодом: авантюра или бизнес? Взгляд со стороны разрабо...
ГИС с открытым исходным кодом: авантюра или бизнес? Взгляд со стороны разрабо...
 
Совзонд. Опыт использования open source
Совзонд. Опыт использования open sourceСовзонд. Опыт использования open source
Совзонд. Опыт использования open source
 
Визуализация данных на географических картах - 2016
Визуализация данных на географических картах - 2016Визуализация данных на географических картах - 2016
Визуализация данных на географических картах - 2016
 
Прогнозирование - Лекция 7. Oracle Demand Management
Прогнозирование - Лекция 7. Oracle Demand ManagementПрогнозирование - Лекция 7. Oracle Demand Management
Прогнозирование - Лекция 7. Oracle Demand Management
 
Прогнозирование - Лекция 5. Методология Бокса-Дженкинса (модели ARIMA)
Прогнозирование - Лекция 5. Методология Бокса-Дженкинса (модели ARIMA)Прогнозирование - Лекция 5. Методология Бокса-Дженкинса (модели ARIMA)
Прогнозирование - Лекция 5. Методология Бокса-Дженкинса (модели ARIMA)
 
Прогнозирование - Лекция 1. Компьютерные инструменты прогнозирования
Прогнозирование - Лекция 1. Компьютерные инструменты прогнозированияПрогнозирование - Лекция 1. Компьютерные инструменты прогнозирования
Прогнозирование - Лекция 1. Компьютерные инструменты прогнозирования
 
Трансформация данных в Deductor Studio
Трансформация данных в Deductor StudioТрансформация данных в Deductor Studio
Трансформация данных в Deductor Studio
 
Прогнозирование - Лекция 6. Использование инструментов подготовки данных и пр...
Прогнозирование - Лекция 6. Использование инструментов подготовки данных и пр...Прогнозирование - Лекция 6. Использование инструментов подготовки данных и пр...
Прогнозирование - Лекция 6. Использование инструментов подготовки данных и пр...
 
Прогнозирование - Лекция 4. Регрессионные модели временных рядов
Прогнозирование - Лекция 4. Регрессионные модели временных рядовПрогнозирование - Лекция 4. Регрессионные модели временных рядов
Прогнозирование - Лекция 4. Регрессионные модели временных рядов
 
Визуализация данных. Аналитическая платформа Tableau.
Визуализация данных. Аналитическая платформа Tableau.Визуализация данных. Аналитическая платформа Tableau.
Визуализация данных. Аналитическая платформа Tableau.
 
Прогнозирование - Лекция 2. Корреляционный анализ и простая линейная регрессия
Прогнозирование - Лекция 2. Корреляционный анализ и простая линейная регрессияПрогнозирование - Лекция 2. Корреляционный анализ и простая линейная регрессия
Прогнозирование - Лекция 2. Корреляционный анализ и простая линейная регрессия
 

Similaire à OpenStreetMap address base: ready for prime time?

Map kathmandu osm nepal presentation - public
Map kathmandu osm nepal presentation - publicMap kathmandu osm nepal presentation - public
Map kathmandu osm nepal presentation - publicRajeev Amatya
 
Clickstream data with spark
Clickstream data with sparkClickstream data with spark
Clickstream data with sparkMarissa Saunders
 
зиновьев алексей Droid con(16_9) android geo apps_ in soviet russia latitude ...
зиновьев алексей Droid con(16_9) android geo apps_ in soviet russia latitude ...зиновьев алексей Droid con(16_9) android geo apps_ in soviet russia latitude ...
зиновьев алексей Droid con(16_9) android geo apps_ in soviet russia latitude ...apps4allru
 
Android Geo Apps in Soviet Russia: Latitude and longitude find you
Android Geo Apps in Soviet Russia: Latitude and longitude find youAndroid Geo Apps in Soviet Russia: Latitude and longitude find you
Android Geo Apps in Soviet Russia: Latitude and longitude find youAlexey Zinoviev
 
How and why governments should use OpenStreetMap - Pete Lancaster - State of ...
How and why governments should use OpenStreetMap - Pete Lancaster - State of ...How and why governments should use OpenStreetMap - Pete Lancaster - State of ...
How and why governments should use OpenStreetMap - Pete Lancaster - State of ...OSMFstateofthemap
 
The participation loop: helping citizens to get in
The participation loop: helping citizens to get in The participation loop: helping citizens to get in
The participation loop: helping citizens to get in Beniamino Murgante
 
“Accident Reconstruction” by Aleksis Liekna from Scope Technologies at Auto f...
“Accident Reconstruction” by Aleksis Liekna from Scope Technologies at Auto f...“Accident Reconstruction” by Aleksis Liekna from Scope Technologies at Auto f...
“Accident Reconstruction” by Aleksis Liekna from Scope Technologies at Auto f...DevClub_lv
 
Open Cities Africa (Accra) - Stakeholders OpenStreetMap Introduction
Open Cities Africa (Accra) - Stakeholders OpenStreetMap IntroductionOpen Cities Africa (Accra) - Stakeholders OpenStreetMap Introduction
Open Cities Africa (Accra) - Stakeholders OpenStreetMap IntroductionEnock Seth Nyamador
 

Similaire à OpenStreetMap address base: ready for prime time? (11)

Openstreetmap
OpenstreetmapOpenstreetmap
Openstreetmap
 
Map kathmandu osm nepal presentation - public
Map kathmandu osm nepal presentation - publicMap kathmandu osm nepal presentation - public
Map kathmandu osm nepal presentation - public
 
Clickstream data with spark
Clickstream data with sparkClickstream data with spark
Clickstream data with spark
 
зиновьев алексей Droid con(16_9) android geo apps_ in soviet russia latitude ...
зиновьев алексей Droid con(16_9) android geo apps_ in soviet russia latitude ...зиновьев алексей Droid con(16_9) android geo apps_ in soviet russia latitude ...
зиновьев алексей Droid con(16_9) android geo apps_ in soviet russia latitude ...
 
Android Geo Apps in Soviet Russia: Latitude and longitude find you
Android Geo Apps in Soviet Russia: Latitude and longitude find youAndroid Geo Apps in Soviet Russia: Latitude and longitude find you
Android Geo Apps in Soviet Russia: Latitude and longitude find you
 
How and why governments should use OpenStreetMap - Pete Lancaster - State of ...
How and why governments should use OpenStreetMap - Pete Lancaster - State of ...How and why governments should use OpenStreetMap - Pete Lancaster - State of ...
How and why governments should use OpenStreetMap - Pete Lancaster - State of ...
 
The participation loop: helping citizens to get in
The participation loop: helping citizens to get in The participation loop: helping citizens to get in
The participation loop: helping citizens to get in
 
“Accident Reconstruction” by Aleksis Liekna from Scope Technologies at Auto f...
“Accident Reconstruction” by Aleksis Liekna from Scope Technologies at Auto f...“Accident Reconstruction” by Aleksis Liekna from Scope Technologies at Auto f...
“Accident Reconstruction” by Aleksis Liekna from Scope Technologies at Auto f...
 
Intro to Spatial data
Intro to Spatial data Intro to Spatial data
Intro to Spatial data
 
Open Cities Africa (Accra) - Stakeholders OpenStreetMap Introduction
Open Cities Africa (Accra) - Stakeholders OpenStreetMap IntroductionOpen Cities Africa (Accra) - Stakeholders OpenStreetMap Introduction
Open Cities Africa (Accra) - Stakeholders OpenStreetMap Introduction
 
OpenStreetMap Workshop, Accra
OpenStreetMap Workshop, AccraOpenStreetMap Workshop, Accra
OpenStreetMap Workshop, Accra
 

Dernier

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 

Dernier (20)

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 

OpenStreetMap address base: ready for prime time?

  • 1. OpenStreetMap address base: ready for prime time? Maxim Dubinin sim@gis-lab.info SotM Baltics 2013
  • 3. 3 из 24 Can OpenStreetMap address database be used to create complete geographic datasets? The Question
  • 4. 4 из 24 ● Creation of data layers for different features ● Large areas (Russia) ● Thousands objects ● Practical applications Setup
  • 5. 5 из 24 1. How good is OSM address database and fully automatic geocoding? 2. How much does postprocessing help? 3. How is completeness distributed across Russia? 4. How good is the quality compared to other geocoders? You will find answers here for...
  • 6. 6 из 24 ● What are the mistakes of geocoding and how it can be improved? ● What is the right scheme for addressing? ● When will OSM take over the world? ...but, nothing about...
  • 7. 7 из 24 Result — correct lat/long for an address Result ~ data preparation + geocoding + postprocessing ● Data prep — make well structured address ● Geocoding — find lat/long for it with osm.org.ru ● Postprocessing — fix it manually if wrong Some definitions
  • 8. 8 из 24 ● OpenPolice — where are the local cops in Moscow ● Elections — where are the voting stations in Moscow ● Orphanages — where are the children orphanages in Russia Examples
  • 9. 9 из 24 1.How good is OSM address database and fully automatic geocoding? 2.How much does postprocessing help? 3.How is quality distributed across Russia? 4.How good is the quality compared to other geocoders? Question 1
  • 10. 10 из 24 OpenPolice ● Extract all addresses from 112.ru ● Geocode them ● Relate them to buildings in Moscow to get areas of responsibility http://gis-lab.info/qa/openpolice.html
  • 11. 11 из 24 Results ● Total: ~41000 addresses in Moscow
  • 12. 12 из 24 1.How good is OSM address database and fully automatic geocoding? 2.How much does postprocessing help? 3.How is quality distributed across Russia? 4.How good is the quality compared to other geocoders? Question 2
  • 13. 13 из 24 Voting comissions ● Extract all addresses from public database ● Geocode them ● Crowdsource post-processing http://uikgeo.gis-lab.info
  • 14. 14 из 24 Results ● Total: ~3500 addresses in Moscow ● Before post-processing VS after post-processing
  • 15. 15 из 24 1.How good is OSM address database and fully automatic geocoding? 2.How much does postprocessing help? 3.How is completeness distributed across Russia? 4.How good is quality compared to other geocoders? Question 3
  • 16. 16 из 24 Orphanages ● Extract all addresses from public database ● Geocode and post-process them ● All regions of Russia, ~5000 orphanages total, mean 50 per region http://gis-lab.info/qa/detdom.html
  • 17. 17 из 24 Orphanages ● Buildings before and after post-proc, % total
  • 18. 18 из 24 Orphanages ● Buildings and streets before and after post-proc, % total
  • 19. 19 из 24 1.How good is OSM address database and fully automatic geocoding? 2.How much does postprocessing help? 3.How is quality distributed across Russia? 4.How good is the quality compared to other geocoders? Question 4
  • 20. 20 из 24 No project, just comparison ● Take few hundreds of addresses in different parts of Russia ● Geocode them with OSM and Yandex ● For each point, assign score: Building = 3, street = 2, settlement = 1 ● Sum the scores up ● Compare
  • 21. 21 из 24 OSM vs Yandex ● Summed scores for geocoding accuracy
  • 22. 22 из 24 Yandex ● Yandex People's map contribution to total score
  • 23. 23 из 24 ● Map more ;) ● Improve automatic geocoding ● Create positive feedback loop with geocoding projects How to get better?