2. OpenCage FOSSGIS 2015
Overview
I. place name disambiguation (homonyms)
– with & without spellcheck
II. Nominatim
III. other (open data) geocoders
– 2015 trends
– opportunities to share data, config, tests
IV. shared ranking/scoring data
17. OpenCage FOSSGIS 2015
Nominatim
●
OSM data, minutely updates
●
+ UK postal codes, TIGER
●
1TB PostGIS
●
import in C, setup scripts in PHP, Postgres stored
procedures, PHP frontend, Python&PHP test suite
●
autocomplete if you add Photon geocoder
●
no spellcheck
19. OpenCage FOSSGIS 2015
other geocoders
Closed source Open source, high resources Open source, low resources
Google Maps Mapzen “Pelias” OpenStreetMap “Nominatim”
Bing/Yahoo Mapbox “Carmen” OpenCage (multiple)
Mapquest Mapquest open (Nominatim) geonames
ESRI/ArcGIS Online Foursquare “Quattroshapes” geocod.io (Tiger data)
Baidu Scout Photon (Nominatim)
Yandex Cloudmade geo.io (Nominatim)
TomTom DSTK (Tiger, geonames)
Amazon (Android only) SmartyStreets
Telenav ...
Nokia/Ovi/Here
Apple (iOS only)
...
20. OpenCage FOSSGIS 2015
trends
●
SSD
●
Add commercial sources
●
Full builds, downloadable index
●
High parallel (map/reduce, nodejs), cloud scaling,
noSQL
●
Community building, guidelines
●
Test suites
21. OpenCage FOSSGIS 2015
typical features to improve
●
horizontal scaling
●
autocomplete
●
spellcheck
●
improve text parsing (App 3, 111-113b)
●
crossings (Main & 2nd N, New Orleans)
●
“4km north of $cityname on the N6”
●
tests for non-latin alphabets
●
postal code boundaries
●
localsearch/POIs
22. OpenCage FOSSGIS 2015
what should be shared
●
aka. don't reinvent everything
●
standard test suite to compare geocoders
●
hierarchy data
●
address parsing
●
address formatting
●
language configuration
●
data parsing, e.g. OSM tags
32. OpenCage FOSSGIS 2015
name is one of many factors
ranking examples:
●
Altona
– type: suburb vs train station vs town ins US/Canada
●
Germany
– admin_level=2 (country) vs island
●
Mt everest
– importance: viewpoint vs peak vs island
●
Oktoberfest
– actually a alt_name of Theresienwiese
●
Königsberg
– 10x a peak, 1x old_name of Kaliningrad
●
Hitlerberg
– old_name:1934-1945 of Heigelkopf
33. OpenCage FOSSGIS 2015
status on wikipedia_articles.bin
●
version 1: wikipedia pageview logs
– https://en.wikipedia.org/wiki/Wikipedia:Notability
●
version 2 (current): parsing wikipedia articles and count links
– last updated 2013
– 80m wikipedia entries + 15m redirects
– 0.6m places in OSM have wikipedia tag set (2013: 0.4m)
●
Version 3 (TBD): parsing wikipedia geo exports
– http://de.wikipedia.org/wiki/Wikipedia:WikiProjekt_Georeferenzierung/Haupts
eite/Wikipedia-World/en
– 3.4m entries, more languages, regular dumps, new documentaton
●
version 4 (?)
- used wikidata exports
- used by multiple geocoders
34. OpenCage FOSSGIS 2015
what can mappers do?
●
add wikipedia tags
●
fix administrative levels
●
don't add wrong names (typos)
●
file bugs (github)
http://nominatim.openstreetmap.org/