SlideShare une entreprise Scribd logo
1  sur  35
Télécharger pour lire hors ligne
Monitoring data
consistency in
OpenStreetMap using its
spatial features and tags
semantics
Alfonso Crisci - IBIMET CNR
Maurizio Napolitano - DCL FBK Trento
Francesca De Chiara - DCL FBK Trento
Cristian Consonni - DCL FBK Trento
George Kingsley Zipf
(PD, https://commons.wikimedia.org/wiki/File:George_Kingsley_Zipf_1917.jpg)
Backgrounds #osm3words
OpenstreetMap is a
language of free
representation
of real geographical entities
to build visual patterns
called maps where user
communities works in
participatory style.
Aims build a local areal approach
Do metrics exist to manage OSM
spatial and textual informative
complexity?
Which are the
candidates?
● to build customized
guidelines for thematic
mapping
● to help areal OSM fill gapping
strategies
● detect spatial and informative
gaps
Targets
Parameters looking up the most interesting
• Fractal Dimension Is it possible to measure spatial complexity of
OSM feature?
• Lacunarity Is it possible to identify where OSM contributions have
spatials gaps and how they change over time?
• Textual informative density Is semantics of textual descriptors (keys
and tags) informative?
• Diversity and Dissimilarity Is it possible to detect semantic
differences among different areas/communities at various spatial
scales?
What’s Lacunarity : measure of spatial pattern voids
Gliding Box lacunarity
( Allain Cloitre 1991)
Images & definitions
Marco Diego DOMINIETTO
ETH Zurich
Multimodality Approach To Study
The Fractal Physiology Of Tumor Angiogenesis
Same image complexity different lacunarity
Lacunarity
It is a pattern design analytical
tool and can be defined as a
complementary measure of
fractal dimension.
It allows to distinguish spatial
patterns through the analysis of
their gap (pixel void)
distribution at different scales.
Is rotational invariant ma as
function of the scale.
Information Entropy & Zipf plot: semantic analisys of OSM ‘s wordsets
Textual information density
Zipf's law states that given some corpus of natural language utterances, the
frequency of any word is inversely proportional to its rank in the frequency
table. http://en.wikipedia.org/wiki/Zipf%27s_law
x is the rank of a word in the frequency table;
y is the total number of the word’s occurrences (frequency).
From OSM data is possible to retrieve textual
corpus ( set of words) of keys, tags, keyvalue)
for every bounded area. Two action are possible:
Zipf plot : Description of word set in terms of
distribution of terms. Rare terms detection.
Information entropy : to detect indirectly
textual information density ( Shannon entropy)
http://en.wikipedia.org/wiki/Entropy_%28information_theory%29
Tools analitical framework for OSM data
Osmconvert
Osmfilter
Nepal Civic Hacker @prabhasp
http://prabhasp.github.io/OSMTimeLapseR
tm & ZipfR & LanguageR & qdap &wordcloud
Openstreetmap & Osmar & fractaldim
Urbanisation Regime and Environmental Impact: Analysis and Modelling
of Urban Patterns, Clustering and Metamorphoses
GDAL lacunarity and fractal dimensions
Spatial-tools library
Christian Kaiser
http://github.com/christiankaiser/spatial-tools
raster & rgdal & spatstat
R packages
http://github.com/alfcrisci/osm_analitics
Areas Zoom.level 12 Scale 1:150,000 Admin-centre centered
Trento Northern
Italy
Florence Central Italy Matera Southern Italy
OSMTimeLapseR
Medium city
Large Community
High density of
features
Large urban area
Large Community
High density of
features
Small urban area
Young Community
Recent mapping
OSM History Data preview
Raster Density Maps
A. Feature density
B. Users density
( at least one edit)
A. Version Count density
A. Local complexity
Fractal dimension isoentropic
method
Davies and Hall (1999)
Lexical Analysis
a. Zipf plot keys
b. Wordcloud keys
c. Histogram keys/ N_users
d. Venn diagram keys/user
e. Clustering users by key
Temporal Evolution
I. Year Feature amount
II. Year Lacunarity index
Tag Lexical Analysis
a. Zipf plot of selected key-values
b. Lexical diversity by keys
c. Treemap users by key
d. Treemap values by key
e. Word-network of user by keys
Aerial view Trento spatial resolution 20 m
Feature
density
Users density
Aerial view Trento spatial resolution 20 m
Version Count density
Local complexity
(pixel-area where complexity is lower <2 )
Aerial view Florence spatial resolution 100 m
Feature density Users density
Aerial view Florence spatial resolution 100 m
Version Count density Local complexity
(pixel-area where complexity is lower <2 )
Aerial view Matera spatial resolution 100 m
Feature density Users density
Areal view Matera spatial resolution 100 m
Version Count density Local complexity
(pixel-area where complexity is lower <2 )
Lexical analysis (1) Trento
Lexical analysis (2) Trento
Lexical analysis (2) Trento
Lexical analysis (1) Florence
Lexical analisys (2) Florence
Lexical analysis (2) Firenze
Lexical analysis (1) Matera
Lexical analysis (2) Matera
Lexical analysis (2) Matera
Tags insight Amenity in Trento
Information entropy Diversity
MATERA FIRENZE
"shop","amenity","tourism","man_made","natural","l
eisure","landuse","wikipedia" "shop","amenity","tourism","man_made"
Using several diversity index corpus of values for keys is possible to see the different use of tags in cities
Tags insight Amenity in Trento
Temporal view Trento
Temporal view Firenze
Temporal view Matera
Main Findings
• Spatial complexities in OSM for a specific area could be detected and monitored in
space and time by using complexity metric.
•The lacunarity decay show well the OSM informativity growth but its reliability
depends by the spatial scale used. In densely mapped areas small resolutions are
required (20 m or 10 m).
•Lacunarity thresholds for OSM quality assessment needs further investigations in
relation to the zoom level involved and the keys ( tags) monitored.
•Local fractal dimension indicates well where are area with a low complexity.
Main Findings
•Lexical statistical frameworks works with OSM data and describe their informativity
and the differences that exist among areal communities.
•Textual informativity parameters show the general terms’ abundancy in OSM and
demonstrate that is a really rich informative environment .
• Areal keyset, and only in certain tags, follow a natural language distribution ( Zipf’s
law emerges!) and integrating Information entropy analysis for different spatial scales (
zoom level) is possible to infer on information suitability of the area done by OSM
users’ community.
•All kind of investigation must ever to take into account population and user density
Conclusions
•Spatial and textual “complexity” parameters seems promising tools to help
the assessment of data quality in specific area.
•They main role is to quantify the amount but need to be linked with other
areal metrics ( population & mappers density, OSM feature density).
•The need is to define proper metrics linked to these parameters presented
…...to create osm services as well.
•Suggestions are welcome!
Contacts
Thank you!
Contacts:
Alfonso Crisci
mail: :
a.crisci@ibimet.cnr.italfcrisci@gmail.com
@alfcrisci
Download presentation
http://www.slideshare.net/alfcrisci/sot-m-eu2014crisci
Appendix Fractal dimension: measure of spatial complexity state
A fractal dimension is a ratio providing a
statistical index of complexity comparing how
detail in a pattern (strictly speaking, a fractal
pattern) changes with the scale at which it is
measured.
http://en.wikipedia.org/wiki/Fractal_Dimension
Images
Marco Diego DOMINIETTO
ETH Zurich
Multimodality Approach To Study
The Fractal Physiology Of Tumor
Angiogenesis
Batty, M., and Longley, P. (1994). Fractal Cities: A Geometry of Form and
Function, Academic Press, San Diego, CA, at www.fractalcities.org
We need much better statistics that pertain to the
different kinds of dynamics and their variation over
time and space. (Batty,1994)

Contenu connexe

Similaire à Data Consistency in OpenStreetMap

Spatial analysis and Analysis Tools
Spatial analysis and Analysis ToolsSpatial analysis and Analysis Tools
Spatial analysis and Analysis ToolsSwapnil Shrivastav
 
An Open Source Java Code For Visualizing Supply Chain Problems
An Open Source Java Code For Visualizing Supply Chain ProblemsAn Open Source Java Code For Visualizing Supply Chain Problems
An Open Source Java Code For Visualizing Supply Chain Problemsertekg
 
Supporting Geo-Ontology Engineering through Spatial Data Analytics
Supporting Geo-Ontology Engineering through Spatial Data Analytics Supporting Geo-Ontology Engineering through Spatial Data Analytics
Supporting Geo-Ontology Engineering through Spatial Data Analytics paganibr
 
Supporting Geo-Ontology Engineering through Spatial Data Analytics
Supporting Geo-Ontology Engineering through Spatial Data AnalyticsSupporting Geo-Ontology Engineering through Spatial Data Analytics
Supporting Geo-Ontology Engineering through Spatial Data AnalyticsIrene Celino
 
Francesca Froy "What is the role of spatial configuration and urban morpholog...
Francesca Froy "What is the role of spatial configuration and urban morpholog...Francesca Froy "What is the role of spatial configuration and urban morpholog...
Francesca Froy "What is the role of spatial configuration and urban morpholog...HannahParr3
 
Fuzzy foss4g 2006 tim waters poster
Fuzzy foss4g 2006 tim waters posterFuzzy foss4g 2006 tim waters poster
Fuzzy foss4g 2006 tim waters posterchippy
 
Sign Language Recognition with Gesture Analysis
Sign Language Recognition with Gesture AnalysisSign Language Recognition with Gesture Analysis
Sign Language Recognition with Gesture Analysispaperpublications3
 
A Data Scientist Exploration in the World of Heterogeneous Open Geospatial Data
A Data Scientist Exploration in the World of Heterogeneous Open Geospatial DataA Data Scientist Exploration in the World of Heterogeneous Open Geospatial Data
A Data Scientist Exploration in the World of Heterogeneous Open Geospatial DataGloria Re Calegari
 
20131106 acm geocrowd
20131106 acm geocrowd20131106 acm geocrowd
20131106 acm geocrowdDongpo Deng
 
Weather events identification in social media streams: tools to detect their ...
Weather events identification in social media streams: tools to detect their ...Weather events identification in social media streams: tools to detect their ...
Weather events identification in social media streams: tools to detect their ...Alfonso Crisci
 
No more BITS - Blind Insignificant Technologies ands Systems by Roger Roberts...
No more BITS - Blind Insignificant Technologies ands Systems by Roger Roberts...No more BITS - Blind Insignificant Technologies ands Systems by Roger Roberts...
No more BITS - Blind Insignificant Technologies ands Systems by Roger Roberts...ACTUONDA
 
Urban Analysis for the XXI Century: Using Pervasive Infrastructures for Model...
Urban Analysis for the XXI Century: Using Pervasive Infrastructures for Model...Urban Analysis for the XXI Century: Using Pervasive Infrastructures for Model...
Urban Analysis for the XXI Century: Using Pervasive Infrastructures for Model...Enrique Frias-Martinez
 
Anatomical Survey Based Feature Vector for Text Pattern Detection
Anatomical Survey Based Feature Vector for Text Pattern DetectionAnatomical Survey Based Feature Vector for Text Pattern Detection
Anatomical Survey Based Feature Vector for Text Pattern DetectionIJEACS
 
5B_1_Neogeography for the rural urban classification of england and wales
5B_1_Neogeography for the rural urban classification of england and wales5B_1_Neogeography for the rural urban classification of england and wales
5B_1_Neogeography for the rural urban classification of england and walesGISRUK conference
 
Development of the database, the website and the online transcription platfor...
Development of the database, the website and the online transcription platfor...Development of the database, the website and the online transcription platfor...
Development of the database, the website and the online transcription platfor...Itinera Nova
 
cxcxc program ssk-cug 2010 - standardized systematization of knowledge via ...
cxcxc program   ssk-cug 2010 - standardized systematization of knowledge via ...cxcxc program   ssk-cug 2010 - standardized systematization of knowledge via ...
cxcxc program ssk-cug 2010 - standardized systematization of knowledge via ...Ionel Gabriel Niculescu
 
Exploiting Natural Language Definitions and (Legacy) Data for Facilitating Ag...
Exploiting Natural Language Definitions and (Legacy) Data for Facilitating Ag...Exploiting Natural Language Definitions and (Legacy) Data for Facilitating Ag...
Exploiting Natural Language Definitions and (Legacy) Data for Facilitating Ag...Christophe Debruyne
 
Polyphonic images of the city, mapping human landscapes through user generate...
Polyphonic images of the city, mapping human landscapes through user generate...Polyphonic images of the city, mapping human landscapes through user generate...
Polyphonic images of the city, mapping human landscapes through user generate...Giorgia Lupi
 

Similaire à Data Consistency in OpenStreetMap (20)

Spatial analysis and Analysis Tools
Spatial analysis and Analysis ToolsSpatial analysis and Analysis Tools
Spatial analysis and Analysis Tools
 
An Open Source Java Code For Visualizing Supply Chain Problems
An Open Source Java Code For Visualizing Supply Chain ProblemsAn Open Source Java Code For Visualizing Supply Chain Problems
An Open Source Java Code For Visualizing Supply Chain Problems
 
Supporting Geo-Ontology Engineering through Spatial Data Analytics
Supporting Geo-Ontology Engineering through Spatial Data Analytics Supporting Geo-Ontology Engineering through Spatial Data Analytics
Supporting Geo-Ontology Engineering through Spatial Data Analytics
 
Supporting Geo-Ontology Engineering through Spatial Data Analytics
Supporting Geo-Ontology Engineering through Spatial Data AnalyticsSupporting Geo-Ontology Engineering through Spatial Data Analytics
Supporting Geo-Ontology Engineering through Spatial Data Analytics
 
Francesca Froy "What is the role of spatial configuration and urban morpholog...
Francesca Froy "What is the role of spatial configuration and urban morpholog...Francesca Froy "What is the role of spatial configuration and urban morpholog...
Francesca Froy "What is the role of spatial configuration and urban morpholog...
 
Fuzzy foss4g 2006 tim waters poster
Fuzzy foss4g 2006 tim waters posterFuzzy foss4g 2006 tim waters poster
Fuzzy foss4g 2006 tim waters poster
 
Sign Language Recognition with Gesture Analysis
Sign Language Recognition with Gesture AnalysisSign Language Recognition with Gesture Analysis
Sign Language Recognition with Gesture Analysis
 
A Data Scientist Exploration in the World of Heterogeneous Open Geospatial Data
A Data Scientist Exploration in the World of Heterogeneous Open Geospatial DataA Data Scientist Exploration in the World of Heterogeneous Open Geospatial Data
A Data Scientist Exploration in the World of Heterogeneous Open Geospatial Data
 
2 prayla
2 prayla2 prayla
2 prayla
 
2 prayla
2 prayla2 prayla
2 prayla
 
20131106 acm geocrowd
20131106 acm geocrowd20131106 acm geocrowd
20131106 acm geocrowd
 
Weather events identification in social media streams: tools to detect their ...
Weather events identification in social media streams: tools to detect their ...Weather events identification in social media streams: tools to detect their ...
Weather events identification in social media streams: tools to detect their ...
 
No more BITS - Blind Insignificant Technologies ands Systems by Roger Roberts...
No more BITS - Blind Insignificant Technologies ands Systems by Roger Roberts...No more BITS - Blind Insignificant Technologies ands Systems by Roger Roberts...
No more BITS - Blind Insignificant Technologies ands Systems by Roger Roberts...
 
Urban Analysis for the XXI Century: Using Pervasive Infrastructures for Model...
Urban Analysis for the XXI Century: Using Pervasive Infrastructures for Model...Urban Analysis for the XXI Century: Using Pervasive Infrastructures for Model...
Urban Analysis for the XXI Century: Using Pervasive Infrastructures for Model...
 
Anatomical Survey Based Feature Vector for Text Pattern Detection
Anatomical Survey Based Feature Vector for Text Pattern DetectionAnatomical Survey Based Feature Vector for Text Pattern Detection
Anatomical Survey Based Feature Vector for Text Pattern Detection
 
5B_1_Neogeography for the rural urban classification of england and wales
5B_1_Neogeography for the rural urban classification of england and wales5B_1_Neogeography for the rural urban classification of england and wales
5B_1_Neogeography for the rural urban classification of england and wales
 
Development of the database, the website and the online transcription platfor...
Development of the database, the website and the online transcription platfor...Development of the database, the website and the online transcription platfor...
Development of the database, the website and the online transcription platfor...
 
cxcxc program ssk-cug 2010 - standardized systematization of knowledge via ...
cxcxc program   ssk-cug 2010 - standardized systematization of knowledge via ...cxcxc program   ssk-cug 2010 - standardized systematization of knowledge via ...
cxcxc program ssk-cug 2010 - standardized systematization of knowledge via ...
 
Exploiting Natural Language Definitions and (Legacy) Data for Facilitating Ag...
Exploiting Natural Language Definitions and (Legacy) Data for Facilitating Ag...Exploiting Natural Language Definitions and (Legacy) Data for Facilitating Ag...
Exploiting Natural Language Definitions and (Legacy) Data for Facilitating Ag...
 
Polyphonic images of the city, mapping human landscapes through user generate...
Polyphonic images of the city, mapping human landscapes through user generate...Polyphonic images of the city, mapping human landscapes through user generate...
Polyphonic images of the city, mapping human landscapes through user generate...
 

Plus de Alfonso Crisci

monitoraggio isola di calore
monitoraggio isola di caloremonitoraggio isola di calore
monitoraggio isola di caloreAlfonso Crisci
 
Mappatura_hotspot_termici
Mappatura_hotspot_termiciMappatura_hotspot_termici
Mappatura_hotspot_termiciAlfonso Crisci
 
Ecosemiotica del territorio
Ecosemiotica del territorioEcosemiotica del territorio
Ecosemiotica del territorioAlfonso Crisci
 
Sistemi digitali per la geolocalizzazione di piante aromatiche e medicinali
Sistemi digitali per la geolocalizzazione di piante aromatiche e medicinali Sistemi digitali per la geolocalizzazione di piante aromatiche e medicinali
Sistemi digitali per la geolocalizzazione di piante aromatiche e medicinali Alfonso Crisci
 
La città che scotta: le prospettive e i dati per la valutazione della resilie...
La città che scotta: le prospettive e i dati per la valutazione della resilie...La città che scotta: le prospettive e i dati per la valutazione della resilie...
La città che scotta: le prospettive e i dati per la valutazione della resilie...Alfonso Crisci
 
Classificazione tipi di tempo e Alluvioni in Toscana
Classificazione tipi di tempo e Alluvioni in ToscanaClassificazione tipi di tempo e Alluvioni in Toscana
Classificazione tipi di tempo e Alluvioni in ToscanaAlfonso Crisci
 
Summer Heat Risk Index: how to integrate recent climatic changes and soil ...
Summer Heat Risk Index:    how to integrate recent climatic changes and soil ...Summer Heat Risk Index:    how to integrate recent climatic changes and soil ...
Summer Heat Risk Index: how to integrate recent climatic changes and soil ...Alfonso Crisci
 
Public crowd-sensing of heat-waves by social media data
Public crowd-sensing of heat-waves by social media dataPublic crowd-sensing of heat-waves by social media data
Public crowd-sensing of heat-waves by social media dataAlfonso Crisci
 
Cemento e l'eroica vendetta del letame
Cemento e l'eroica vendetta del letameCemento e l'eroica vendetta del letame
Cemento e l'eroica vendetta del letameAlfonso Crisci
 
Heat Wave risk mapping in Europe for elderly people
Heat Wave risk mapping in Europe for elderly peopleHeat Wave risk mapping in Europe for elderly people
Heat Wave risk mapping in Europe for elderly peopleAlfonso Crisci
 
IBIMET Heat WAVE resiliency
IBIMET Heat WAVE resiliency IBIMET Heat WAVE resiliency
IBIMET Heat WAVE resiliency Alfonso Crisci
 
Flyers Smart Cities and Big Data
Flyers Smart Cities and Big Data Flyers Smart Cities and Big Data
Flyers Smart Cities and Big Data Alfonso Crisci
 

Plus de Alfonso Crisci (20)

monitoraggio isola di calore
monitoraggio isola di caloremonitoraggio isola di calore
monitoraggio isola di calore
 
Terrazzi
TerrazziTerrazzi
Terrazzi
 
Mappatura_hotspot_termici
Mappatura_hotspot_termiciMappatura_hotspot_termici
Mappatura_hotspot_termici
 
Ecosemiotica del territorio
Ecosemiotica del territorioEcosemiotica del territorio
Ecosemiotica del territorio
 
Complessità nascoste
Complessità nascosteComplessità nascoste
Complessità nascoste
 
Sistemi digitali per la geolocalizzazione di piante aromatiche e medicinali
Sistemi digitali per la geolocalizzazione di piante aromatiche e medicinali Sistemi digitali per la geolocalizzazione di piante aromatiche e medicinali
Sistemi digitali per la geolocalizzazione di piante aromatiche e medicinali
 
Mappiamo biodiversita
Mappiamo biodiversitaMappiamo biodiversita
Mappiamo biodiversita
 
Resilienza climatica
Resilienza climaticaResilienza climatica
Resilienza climatica
 
La città che scotta: le prospettive e i dati per la valutazione della resilie...
La città che scotta: le prospettive e i dati per la valutazione della resilie...La città che scotta: le prospettive e i dati per la valutazione della resilie...
La città che scotta: le prospettive e i dati per la valutazione della resilie...
 
Ibimet sommerso
Ibimet sommersoIbimet sommerso
Ibimet sommerso
 
Classificazione tipi di tempo e Alluvioni in Toscana
Classificazione tipi di tempo e Alluvioni in ToscanaClassificazione tipi di tempo e Alluvioni in Toscana
Classificazione tipi di tempo e Alluvioni in Toscana
 
Summer Heat Risk Index: how to integrate recent climatic changes and soil ...
Summer Heat Risk Index:    how to integrate recent climatic changes and soil ...Summer Heat Risk Index:    how to integrate recent climatic changes and soil ...
Summer Heat Risk Index: how to integrate recent climatic changes and soil ...
 
Public crowd-sensing of heat-waves by social media data
Public crowd-sensing of heat-waves by social media dataPublic crowd-sensing of heat-waves by social media data
Public crowd-sensing of heat-waves by social media data
 
Italian weather type
Italian weather typeItalian weather type
Italian weather type
 
Not only rome burns
Not only rome burnsNot only rome burns
Not only rome burns
 
Cemento e l'eroica vendetta del letame
Cemento e l'eroica vendetta del letameCemento e l'eroica vendetta del letame
Cemento e l'eroica vendetta del letame
 
#SoilDay Roma
#SoilDay Roma #SoilDay Roma
#SoilDay Roma
 
Heat Wave risk mapping in Europe for elderly people
Heat Wave risk mapping in Europe for elderly peopleHeat Wave risk mapping in Europe for elderly people
Heat Wave risk mapping in Europe for elderly people
 
IBIMET Heat WAVE resiliency
IBIMET Heat WAVE resiliency IBIMET Heat WAVE resiliency
IBIMET Heat WAVE resiliency
 
Flyers Smart Cities and Big Data
Flyers Smart Cities and Big Data Flyers Smart Cities and Big Data
Flyers Smart Cities and Big Data
 

Dernier

From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsVICTOR MAESTRE RAMIREZ
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceSapana Sha
 
IMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptxIMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptxdolaknnilon
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 

Dernier (20)

From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts Service
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
IMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptxIMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptx
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 

Data Consistency in OpenStreetMap

  • 1. Monitoring data consistency in OpenStreetMap using its spatial features and tags semantics Alfonso Crisci - IBIMET CNR Maurizio Napolitano - DCL FBK Trento Francesca De Chiara - DCL FBK Trento Cristian Consonni - DCL FBK Trento George Kingsley Zipf (PD, https://commons.wikimedia.org/wiki/File:George_Kingsley_Zipf_1917.jpg)
  • 2. Backgrounds #osm3words OpenstreetMap is a language of free representation of real geographical entities to build visual patterns called maps where user communities works in participatory style.
  • 3. Aims build a local areal approach Do metrics exist to manage OSM spatial and textual informative complexity? Which are the candidates? ● to build customized guidelines for thematic mapping ● to help areal OSM fill gapping strategies ● detect spatial and informative gaps Targets
  • 4. Parameters looking up the most interesting • Fractal Dimension Is it possible to measure spatial complexity of OSM feature? • Lacunarity Is it possible to identify where OSM contributions have spatials gaps and how they change over time? • Textual informative density Is semantics of textual descriptors (keys and tags) informative? • Diversity and Dissimilarity Is it possible to detect semantic differences among different areas/communities at various spatial scales?
  • 5. What’s Lacunarity : measure of spatial pattern voids Gliding Box lacunarity ( Allain Cloitre 1991) Images & definitions Marco Diego DOMINIETTO ETH Zurich Multimodality Approach To Study The Fractal Physiology Of Tumor Angiogenesis Same image complexity different lacunarity Lacunarity It is a pattern design analytical tool and can be defined as a complementary measure of fractal dimension. It allows to distinguish spatial patterns through the analysis of their gap (pixel void) distribution at different scales. Is rotational invariant ma as function of the scale.
  • 6. Information Entropy & Zipf plot: semantic analisys of OSM ‘s wordsets Textual information density Zipf's law states that given some corpus of natural language utterances, the frequency of any word is inversely proportional to its rank in the frequency table. http://en.wikipedia.org/wiki/Zipf%27s_law x is the rank of a word in the frequency table; y is the total number of the word’s occurrences (frequency). From OSM data is possible to retrieve textual corpus ( set of words) of keys, tags, keyvalue) for every bounded area. Two action are possible: Zipf plot : Description of word set in terms of distribution of terms. Rare terms detection. Information entropy : to detect indirectly textual information density ( Shannon entropy) http://en.wikipedia.org/wiki/Entropy_%28information_theory%29
  • 7. Tools analitical framework for OSM data Osmconvert Osmfilter Nepal Civic Hacker @prabhasp http://prabhasp.github.io/OSMTimeLapseR tm & ZipfR & LanguageR & qdap &wordcloud Openstreetmap & Osmar & fractaldim Urbanisation Regime and Environmental Impact: Analysis and Modelling of Urban Patterns, Clustering and Metamorphoses GDAL lacunarity and fractal dimensions Spatial-tools library Christian Kaiser http://github.com/christiankaiser/spatial-tools raster & rgdal & spatstat R packages http://github.com/alfcrisci/osm_analitics
  • 8. Areas Zoom.level 12 Scale 1:150,000 Admin-centre centered Trento Northern Italy Florence Central Italy Matera Southern Italy OSMTimeLapseR Medium city Large Community High density of features Large urban area Large Community High density of features Small urban area Young Community Recent mapping
  • 9. OSM History Data preview Raster Density Maps A. Feature density B. Users density ( at least one edit) A. Version Count density A. Local complexity Fractal dimension isoentropic method Davies and Hall (1999) Lexical Analysis a. Zipf plot keys b. Wordcloud keys c. Histogram keys/ N_users d. Venn diagram keys/user e. Clustering users by key Temporal Evolution I. Year Feature amount II. Year Lacunarity index Tag Lexical Analysis a. Zipf plot of selected key-values b. Lexical diversity by keys c. Treemap users by key d. Treemap values by key e. Word-network of user by keys
  • 10. Aerial view Trento spatial resolution 20 m Feature density Users density
  • 11. Aerial view Trento spatial resolution 20 m Version Count density Local complexity (pixel-area where complexity is lower <2 )
  • 12. Aerial view Florence spatial resolution 100 m Feature density Users density
  • 13. Aerial view Florence spatial resolution 100 m Version Count density Local complexity (pixel-area where complexity is lower <2 )
  • 14. Aerial view Matera spatial resolution 100 m Feature density Users density
  • 15. Areal view Matera spatial resolution 100 m Version Count density Local complexity (pixel-area where complexity is lower <2 )
  • 25. Tags insight Amenity in Trento
  • 26. Information entropy Diversity MATERA FIRENZE "shop","amenity","tourism","man_made","natural","l eisure","landuse","wikipedia" "shop","amenity","tourism","man_made" Using several diversity index corpus of values for keys is possible to see the different use of tags in cities
  • 27. Tags insight Amenity in Trento
  • 31. Main Findings • Spatial complexities in OSM for a specific area could be detected and monitored in space and time by using complexity metric. •The lacunarity decay show well the OSM informativity growth but its reliability depends by the spatial scale used. In densely mapped areas small resolutions are required (20 m or 10 m). •Lacunarity thresholds for OSM quality assessment needs further investigations in relation to the zoom level involved and the keys ( tags) monitored. •Local fractal dimension indicates well where are area with a low complexity.
  • 32. Main Findings •Lexical statistical frameworks works with OSM data and describe their informativity and the differences that exist among areal communities. •Textual informativity parameters show the general terms’ abundancy in OSM and demonstrate that is a really rich informative environment . • Areal keyset, and only in certain tags, follow a natural language distribution ( Zipf’s law emerges!) and integrating Information entropy analysis for different spatial scales ( zoom level) is possible to infer on information suitability of the area done by OSM users’ community. •All kind of investigation must ever to take into account population and user density
  • 33. Conclusions •Spatial and textual “complexity” parameters seems promising tools to help the assessment of data quality in specific area. •They main role is to quantify the amount but need to be linked with other areal metrics ( population & mappers density, OSM feature density). •The need is to define proper metrics linked to these parameters presented …...to create osm services as well. •Suggestions are welcome!
  • 34. Contacts Thank you! Contacts: Alfonso Crisci mail: : a.crisci@ibimet.cnr.italfcrisci@gmail.com @alfcrisci Download presentation http://www.slideshare.net/alfcrisci/sot-m-eu2014crisci
  • 35. Appendix Fractal dimension: measure of spatial complexity state A fractal dimension is a ratio providing a statistical index of complexity comparing how detail in a pattern (strictly speaking, a fractal pattern) changes with the scale at which it is measured. http://en.wikipedia.org/wiki/Fractal_Dimension Images Marco Diego DOMINIETTO ETH Zurich Multimodality Approach To Study The Fractal Physiology Of Tumor Angiogenesis Batty, M., and Longley, P. (1994). Fractal Cities: A Geometry of Form and Function, Academic Press, San Diego, CA, at www.fractalcities.org We need much better statistics that pertain to the different kinds of dynamics and their variation over time and space. (Batty,1994)