SlideShare une entreprise Scribd logo
1  sur  31
Télécharger pour lire hors ligne
Computing and Querying Topological Relations in Linked
Geographic Data
Using Strict, Approximate, and Metrically-Refined Topology
Blake Regalia1, Krzysztof Janowicz1, Grant McKenzie2
2019/07/09
1STKO Lab, University of California, Santa Barbara, USA
2Platial Analysis Lab, McGill University, Montreal, CA
blake.regalia@gmail.com
Figure 1 lod-cloud.net
The Web of Linked Open Data , aka the LOD cloud, is an open, interlinked
collection of cross-domain knowledge bases from governments, industries,
academic institutions and non-profit organizations.
blake.regalia@gmail.com
Figure 2 LOD-a-lot: lod-a-lot.lod.labs.vu.nl/
When cleaned, the cloud contains more than 28.3 billion unique
machine-readable statements derived from manual input, annotated datasets,
extracted texts, sensor observations, conceptual abstractions, and so on.
blake.regalia@gmail.com
Representations of concrete objects from the world such as persons and places
constitute a majority of central nodes within the multigraph, linking statements
and relations across datasets.
Figure 3 lod-cloud.net
DBpedia and GeoNames are the most central dataset hubs for geographic
data in the cloud. DBpedia alone:
“[...] currently describes 4.58 million things, [...] including 1,445,000
persons, 735,000 places (including 478,000 populated places)”
https://wiki.dbpedia.org/about/facts-figures
blake.regalia@gmail.com
Figure 4 DBpedia excerpt from phuzzy.link
blake.regalia@gmail.com
Figure 5 San Diego on OpenStreetMap
blake.regalia@gmail.com
Motivation
The majority of geographic identifiers on LOD cloud are represented geometrically
as point coordinates , severely limiting their potential for spatial analysis.
However, simply integrating more complex geometries into the LOD cloud will
be of limited use to the Geospatial Linked Data community because:
• Graph queries involving spatial operations on high-resolution geometries do
not scale well over large datasets.
• Geometries are merely a means to an end for spatial analysis, the proper
geometric representation of real-world entities varies by place type, scale,
and task.
• Spatial extensions to RDF triplestores, such as GeoSPARQL, compute
topology on-demand and without context , meaning that topology is
computed from the geometries alone; place types are ignored.
• Computing topology requires pre-processing steps to clean geometries
for errors such as sliver polygons, something not currently supported by
Linked Data based frameworks.
blake.regalia@gmail.com
Topology Matters, Metric Refines:
“In geographic space, topology is considered to be first-class information,
whereas metric properties, such as distances and shapes, are used as
refinements that are frequently less exactly captured.”
Egenhofer and Mark 1995b
blake.regalia@gmail.com
Problem Statement:
Following the slogan that “Topology Matters, Metric Refines”, knowledge graphs
will benefit from explicit topological relations in addition to complex
geometries and other place-specific properties.
Computing topology based on geometry alone is not sufficient in the context of
Linked Data for two reasons we focus on here:
1. Heterogeneous data sources and crowd-sourced datasets propagate
digitization errors along complex geometries, meaning that open data clouds
will always require pre-processing steps in order to compute topology.
2. Topology also depends on domain knowledge, vagueness and uncertainty
principles (Bennet, 2001).
blake.regalia@gmail.com
Research Objective:
Enrich the geographic LOD cloud with topological relations rather than
purely complex geometries.
Contribution 1:
Align OpenStreetMap features with DBpedia places to combine complex
geometries (polylines and polygons) with rich place type information .
blake.regalia@gmail.com
Knowledge Base Alignment:
Starting with all DBpedia places in the contiguous United States; select those
with the following place types and align them with their matching feature in
OpenStreetMap:
Polylines:
• Roadways
• Streams
Polygons:
• Cities
• Counties
• Parks
36,520 features in total .
blake.regalia@gmail.com
blake.regalia@gmail.com
Contribution 2:
Precompute and materialize strict, approximate, and metrically refined
topological relations between cleaned geometries using place type knowledge.
blake.regalia@gmail.com
Strict Toplogy:
Figure 6 Region Connection Calculus (RCC8)
blake.regalia@gmail.com
Approximate Toplogy:
Figure 7 Broad boundaries (Clementini et al.)
blake.regalia@gmail.com
Metrically Refined Toplogy:
Figure 8 Nine Splitting Measures (Egenhofer et al.)
blake.regalia@gmail.com
blake.regalia@gmail.com
The resulting topological relations dataset produces 120,681 distinct RDF
statements covering various topological relations between features of the
selected place types within the contiguous United States.
Putting these topological relations to use, we attempt to validate any existing
relations between places on DBpedia.
blake.regalia@gmail.com
Distribution of all place-place relation predicates:
1. 50% Adjacency: primary and inter-primary cardinal direction relations, as
well as “adjacent communities” linksets.
2. 48% Partonomy: dbo:isPartOf, dbo:largestCity, dbo:countySeat,
dbo:location predicates.
blake.regalia@gmail.com
1. Unverifiable: Relation is absent from our dataset
2. Verifiable: Relation exists in both datasets, but the topological relation we
observe does not align with the expected predicate.
3. Accurate: Relation exists in both datasets and the topological relation aligns
with the expected predicate.
4. Supplemental: Relation is absent from DBpedia; demonstrates volume of
our contribution to enriching the knowledge base.
blake.regalia@gmail.com
How does this compare to GeoSPARQL?
blake.regalia@gmail.com
Figure 9 Selecting bordering counties when topology must be computed on-demand creates a
combinatorial explosion. Our precomputed approach is optimized for graph queries since the
topological relations are already materialized in the triplestore.
blake.regalia@gmail.com
Figure 10 GeoSPARQL fails to capture any relations due to tiny sliver polygons. Our approach
captures all 8 externally connected relations and qualifies one as barely touches.
blake.regalia@gmail.com
How can topology be used on geospatial linked data in practice?
blake.regalia@gmail.com
Figure 11 SPARQL query asking: What persons of historical significance were born in a city along the
Mississippi River and then died in another city also along the river?
blake.regalia@gmail.com
blake.regalia@gmail.com
Topological Reasoning:
“Coastal cities” are a class of cities that have an adjacency relation to the ocean.
“Landlocked regions” are administrative regions that lack direct access to some
resource due to being entirely surrounded by other administrative regions.
“Accessibility” reasoning:
Parks that can be accessed via counties that are non-tangential proper parts to
their state boundary.
blake.regalia@gmail.com
Thank you
blake.regalia@gmail.com
blake.regalia@gmail.com
blake.regalia@gmail.com
blake.regalia@gmail.com

Contenu connexe

Tendances

NE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSISNE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSISrathnaarul
 
Distributed Link Prediction in Large Scale Graphs using Apache Spark
Distributed Link Prediction in Large Scale Graphs using Apache SparkDistributed Link Prediction in Large Scale Graphs using Apache Spark
Distributed Link Prediction in Large Scale Graphs using Apache SparkAnastasios Theodosiou
 
Learning Social Networks From Web Documents Using Support
Learning Social Networks From Web Documents Using SupportLearning Social Networks From Web Documents Using Support
Learning Social Networks From Web Documents Using Supportceya
 
APPLICATION OF CLUSTERING TO ANALYZE ACADEMIC SOCIAL NETWORKS
APPLICATION OF CLUSTERING TO ANALYZE ACADEMIC SOCIAL NETWORKSAPPLICATION OF CLUSTERING TO ANALYZE ACADEMIC SOCIAL NETWORKS
APPLICATION OF CLUSTERING TO ANALYZE ACADEMIC SOCIAL NETWORKSIJwest
 
A LINK-BASED APPROACH TO ENTITY RESOLUTION IN SOCIAL NETWORKS
A LINK-BASED APPROACH TO ENTITY RESOLUTION IN SOCIAL NETWORKSA LINK-BASED APPROACH TO ENTITY RESOLUTION IN SOCIAL NETWORKS
A LINK-BASED APPROACH TO ENTITY RESOLUTION IN SOCIAL NETWORKScsandit
 
Node XL - features and demo
Node XL - features and demoNode XL - features and demo
Node XL - features and demoMayank Mohan
 
An Efficient Modified Common Neighbor Approach for Link Prediction in Social ...
An Efficient Modified Common Neighbor Approach for Link Prediction in Social ...An Efficient Modified Common Neighbor Approach for Link Prediction in Social ...
An Efficient Modified Common Neighbor Approach for Link Prediction in Social ...IOSR Journals
 
2011 IEEE Social Computing Nodexl: Group-In-A-Box
2011 IEEE Social Computing Nodexl: Group-In-A-Box2011 IEEE Social Computing Nodexl: Group-In-A-Box
2011 IEEE Social Computing Nodexl: Group-In-A-BoxMarc Smith
 
NE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSISNE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSISrathnaarul
 
Cluster Based Web Search Using Support Vector Machine
Cluster Based Web Search Using Support Vector MachineCluster Based Web Search Using Support Vector Machine
Cluster Based Web Search Using Support Vector MachineCSCJournals
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network AnalysisFred Stutzman
 
Relation of Coffee Break and Productivity
Relation of Coffee Break and ProductivityRelation of Coffee Break and Productivity
Relation of Coffee Break and ProductivityJavaCoffeeIQ.com
 
A comparative study of social network analysis tools
A comparative study of social network analysis toolsA comparative study of social network analysis tools
A comparative study of social network analysis toolsDavid Combe
 
Community analysis using graph representation learning on social networks
Community analysis using graph representation learning on social networksCommunity analysis using graph representation learning on social networks
Community analysis using graph representation learning on social networksMarco Brambilla
 
The Mathematics of Social Network Analysis: Metrics for Academic Social Networks
The Mathematics of Social Network Analysis: Metrics for Academic Social NetworksThe Mathematics of Social Network Analysis: Metrics for Academic Social Networks
The Mathematics of Social Network Analysis: Metrics for Academic Social NetworksEditor IJCATR
 
Big Data Analytics : A Social Network Approach
Big Data Analytics : A Social Network ApproachBig Data Analytics : A Social Network Approach
Big Data Analytics : A Social Network ApproachAndry Alamsyah
 
2013 NodeXL Social Media Network Analysis
2013 NodeXL Social Media Network Analysis2013 NodeXL Social Media Network Analysis
2013 NodeXL Social Media Network AnalysisMarc Smith
 

Tendances (20)

NE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSISNE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSIS
 
Distributed Link Prediction in Large Scale Graphs using Apache Spark
Distributed Link Prediction in Large Scale Graphs using Apache SparkDistributed Link Prediction in Large Scale Graphs using Apache Spark
Distributed Link Prediction in Large Scale Graphs using Apache Spark
 
Learning Social Networks From Web Documents Using Support
Learning Social Networks From Web Documents Using SupportLearning Social Networks From Web Documents Using Support
Learning Social Networks From Web Documents Using Support
 
Ijetcas14 347
Ijetcas14 347Ijetcas14 347
Ijetcas14 347
 
APPLICATION OF CLUSTERING TO ANALYZE ACADEMIC SOCIAL NETWORKS
APPLICATION OF CLUSTERING TO ANALYZE ACADEMIC SOCIAL NETWORKSAPPLICATION OF CLUSTERING TO ANALYZE ACADEMIC SOCIAL NETWORKS
APPLICATION OF CLUSTERING TO ANALYZE ACADEMIC SOCIAL NETWORKS
 
A LINK-BASED APPROACH TO ENTITY RESOLUTION IN SOCIAL NETWORKS
A LINK-BASED APPROACH TO ENTITY RESOLUTION IN SOCIAL NETWORKSA LINK-BASED APPROACH TO ENTITY RESOLUTION IN SOCIAL NETWORKS
A LINK-BASED APPROACH TO ENTITY RESOLUTION IN SOCIAL NETWORKS
 
Node XL - features and demo
Node XL - features and demoNode XL - features and demo
Node XL - features and demo
 
An Efficient Modified Common Neighbor Approach for Link Prediction in Social ...
An Efficient Modified Common Neighbor Approach for Link Prediction in Social ...An Efficient Modified Common Neighbor Approach for Link Prediction in Social ...
An Efficient Modified Common Neighbor Approach for Link Prediction in Social ...
 
2011 IEEE Social Computing Nodexl: Group-In-A-Box
2011 IEEE Social Computing Nodexl: Group-In-A-Box2011 IEEE Social Computing Nodexl: Group-In-A-Box
2011 IEEE Social Computing Nodexl: Group-In-A-Box
 
Complex networks - Assortativity
Complex networks -  AssortativityComplex networks -  Assortativity
Complex networks - Assortativity
 
NE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSISNE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSIS
 
Cluster Based Web Search Using Support Vector Machine
Cluster Based Web Search Using Support Vector MachineCluster Based Web Search Using Support Vector Machine
Cluster Based Web Search Using Support Vector Machine
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network Analysis
 
Ppt
PptPpt
Ppt
 
Relation of Coffee Break and Productivity
Relation of Coffee Break and ProductivityRelation of Coffee Break and Productivity
Relation of Coffee Break and Productivity
 
A comparative study of social network analysis tools
A comparative study of social network analysis toolsA comparative study of social network analysis tools
A comparative study of social network analysis tools
 
Community analysis using graph representation learning on social networks
Community analysis using graph representation learning on social networksCommunity analysis using graph representation learning on social networks
Community analysis using graph representation learning on social networks
 
The Mathematics of Social Network Analysis: Metrics for Academic Social Networks
The Mathematics of Social Network Analysis: Metrics for Academic Social NetworksThe Mathematics of Social Network Analysis: Metrics for Academic Social Networks
The Mathematics of Social Network Analysis: Metrics for Academic Social Networks
 
Big Data Analytics : A Social Network Approach
Big Data Analytics : A Social Network ApproachBig Data Analytics : A Social Network Approach
Big Data Analytics : A Social Network Approach
 
2013 NodeXL Social Media Network Analysis
2013 NodeXL Social Media Network Analysis2013 NodeXL Social Media Network Analysis
2013 NodeXL Social Media Network Analysis
 

Similaire à Topological Relations in Linked Geographic Data

A comprehensive survey of link mining and anomalies detection
A comprehensive survey of link mining and anomalies detectionA comprehensive survey of link mining and anomalies detection
A comprehensive survey of link mining and anomalies detectioncsandit
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)ijceronline
 
Spatial association discovery process using frequent subgraph mining
Spatial association discovery process using frequent subgraph miningSpatial association discovery process using frequent subgraph mining
Spatial association discovery process using frequent subgraph miningTELKOMNIKA JOURNAL
 
Contextual Ontology Alignment - ESWC 2011
Contextual Ontology Alignment - ESWC 2011Contextual Ontology Alignment - ESWC 2011
Contextual Ontology Alignment - ESWC 2011Mariana Damova, Ph.D
 
Moon Landing or Safari? A Study of Systematic Errors and their Causes in Geog...
Moon Landing or Safari? A Study of Systematic Errors and their Causes in Geog...Moon Landing or Safari? A Study of Systematic Errors and their Causes in Geog...
Moon Landing or Safari? A Study of Systematic Errors and their Causes in Geog...Blake Regalia
 
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...What Are Links in Linked Open Data? A Characterization and Evaluation of Link...
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...Armin Haller
 
Proposal for nested document support in Lucene
Proposal for nested document support in LuceneProposal for nested document support in Lucene
Proposal for nested document support in LuceneMark Harwood
 
11.0004www.iiste.org call for paper.on demand quality of web services using r...
11.0004www.iiste.org call for paper.on demand quality of web services using r...11.0004www.iiste.org call for paper.on demand quality of web services using r...
11.0004www.iiste.org call for paper.on demand quality of web services using r...Alexander Decker
 
4.on demand quality of web services using ranking by multi criteria 31-35
4.on demand quality of web services using ranking by multi criteria 31-354.on demand quality of web services using ranking by multi criteria 31-35
4.on demand quality of web services using ranking by multi criteria 31-35Alexander Decker
 
Deep Learning for Graphs
Deep Learning for GraphsDeep Learning for Graphs
Deep Learning for GraphsDeepLearningBlr
 
Database Management System(UNIT 1)
Database Management System(UNIT 1)Database Management System(UNIT 1)
Database Management System(UNIT 1)SURBHI SAROHA
 
Database Management System(UNIT 1)
Database Management System(UNIT 1)Database Management System(UNIT 1)
Database Management System(UNIT 1)SURBHI SAROHA
 

Similaire à Topological Relations in Linked Geographic Data (20)

A comprehensive survey of link mining and anomalies detection
A comprehensive survey of link mining and anomalies detectionA comprehensive survey of link mining and anomalies detection
A comprehensive survey of link mining and anomalies detection
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)
 
Spatial association discovery process using frequent subgraph mining
Spatial association discovery process using frequent subgraph miningSpatial association discovery process using frequent subgraph mining
Spatial association discovery process using frequent subgraph mining
 
Contextual Ontology Alignment - ESWC 2011
Contextual Ontology Alignment - ESWC 2011Contextual Ontology Alignment - ESWC 2011
Contextual Ontology Alignment - ESWC 2011
 
Moon Landing or Safari? A Study of Systematic Errors and their Causes in Geog...
Moon Landing or Safari? A Study of Systematic Errors and their Causes in Geog...Moon Landing or Safari? A Study of Systematic Errors and their Causes in Geog...
Moon Landing or Safari? A Study of Systematic Errors and their Causes in Geog...
 
Poster Final
Poster FinalPoster Final
Poster Final
 
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...What Are Links in Linked Open Data? A Characterization and Evaluation of Link...
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...
 
Proposal for nested document support in Lucene
Proposal for nested document support in LuceneProposal for nested document support in Lucene
Proposal for nested document support in Lucene
 
11.0004www.iiste.org call for paper.on demand quality of web services using r...
11.0004www.iiste.org call for paper.on demand quality of web services using r...11.0004www.iiste.org call for paper.on demand quality of web services using r...
11.0004www.iiste.org call for paper.on demand quality of web services using r...
 
4.on demand quality of web services using ranking by multi criteria 31-35
4.on demand quality of web services using ranking by multi criteria 31-354.on demand quality of web services using ranking by multi criteria 31-35
4.on demand quality of web services using ranking by multi criteria 31-35
 
MODEL_FOR_SEMANTICALLY_RICH_POINT_CLOUD.pdf
MODEL_FOR_SEMANTICALLY_RICH_POINT_CLOUD.pdfMODEL_FOR_SEMANTICALLY_RICH_POINT_CLOUD.pdf
MODEL_FOR_SEMANTICALLY_RICH_POINT_CLOUD.pdf
 
Ngdm09 han gao
Ngdm09 han gaoNgdm09 han gao
Ngdm09 han gao
 
Spatial Data Model 2
Spatial Data Model 2Spatial Data Model 2
Spatial Data Model 2
 
Ijariie1184
Ijariie1184Ijariie1184
Ijariie1184
 
Ijariie1184
Ijariie1184Ijariie1184
Ijariie1184
 
Mujungi Davis
Mujungi DavisMujungi Davis
Mujungi Davis
 
Mujungi Davis
Mujungi DavisMujungi Davis
Mujungi Davis
 
Deep Learning for Graphs
Deep Learning for GraphsDeep Learning for Graphs
Deep Learning for Graphs
 
Database Management System(UNIT 1)
Database Management System(UNIT 1)Database Management System(UNIT 1)
Database Management System(UNIT 1)
 
Database Management System(UNIT 1)
Database Management System(UNIT 1)Database Management System(UNIT 1)
Database Management System(UNIT 1)
 

Dernier

CHROMATOGRAPHY PALLAVI RAWAT.pptx
CHROMATOGRAPHY  PALLAVI RAWAT.pptxCHROMATOGRAPHY  PALLAVI RAWAT.pptx
CHROMATOGRAPHY PALLAVI RAWAT.pptxpallavirawat456
 
Davis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologyDavis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologycaarthichand2003
 
Microteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringMicroteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringPrajakta Shinde
 
Manassas R - Parkside Middle School 🌎🏫
Manassas R - Parkside Middle School 🌎🏫Manassas R - Parkside Middle School 🌎🏫
Manassas R - Parkside Middle School 🌎🏫qfactory1
 
Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxpriyankatabhane
 
FREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naFREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naJASISJULIANOELYNV
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024AyushiRastogi48
 
trihybrid cross , test cross chi squares
trihybrid cross , test cross chi squarestrihybrid cross , test cross chi squares
trihybrid cross , test cross chi squaresusmanzain586
 
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...D. B. S. College Kanpur
 
Introduction of Human Body & Structure of cell.pptx
Introduction of Human Body & Structure of cell.pptxIntroduction of Human Body & Structure of cell.pptx
Introduction of Human Body & Structure of cell.pptxMedical College
 
GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024Jene van der Heide
 
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...Universidade Federal de Sergipe - UFS
 
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPirithiRaju
 
Harmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms PresentationHarmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms Presentationtahreemzahra82
 
Biological classification of plants with detail
Biological classification of plants with detailBiological classification of plants with detail
Biological classification of plants with detailhaiderbaloch3
 
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)Columbia Weather Systems
 
Four Spheres of the Earth Presentation.ppt
Four Spheres of the Earth Presentation.pptFour Spheres of the Earth Presentation.ppt
Four Spheres of the Earth Presentation.pptJoemSTuliba
 
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptxGENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptxRitchAndruAgustin
 
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptxECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptxmaryFF1
 

Dernier (20)

CHROMATOGRAPHY PALLAVI RAWAT.pptx
CHROMATOGRAPHY  PALLAVI RAWAT.pptxCHROMATOGRAPHY  PALLAVI RAWAT.pptx
CHROMATOGRAPHY PALLAVI RAWAT.pptx
 
Davis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologyDavis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technology
 
Microteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringMicroteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical Engineering
 
AZOTOBACTER AS BIOFERILIZER.PPTX
AZOTOBACTER AS BIOFERILIZER.PPTXAZOTOBACTER AS BIOFERILIZER.PPTX
AZOTOBACTER AS BIOFERILIZER.PPTX
 
Manassas R - Parkside Middle School 🌎🏫
Manassas R - Parkside Middle School 🌎🏫Manassas R - Parkside Middle School 🌎🏫
Manassas R - Parkside Middle School 🌎🏫
 
Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptx
 
FREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naFREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by na
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024
 
trihybrid cross , test cross chi squares
trihybrid cross , test cross chi squarestrihybrid cross , test cross chi squares
trihybrid cross , test cross chi squares
 
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
 
Introduction of Human Body & Structure of cell.pptx
Introduction of Human Body & Structure of cell.pptxIntroduction of Human Body & Structure of cell.pptx
Introduction of Human Body & Structure of cell.pptx
 
GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024
 
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
 
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
 
Harmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms PresentationHarmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms Presentation
 
Biological classification of plants with detail
Biological classification of plants with detailBiological classification of plants with detail
Biological classification of plants with detail
 
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
 
Four Spheres of the Earth Presentation.ppt
Four Spheres of the Earth Presentation.pptFour Spheres of the Earth Presentation.ppt
Four Spheres of the Earth Presentation.ppt
 
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptxGENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx
 
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptxECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
 

Topological Relations in Linked Geographic Data

  • 1. Computing and Querying Topological Relations in Linked Geographic Data Using Strict, Approximate, and Metrically-Refined Topology Blake Regalia1, Krzysztof Janowicz1, Grant McKenzie2 2019/07/09 1STKO Lab, University of California, Santa Barbara, USA 2Platial Analysis Lab, McGill University, Montreal, CA blake.regalia@gmail.com
  • 2. Figure 1 lod-cloud.net The Web of Linked Open Data , aka the LOD cloud, is an open, interlinked collection of cross-domain knowledge bases from governments, industries, academic institutions and non-profit organizations. blake.regalia@gmail.com
  • 3. Figure 2 LOD-a-lot: lod-a-lot.lod.labs.vu.nl/ When cleaned, the cloud contains more than 28.3 billion unique machine-readable statements derived from manual input, annotated datasets, extracted texts, sensor observations, conceptual abstractions, and so on. blake.regalia@gmail.com
  • 4. Representations of concrete objects from the world such as persons and places constitute a majority of central nodes within the multigraph, linking statements and relations across datasets. Figure 3 lod-cloud.net DBpedia and GeoNames are the most central dataset hubs for geographic data in the cloud. DBpedia alone: “[...] currently describes 4.58 million things, [...] including 1,445,000 persons, 735,000 places (including 478,000 populated places)” https://wiki.dbpedia.org/about/facts-figures blake.regalia@gmail.com
  • 5. Figure 4 DBpedia excerpt from phuzzy.link blake.regalia@gmail.com
  • 6. Figure 5 San Diego on OpenStreetMap blake.regalia@gmail.com
  • 7. Motivation The majority of geographic identifiers on LOD cloud are represented geometrically as point coordinates , severely limiting their potential for spatial analysis. However, simply integrating more complex geometries into the LOD cloud will be of limited use to the Geospatial Linked Data community because: • Graph queries involving spatial operations on high-resolution geometries do not scale well over large datasets. • Geometries are merely a means to an end for spatial analysis, the proper geometric representation of real-world entities varies by place type, scale, and task. • Spatial extensions to RDF triplestores, such as GeoSPARQL, compute topology on-demand and without context , meaning that topology is computed from the geometries alone; place types are ignored. • Computing topology requires pre-processing steps to clean geometries for errors such as sliver polygons, something not currently supported by Linked Data based frameworks. blake.regalia@gmail.com
  • 8. Topology Matters, Metric Refines: “In geographic space, topology is considered to be first-class information, whereas metric properties, such as distances and shapes, are used as refinements that are frequently less exactly captured.” Egenhofer and Mark 1995b blake.regalia@gmail.com
  • 9. Problem Statement: Following the slogan that “Topology Matters, Metric Refines”, knowledge graphs will benefit from explicit topological relations in addition to complex geometries and other place-specific properties. Computing topology based on geometry alone is not sufficient in the context of Linked Data for two reasons we focus on here: 1. Heterogeneous data sources and crowd-sourced datasets propagate digitization errors along complex geometries, meaning that open data clouds will always require pre-processing steps in order to compute topology. 2. Topology also depends on domain knowledge, vagueness and uncertainty principles (Bennet, 2001). blake.regalia@gmail.com
  • 10. Research Objective: Enrich the geographic LOD cloud with topological relations rather than purely complex geometries. Contribution 1: Align OpenStreetMap features with DBpedia places to combine complex geometries (polylines and polygons) with rich place type information . blake.regalia@gmail.com
  • 11. Knowledge Base Alignment: Starting with all DBpedia places in the contiguous United States; select those with the following place types and align them with their matching feature in OpenStreetMap: Polylines: • Roadways • Streams Polygons: • Cities • Counties • Parks 36,520 features in total . blake.regalia@gmail.com
  • 13. Contribution 2: Precompute and materialize strict, approximate, and metrically refined topological relations between cleaned geometries using place type knowledge. blake.regalia@gmail.com
  • 14. Strict Toplogy: Figure 6 Region Connection Calculus (RCC8) blake.regalia@gmail.com
  • 15. Approximate Toplogy: Figure 7 Broad boundaries (Clementini et al.) blake.regalia@gmail.com
  • 16. Metrically Refined Toplogy: Figure 8 Nine Splitting Measures (Egenhofer et al.) blake.regalia@gmail.com
  • 18. The resulting topological relations dataset produces 120,681 distinct RDF statements covering various topological relations between features of the selected place types within the contiguous United States. Putting these topological relations to use, we attempt to validate any existing relations between places on DBpedia. blake.regalia@gmail.com
  • 19. Distribution of all place-place relation predicates: 1. 50% Adjacency: primary and inter-primary cardinal direction relations, as well as “adjacent communities” linksets. 2. 48% Partonomy: dbo:isPartOf, dbo:largestCity, dbo:countySeat, dbo:location predicates. blake.regalia@gmail.com
  • 20. 1. Unverifiable: Relation is absent from our dataset 2. Verifiable: Relation exists in both datasets, but the topological relation we observe does not align with the expected predicate. 3. Accurate: Relation exists in both datasets and the topological relation aligns with the expected predicate. 4. Supplemental: Relation is absent from DBpedia; demonstrates volume of our contribution to enriching the knowledge base. blake.regalia@gmail.com
  • 21. How does this compare to GeoSPARQL? blake.regalia@gmail.com
  • 22. Figure 9 Selecting bordering counties when topology must be computed on-demand creates a combinatorial explosion. Our precomputed approach is optimized for graph queries since the topological relations are already materialized in the triplestore. blake.regalia@gmail.com
  • 23. Figure 10 GeoSPARQL fails to capture any relations due to tiny sliver polygons. Our approach captures all 8 externally connected relations and qualifies one as barely touches. blake.regalia@gmail.com
  • 24. How can topology be used on geospatial linked data in practice? blake.regalia@gmail.com
  • 25. Figure 11 SPARQL query asking: What persons of historical significance were born in a city along the Mississippi River and then died in another city also along the river? blake.regalia@gmail.com
  • 27. Topological Reasoning: “Coastal cities” are a class of cities that have an adjacency relation to the ocean. “Landlocked regions” are administrative regions that lack direct access to some resource due to being entirely surrounded by other administrative regions. “Accessibility” reasoning: Parks that can be accessed via counties that are non-tangential proper parts to their state boundary. blake.regalia@gmail.com