SlideShare une entreprise Scribd logo
1  sur  31
Télécharger pour lire hors ligne
AGO (Awesemantic-Geo): Towards a More Efficient Paradigm of
Storing and Querying Spatial Data on the Semantic Web
ESIP SemTech Telecon - 2019-05-28
Blake Regalia1
2019/05/28
1STKO Lab, University of California, Santa Barbara, USA
blake.regalia@gmail.com
On the Web of Linked Data:
Geometry data beyond simple points and bounding boxes (e.g., multi/polylines,
multi/polygons, collections, etc.) should not always be encoded using RDF
Literals for several reasons including readability, performance, and reusability.
Instead, geometric representations of features should be referenced using RDF
Named Nodes (i.e., IRIs – specifically, URLs) that support content-negotiation
for the raw geometry data.
Geometric attributes and topological relations should be precomputed and
materialized as RDF in the triplestore alongside the source data.
blake.regalia@gmail.com
The W3C Basic Geo Vocabulary, circa 2003, set out to:
“[explore] the possibilities of representing mapping/location data in RDF, and does not
attempt to address many of the issues covered in the professional GIS world”
Established a simple, minimalistic vocabulary for describing points with a latitude
and longitude value using the WGS84 reference datum.
Considered good enough for the basic needs of many!
blake.regalia@gmail.com
Resource about London Heathrow Airport using W3C Basic Geo in 2003-01-10.
blake.regalia@gmail.com
Needs of geospatial data not met by W3C Basic Geo:
• Support for Coordinate Reference Systems (CRS)
• Geometries beyond Point such as polylines, polygons, etc.
• Separation of entity and geometry, which have cardinality of 1-to-many
• Merging of geometries (would result in cross product of lat/lngs)
blake.regalia@gmail.com
‘NeoGeo’ drafted a vocabulary that extended W3C Basic Geo to support more
geometry types. It encoded a geometry’s entire structure in RDF:
blake.regalia@gmail.com
1 @prefix ngeo: <http://geovocab.org/geometry#> .
2
3 _:polygon rdf:type ngeo:Polygon ;
4 ngeo:exterior [
5 rdf:type ngeo:LinearRing ;
6 ngeo:posList (
7 [ geo:lat -29; geo:long 16 ]
8 [ geo:lat -28; geo:long 33 ]
9 [ geo:lat -34; geo:long 27 ]
10 [ geo:lat -29; geo:long 16 ]
11 )
12 ] ;
13 ngeo:interior [
14 rdf:type ngeo:LinearRing ;
15 ngeo:posList (
16 [ geo:lat -29.5; geo:long 27 ]
17 [ geo:lat -28.5; geo:long 28.5 ]
18 [ geo:lat -31; geo:long 28 ]
19 [ geo:lat -29.5; geo:long 27 ]
20 )
21 ] .
blake.regalia@gmail.com
1 dbr:Perth rdfs:label "Perth"@en ;
2 geosparql:hasGeometry [
3 geosparql:asWKT "<http://www.opengis.net/def/crs/EPSG/0/4326>
POINT(115.85888671875
31.952222824097)"^^geosparql:wktLiteral
→
→
4 ] ;
GeoSPARQL addressed all the issues because it:
• separated places from their geometric representations via a blank node
• joined CRS, latitude and longitude values (in fact the entire geometry) into a
single RDF literal.
blake.regalia@gmail.com
Geometries from the Web are dirty, especially given the heterogenous nature of Linked Data.
This makes querying topology not so straightforward as GeoSPARQL intends:
blake.regalia@gmail.com
Computing topology on demand (e.g., using GeoSPARQL functions) can be very
expensive given complex geometries:
blake.regalia@gmail.com
The National Map
Figure 1 https://viewer.nationalmap.gov/basic/
blake.regalia@gmail.com
The National Map as Linked Data
Convert National Map datasets (e.g., GNIS, NHD, Transportation data, and so
on) to Linked Data.
DLG data consists of many high-resolution geometries; multi/polylines and
multi/polygons.
Starting with the NHD alone, consists of more than 100 GB of geometry data in
binary format.
Complex geometries pose a challenge to computing spatial queries on-demand
(e.g., querying topology across all feature)
blake.regalia@gmail.com
Figure 2 National Hydrology Dataset (NHD) coverage for California
blake.regalia@gmail.com
In addition to storing the raw textual data of an RDF Literal (including all
whitespaces, order of coordinates, etc.), GeoSPARQL-enabled triplestores will
also perform spatial indexing, often creating a copy of the geometries in a
binary format.
The National Hydrology Dataset (NHD) geometries for California take up 4.0 GB in
binary format. To be GeoSPARQL compatible, this dataset alone requires 10.1
GB, an approx 2.5x increase in storage requirement.
blake.regalia@gmail.com
GeoSPARQL explicitly allows geometry data to be stored in RDF Literals as either
WKT or GML (two human-readable formats).
However, beyond simple points and bounding boxes, these geometries are not
human-readable:
blake.regalia@gmail.com
Shortcomings of using RDF Literals to store geometry data:
• Geometries have interesting attributes that are static and easily
precomputed, but RDF Literals cannot store properties (i.e., they are leaf
nodes)
• RDF Literals are not linkable across the Web of Linked Data, making them
less reusable
• Client applications are able to make use of binary formats but are forced
to cope with less efficient formats required by GeoSPARQL
• Server-side triplestores that support GeoSPARQL will usually store a
redundant copy of the geometry in a binary format anyway for spatial
operations
blake.regalia@gmail.com
Raw Geometry in RDF
Storing human-readable serializations for geometry:
• Because of duplication and overhead, consumes approximately 2.5 times the amount
of storage space as binary
• Does little for transparency since long strings of coordinates are not human-readable
• As a data format, does not facilitate spatial querying; systems rely on copies of
geometry in binary formats
• Are not optimal for storage/transmission (e.g., web clients)
Furthermore, using RDF Literals:
• Obstructs the ability to include metadata about the geometry itself (cannot assign
properties to RDF Literals)
• Prevents the reuse of geometries within a dataset (e.g., cannot create composite
geometries)
• Limits the interlinking of geometries across the Web of Linked Data (Bob cannot
simply reference geometries in Alice’s dataset)
blake.regalia@gmail.com
1 osm:relation_2316598
2 rdfs:label "Western Australia"@en ;
3 geosparql:hasGeometry [
4 geosparql:asWKT
"<http://www.opengis.net/def/crs/EPSG/0/4326>
POLYGON((128.9999986 -14.4290140, 128.9999714
-14.8798443, ...))"^^geosparql:wktLiteral
→
→
→
5 ] ;
6 # instead...
7
blake.regalia@gmail.com
1 osm:relation_2316598
2 rdfs:label "Western Australia"@en ;
3 geosparql:hasGeometry [
4 geosparql:asWKT
‘<http://www.opengis.net/def/crs/EPSG/0/4326>
POLYGON((128.9999986 -14.4290140, 128.9999714
-14.8798443, ...)) geosparql:wktLiteral
→
→
→
5 ] ;
6 # instead... get rid of blank node and use URI
7 ago:hasGeometry ex:WesternAustraliaPolygon ;
blake.regalia@gmail.com
Our approach (nicknamed AGO) is to:
• Eliminate the need to store human-readable representations of geometries
beyond simple points and bounding boxes
• Require each geometry is represented using an IRI (i.e., an RDF Named
Node)
• Those IRIs should be URLs that support content-negotiation to dereference
a geometry for either (a) it’s RDF metadata or (b) the raw geometry data in a
format requested by the client and supported by the server.
blake.regalia@gmail.com
Additionally, we recommend:
• Precompute (static) geometric attributes and materialize them for each
geometry (area, perimeter, centroid, etc.)
• Leverage geometry data to aid in precomputing topological relations
between features (using OWA) and materialize them on the features
blake.regalia@gmail.com
Attributes of a geometry are easily accessible:
blake.regalia@gmail.com
What a query looks like using AGO:
blake.regalia@gmail.com
Compatibility:
• Allow geometry data to persist in a spatially-indexed geodatabase,
accessible to triplestore for on-demand spatial querying (e.g., for
computing distances, unions, intersections, etc.) and accessible to client by
virtue of dereferencing server
• Remains compatible with GeoSPARQL query functions in practice!
blake.regalia@gmail.com
blake.regalia@gmail.com
blake.regalia@gmail.com
blake.regalia@gmail.com
blake.regalia@gmail.com
Content-Negotiation for Geometry Data
Dereference the IRI of a geometry to fetch its data in a variety of formats.
curl "https://usgs.link/geometry/feature?id=42" -H "Accept:$MIME_TYPE"
MIME Type Description Returns
text/html Web interface <!DOCTYPE html><html lang="en">...
text/plain Well-Known Text POLYGON((113.1016 -38.062 ...))
application/gml+xml GML <gml:Polygon><gml:Exterior>...
application/vnd.geo+json GeoJSON {"type":"Polygon","coordinates":...}
application/octet-stream Well-Known Binary 01 06 00 00 20 E6 10 00 00 01...
Client web applications may choose a format that streamlines downloading and rendering
geometries on a map.
blake.regalia@gmail.com
In summary, a comparison of strategies to storing and using geometry:
Trait GeoSPARQL NeoGeo AGO
Efficient geometry storage 
Geometry can persist externally 1 
Content-negotiation for geometry format  
Uniform RDF structure  
Composite geometries  
Determine geometry type 2   
Access bounding box 2 
Access raw geometry 2  
1 = Geometry can persist in a local geodatabase or even on a remote system and without copies.
2 = From the triples’ RDF data alone (e.g., without using SPARQL).
blake.regalia@gmail.com
Publications
Regalia, B., Janowicz, K., and McKenzie, G. (2019) Computing and Querying
Strict, Approximate, and Metrically-Refined Topological Relations in Linked
Geographic Data. Transactions in GIS, 2019
Regalia, B., Janowicz, K., Mai, G., Varanka, D., and Usery, L. E.. (2018)
GNIS-LD: Serving and Visualizing the Geographic Names Information
System As Linked Data. ESWC 2018
Regalia, B., Janowicz, K., and McKenzie, G. (2017) Revisiting the
Representation of and Need for Raw Geometries on the Linked Data Web.
LDOW@ WWW 2017
Regalia, B., Janowicz, K. and Gao, S. (2016) VOLT: A Provenance Producing,
Transparent SPARQL Proxy for the On-Demand Computation of Linked Data
and its Applications to Spatiotemporally Dependent Data. ESWC 2016
blake.regalia@gmail.com

Contenu connexe

Tendances

Incremental Export of Relational Database Contents into RDF Graphs
Incremental Export of Relational Database Contents into RDF GraphsIncremental Export of Relational Database Contents into RDF Graphs
Incremental Export of Relational Database Contents into RDF GraphsNikolaos Konstantinou
 
LIFE GreenYourMove Project - GTFS data
LIFE GreenYourMove Project - GTFS data LIFE GreenYourMove Project - GTFS data
LIFE GreenYourMove Project - GTFS data LIFE GreenYourMove
 
GRDDL: The Why, What, How, and Where
GRDDL: The Why, What, How, and WhereGRDDL: The Why, What, How, and Where
GRDDL: The Why, What, How, and WhereChimezie Ogbuji
 
Exchange and Consumption of Huge RDF Data
Exchange and Consumption of Huge RDF DataExchange and Consumption of Huge RDF Data
Exchange and Consumption of Huge RDF DataMario Arias
 
Eswc2012 presentation: Supporting Linked Data Production for Cultural Heritag...
Eswc2012 presentation: Supporting Linked Data Production for Cultural Heritag...Eswc2012 presentation: Supporting Linked Data Production for Cultural Heritag...
Eswc2012 presentation: Supporting Linked Data Production for Cultural Heritag...Victor de Boer
 
Geospatial Querying in Apache Marmotta - Apache Big Data North America 2016
Geospatial Querying in Apache Marmotta -  Apache Big Data North America 2016Geospatial Querying in Apache Marmotta -  Apache Big Data North America 2016
Geospatial Querying in Apache Marmotta - Apache Big Data North America 2016Sergio Fernández
 
Linked Data Usecases
Linked Data UsecasesLinked Data Usecases
Linked Data UsecasesMyungjin Lee
 
Querying the Web of Data
Querying the Web of DataQuerying the Web of Data
Querying the Web of DataRinke Hoekstra
 
morph-LDP: An R2RML-based Linked Data Platform implementation
morph-LDP: An R2RML-based Linked Data Platform implementationmorph-LDP: An R2RML-based Linked Data Platform implementation
morph-LDP: An R2RML-based Linked Data Platform implementationNandana Mihindukulasooriya
 
Ukgovld registry-intro
Ukgovld registry-introUkgovld registry-intro
Ukgovld registry-introDave Reynolds
 
Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015
Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015
Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015Sergio Fernández
 
Exchanging the Status between Clients of Geospatial Web Services and GIS appl...
Exchanging the Status between Clients of Geospatial Web Services and GIS appl...Exchanging the Status between Clients of Geospatial Web Services and GIS appl...
Exchanging the Status between Clients of Geospatial Web Services and GIS appl...Paula Díaz
 
Efficient Record De-Duplication Identifying Using Febrl Framework
Efficient Record De-Duplication Identifying Using Febrl FrameworkEfficient Record De-Duplication Identifying Using Febrl Framework
Efficient Record De-Duplication Identifying Using Febrl FrameworkIOSR Journals
 
The Semantic Web #9 - Web Ontology Language (OWL)
The Semantic Web #9 - Web Ontology Language (OWL)The Semantic Web #9 - Web Ontology Language (OWL)
The Semantic Web #9 - Web Ontology Language (OWL)Myungjin Lee
 

Tendances (20)

Incremental Export of Relational Database Contents into RDF Graphs
Incremental Export of Relational Database Contents into RDF GraphsIncremental Export of Relational Database Contents into RDF Graphs
Incremental Export of Relational Database Contents into RDF Graphs
 
LIFE GreenYourMove Project - GTFS data
LIFE GreenYourMove Project - GTFS data LIFE GreenYourMove Project - GTFS data
LIFE GreenYourMove Project - GTFS data
 
Technical Background
Technical BackgroundTechnical Background
Technical Background
 
GRDDL: The Why, What, How, and Where
GRDDL: The Why, What, How, and WhereGRDDL: The Why, What, How, and Where
GRDDL: The Why, What, How, and Where
 
Exchange and Consumption of Huge RDF Data
Exchange and Consumption of Huge RDF DataExchange and Consumption of Huge RDF Data
Exchange and Consumption of Huge RDF Data
 
Compact Representation of Large RDF Data Sets for Publishing and Exchange
Compact Representation of Large RDF Data Sets for Publishing and ExchangeCompact Representation of Large RDF Data Sets for Publishing and Exchange
Compact Representation of Large RDF Data Sets for Publishing and Exchange
 
Eswc2012 presentation: Supporting Linked Data Production for Cultural Heritag...
Eswc2012 presentation: Supporting Linked Data Production for Cultural Heritag...Eswc2012 presentation: Supporting Linked Data Production for Cultural Heritag...
Eswc2012 presentation: Supporting Linked Data Production for Cultural Heritag...
 
Geospatial Querying in Apache Marmotta - Apache Big Data North America 2016
Geospatial Querying in Apache Marmotta -  Apache Big Data North America 2016Geospatial Querying in Apache Marmotta -  Apache Big Data North America 2016
Geospatial Querying in Apache Marmotta - Apache Big Data North America 2016
 
Linked Data Usecases
Linked Data UsecasesLinked Data Usecases
Linked Data Usecases
 
Binary RDF for Scalable Publishing, Exchanging and Consumption in the Web of ...
Binary RDF for Scalable Publishing, Exchanging and Consumption in the Web of ...Binary RDF for Scalable Publishing, Exchanging and Consumption in the Web of ...
Binary RDF for Scalable Publishing, Exchanging and Consumption in the Web of ...
 
Querying the Web of Data
Querying the Web of DataQuerying the Web of Data
Querying the Web of Data
 
Statistical Linked Data
Statistical Linked DataStatistical Linked Data
Statistical Linked Data
 
morph-LDP: An R2RML-based Linked Data Platform implementation
morph-LDP: An R2RML-based Linked Data Platform implementationmorph-LDP: An R2RML-based Linked Data Platform implementation
morph-LDP: An R2RML-based Linked Data Platform implementation
 
Ukgovld registry-intro
Ukgovld registry-introUkgovld registry-intro
Ukgovld registry-intro
 
Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015
Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015
Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015
 
Exchanging the Status between Clients of Geospatial Web Services and GIS appl...
Exchanging the Status between Clients of Geospatial Web Services and GIS appl...Exchanging the Status between Clients of Geospatial Web Services and GIS appl...
Exchanging the Status between Clients of Geospatial Web Services and GIS appl...
 
Efficient Record De-Duplication Identifying Using Febrl Framework
Efficient Record De-Duplication Identifying Using Febrl FrameworkEfficient Record De-Duplication Identifying Using Febrl Framework
Efficient Record De-Duplication Identifying Using Febrl Framework
 
Unit3 slides
Unit3 slidesUnit3 slides
Unit3 slides
 
The Semantic Web #9 - Web Ontology Language (OWL)
The Semantic Web #9 - Web Ontology Language (OWL)The Semantic Web #9 - Web Ontology Language (OWL)
The Semantic Web #9 - Web Ontology Language (OWL)
 
p27
p27p27
p27
 

Similaire à Towards a More Efficient Paradigm of Storing and Querying Spatial Data on the Semantic Web

Revisiting the Representation of and Need for Raw Geometries on the Linked Da...
Revisiting the Representation of and Need for Raw Geometries on the Linked Da...Revisiting the Representation of and Need for Raw Geometries on the Linked Da...
Revisiting the Representation of and Need for Raw Geometries on the Linked Da...Blake Regalia
 
Going for GOLD - Adventures in Open Linked Geospatial Metadata
Going for GOLD - Adventures in Open Linked Geospatial MetadataGoing for GOLD - Adventures in Open Linked Geospatial Metadata
Going for GOLD - Adventures in Open Linked Geospatial MetadataEDINA, University of Edinburgh
 
Graph databases & data integration v2
Graph databases & data integration v2Graph databases & data integration v2
Graph databases & data integration v2Dimitris Kontokostas
 
Bringing the Semantic Web closer to reality: PostgreSQL as RDF Graph Database
Bringing the Semantic Web closer to reality: PostgreSQL as RDF Graph DatabaseBringing the Semantic Web closer to reality: PostgreSQL as RDF Graph Database
Bringing the Semantic Web closer to reality: PostgreSQL as RDF Graph DatabaseJimmy Angelakos
 
Comparing Vocabularies for Representing Geographical Features and Their Geometry
Comparing Vocabularies for Representing Geographical Features and Their GeometryComparing Vocabularies for Representing Geographical Features and Their Geometry
Comparing Vocabularies for Representing Geographical Features and Their GeometryGhislain Atemezing
 
Triplificating and linking XBRL financial data
Triplificating and linking XBRL financial dataTriplificating and linking XBRL financial data
Triplificating and linking XBRL financial dataRoberto García
 
RDF4U: RDF Graph Visualization by Interpreting Linked Data as Knowledge
RDF4U: RDF Graph Visualization by Interpreting Linked Data as KnowledgeRDF4U: RDF Graph Visualization by Interpreting Linked Data as Knowledge
RDF4U: RDF Graph Visualization by Interpreting Linked Data as KnowledgeRathachai Chawuthai
 
Geographica: A Benchmark for Geospatial RDF Stores
Geographica: A Benchmark for Geospatial RDF StoresGeographica: A Benchmark for Geospatial RDF Stores
Geographica: A Benchmark for Geospatial RDF StoresKostis Kyzirakos
 
GNIS-LD: Serving and Visualizing the Geographic Names Information System Gaze...
GNIS-LD: Serving and Visualizing the Geographic Names Information System Gaze...GNIS-LD: Serving and Visualizing the Geographic Names Information System Gaze...
GNIS-LD: Serving and Visualizing the Geographic Names Information System Gaze...Blake Regalia
 
NAPSG 2010 Fire/EMS Conference - Data Sharing Basics
NAPSG 2010 Fire/EMS Conference - Data Sharing BasicsNAPSG 2010 Fire/EMS Conference - Data Sharing Basics
NAPSG 2010 Fire/EMS Conference - Data Sharing Basicspdituri
 
JSON-LD update DC 2017
JSON-LD update DC 2017JSON-LD update DC 2017
JSON-LD update DC 2017Gregg Kellogg
 
LinkedGeoData and GeoKnow
LinkedGeoData and GeoKnowLinkedGeoData and GeoKnow
LinkedGeoData and GeoKnowgeoknow
 
Find your way in Graph labyrinths
Find your way in Graph labyrinthsFind your way in Graph labyrinths
Find your way in Graph labyrinthsDaniel Camarda
 
Graph databases & data integration - the case of RDF
Graph databases & data integration - the case of RDFGraph databases & data integration - the case of RDF
Graph databases & data integration - the case of RDFDimitris Kontokostas
 
Generating Executable Mappings from RDF Data Cube Data Structure Definitions
Generating Executable Mappings from RDF Data Cube Data Structure DefinitionsGenerating Executable Mappings from RDF Data Cube Data Structure Definitions
Generating Executable Mappings from RDF Data Cube Data Structure DefinitionsChristophe Debruyne
 
Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.Enrico Daga
 
Advances in gml for geospatial applications slide
Advances in gml for geospatial applications slideAdvances in gml for geospatial applications slide
Advances in gml for geospatial applications slideVittoriano Muttillo
 
Semantic RDF based integration framework for heterogeneous XML data sources
Semantic RDF based integration framework for heterogeneous XML data sourcesSemantic RDF based integration framework for heterogeneous XML data sources
Semantic RDF based integration framework for heterogeneous XML data sourcesDeniz Kılınç
 
Geographica: A Benchmark for Geospatial RDF Stores - ISWC 2013
Geographica: A Benchmark for Geospatial RDF Stores - ISWC 2013Geographica: A Benchmark for Geospatial RDF Stores - ISWC 2013
Geographica: A Benchmark for Geospatial RDF Stores - ISWC 2013Kostis Kyzirakos
 

Similaire à Towards a More Efficient Paradigm of Storing and Querying Spatial Data on the Semantic Web (20)

Revisiting the Representation of and Need for Raw Geometries on the Linked Da...
Revisiting the Representation of and Need for Raw Geometries on the Linked Da...Revisiting the Representation of and Need for Raw Geometries on the Linked Da...
Revisiting the Representation of and Need for Raw Geometries on the Linked Da...
 
Going for GOLD - Adventures in Open Linked Geospatial Metadata
Going for GOLD - Adventures in Open Linked Geospatial MetadataGoing for GOLD - Adventures in Open Linked Geospatial Metadata
Going for GOLD - Adventures in Open Linked Geospatial Metadata
 
Graph databases & data integration v2
Graph databases & data integration v2Graph databases & data integration v2
Graph databases & data integration v2
 
Bringing the Semantic Web closer to reality: PostgreSQL as RDF Graph Database
Bringing the Semantic Web closer to reality: PostgreSQL as RDF Graph DatabaseBringing the Semantic Web closer to reality: PostgreSQL as RDF Graph Database
Bringing the Semantic Web closer to reality: PostgreSQL as RDF Graph Database
 
Comparing Vocabularies for Representing Geographical Features and Their Geometry
Comparing Vocabularies for Representing Geographical Features and Their GeometryComparing Vocabularies for Representing Geographical Features and Their Geometry
Comparing Vocabularies for Representing Geographical Features and Their Geometry
 
Triplificating and linking XBRL financial data
Triplificating and linking XBRL financial dataTriplificating and linking XBRL financial data
Triplificating and linking XBRL financial data
 
JSON-LD Update
JSON-LD UpdateJSON-LD Update
JSON-LD Update
 
RDF4U: RDF Graph Visualization by Interpreting Linked Data as Knowledge
RDF4U: RDF Graph Visualization by Interpreting Linked Data as KnowledgeRDF4U: RDF Graph Visualization by Interpreting Linked Data as Knowledge
RDF4U: RDF Graph Visualization by Interpreting Linked Data as Knowledge
 
Geographica: A Benchmark for Geospatial RDF Stores
Geographica: A Benchmark for Geospatial RDF StoresGeographica: A Benchmark for Geospatial RDF Stores
Geographica: A Benchmark for Geospatial RDF Stores
 
GNIS-LD: Serving and Visualizing the Geographic Names Information System Gaze...
GNIS-LD: Serving and Visualizing the Geographic Names Information System Gaze...GNIS-LD: Serving and Visualizing the Geographic Names Information System Gaze...
GNIS-LD: Serving and Visualizing the Geographic Names Information System Gaze...
 
NAPSG 2010 Fire/EMS Conference - Data Sharing Basics
NAPSG 2010 Fire/EMS Conference - Data Sharing BasicsNAPSG 2010 Fire/EMS Conference - Data Sharing Basics
NAPSG 2010 Fire/EMS Conference - Data Sharing Basics
 
JSON-LD update DC 2017
JSON-LD update DC 2017JSON-LD update DC 2017
JSON-LD update DC 2017
 
LinkedGeoData and GeoKnow
LinkedGeoData and GeoKnowLinkedGeoData and GeoKnow
LinkedGeoData and GeoKnow
 
Find your way in Graph labyrinths
Find your way in Graph labyrinthsFind your way in Graph labyrinths
Find your way in Graph labyrinths
 
Graph databases & data integration - the case of RDF
Graph databases & data integration - the case of RDFGraph databases & data integration - the case of RDF
Graph databases & data integration - the case of RDF
 
Generating Executable Mappings from RDF Data Cube Data Structure Definitions
Generating Executable Mappings from RDF Data Cube Data Structure DefinitionsGenerating Executable Mappings from RDF Data Cube Data Structure Definitions
Generating Executable Mappings from RDF Data Cube Data Structure Definitions
 
Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.
 
Advances in gml for geospatial applications slide
Advances in gml for geospatial applications slideAdvances in gml for geospatial applications slide
Advances in gml for geospatial applications slide
 
Semantic RDF based integration framework for heterogeneous XML data sources
Semantic RDF based integration framework for heterogeneous XML data sourcesSemantic RDF based integration framework for heterogeneous XML data sources
Semantic RDF based integration framework for heterogeneous XML data sources
 
Geographica: A Benchmark for Geospatial RDF Stores - ISWC 2013
Geographica: A Benchmark for Geospatial RDF Stores - ISWC 2013Geographica: A Benchmark for Geospatial RDF Stores - ISWC 2013
Geographica: A Benchmark for Geospatial RDF Stores - ISWC 2013
 

Dernier

Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfnehabiju2046
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptMAESTRELLAMesa2
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxAleenaTreesaSaji
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxAArockiyaNisha
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Nistarini College, Purulia (W.B) India
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisDiwakar Mishra
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real timeSatoshi NAKAHIRA
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |aasikanpl
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
Caco-2 cell permeability assay for drug absorption
Caco-2 cell permeability assay for drug absorptionCaco-2 cell permeability assay for drug absorption
Caco-2 cell permeability assay for drug absorptionPriyansha Singh
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCEPRINCE C P
 

Dernier (20)

Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdf
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.ppt
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptx
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real time
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
Caco-2 cell permeability assay for drug absorption
Caco-2 cell permeability assay for drug absorptionCaco-2 cell permeability assay for drug absorption
Caco-2 cell permeability assay for drug absorption
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
 

Towards a More Efficient Paradigm of Storing and Querying Spatial Data on the Semantic Web

  • 1. AGO (Awesemantic-Geo): Towards a More Efficient Paradigm of Storing and Querying Spatial Data on the Semantic Web ESIP SemTech Telecon - 2019-05-28 Blake Regalia1 2019/05/28 1STKO Lab, University of California, Santa Barbara, USA blake.regalia@gmail.com
  • 2. On the Web of Linked Data: Geometry data beyond simple points and bounding boxes (e.g., multi/polylines, multi/polygons, collections, etc.) should not always be encoded using RDF Literals for several reasons including readability, performance, and reusability. Instead, geometric representations of features should be referenced using RDF Named Nodes (i.e., IRIs – specifically, URLs) that support content-negotiation for the raw geometry data. Geometric attributes and topological relations should be precomputed and materialized as RDF in the triplestore alongside the source data. blake.regalia@gmail.com
  • 3. The W3C Basic Geo Vocabulary, circa 2003, set out to: “[explore] the possibilities of representing mapping/location data in RDF, and does not attempt to address many of the issues covered in the professional GIS world” Established a simple, minimalistic vocabulary for describing points with a latitude and longitude value using the WGS84 reference datum. Considered good enough for the basic needs of many! blake.regalia@gmail.com
  • 4. Resource about London Heathrow Airport using W3C Basic Geo in 2003-01-10. blake.regalia@gmail.com
  • 5. Needs of geospatial data not met by W3C Basic Geo: • Support for Coordinate Reference Systems (CRS) • Geometries beyond Point such as polylines, polygons, etc. • Separation of entity and geometry, which have cardinality of 1-to-many • Merging of geometries (would result in cross product of lat/lngs) blake.regalia@gmail.com
  • 6. ‘NeoGeo’ drafted a vocabulary that extended W3C Basic Geo to support more geometry types. It encoded a geometry’s entire structure in RDF: blake.regalia@gmail.com
  • 7. 1 @prefix ngeo: <http://geovocab.org/geometry#> . 2 3 _:polygon rdf:type ngeo:Polygon ; 4 ngeo:exterior [ 5 rdf:type ngeo:LinearRing ; 6 ngeo:posList ( 7 [ geo:lat -29; geo:long 16 ] 8 [ geo:lat -28; geo:long 33 ] 9 [ geo:lat -34; geo:long 27 ] 10 [ geo:lat -29; geo:long 16 ] 11 ) 12 ] ; 13 ngeo:interior [ 14 rdf:type ngeo:LinearRing ; 15 ngeo:posList ( 16 [ geo:lat -29.5; geo:long 27 ] 17 [ geo:lat -28.5; geo:long 28.5 ] 18 [ geo:lat -31; geo:long 28 ] 19 [ geo:lat -29.5; geo:long 27 ] 20 ) 21 ] . blake.regalia@gmail.com
  • 8. 1 dbr:Perth rdfs:label "Perth"@en ; 2 geosparql:hasGeometry [ 3 geosparql:asWKT "<http://www.opengis.net/def/crs/EPSG/0/4326> POINT(115.85888671875 31.952222824097)"^^geosparql:wktLiteral → → 4 ] ; GeoSPARQL addressed all the issues because it: • separated places from their geometric representations via a blank node • joined CRS, latitude and longitude values (in fact the entire geometry) into a single RDF literal. blake.regalia@gmail.com
  • 9. Geometries from the Web are dirty, especially given the heterogenous nature of Linked Data. This makes querying topology not so straightforward as GeoSPARQL intends: blake.regalia@gmail.com
  • 10. Computing topology on demand (e.g., using GeoSPARQL functions) can be very expensive given complex geometries: blake.regalia@gmail.com
  • 11. The National Map Figure 1 https://viewer.nationalmap.gov/basic/ blake.regalia@gmail.com
  • 12. The National Map as Linked Data Convert National Map datasets (e.g., GNIS, NHD, Transportation data, and so on) to Linked Data. DLG data consists of many high-resolution geometries; multi/polylines and multi/polygons. Starting with the NHD alone, consists of more than 100 GB of geometry data in binary format. Complex geometries pose a challenge to computing spatial queries on-demand (e.g., querying topology across all feature) blake.regalia@gmail.com
  • 13. Figure 2 National Hydrology Dataset (NHD) coverage for California blake.regalia@gmail.com
  • 14. In addition to storing the raw textual data of an RDF Literal (including all whitespaces, order of coordinates, etc.), GeoSPARQL-enabled triplestores will also perform spatial indexing, often creating a copy of the geometries in a binary format. The National Hydrology Dataset (NHD) geometries for California take up 4.0 GB in binary format. To be GeoSPARQL compatible, this dataset alone requires 10.1 GB, an approx 2.5x increase in storage requirement. blake.regalia@gmail.com
  • 15. GeoSPARQL explicitly allows geometry data to be stored in RDF Literals as either WKT or GML (two human-readable formats). However, beyond simple points and bounding boxes, these geometries are not human-readable: blake.regalia@gmail.com
  • 16. Shortcomings of using RDF Literals to store geometry data: • Geometries have interesting attributes that are static and easily precomputed, but RDF Literals cannot store properties (i.e., they are leaf nodes) • RDF Literals are not linkable across the Web of Linked Data, making them less reusable • Client applications are able to make use of binary formats but are forced to cope with less efficient formats required by GeoSPARQL • Server-side triplestores that support GeoSPARQL will usually store a redundant copy of the geometry in a binary format anyway for spatial operations blake.regalia@gmail.com
  • 17. Raw Geometry in RDF Storing human-readable serializations for geometry: • Because of duplication and overhead, consumes approximately 2.5 times the amount of storage space as binary • Does little for transparency since long strings of coordinates are not human-readable • As a data format, does not facilitate spatial querying; systems rely on copies of geometry in binary formats • Are not optimal for storage/transmission (e.g., web clients) Furthermore, using RDF Literals: • Obstructs the ability to include metadata about the geometry itself (cannot assign properties to RDF Literals) • Prevents the reuse of geometries within a dataset (e.g., cannot create composite geometries) • Limits the interlinking of geometries across the Web of Linked Data (Bob cannot simply reference geometries in Alice’s dataset) blake.regalia@gmail.com
  • 18. 1 osm:relation_2316598 2 rdfs:label "Western Australia"@en ; 3 geosparql:hasGeometry [ 4 geosparql:asWKT "<http://www.opengis.net/def/crs/EPSG/0/4326> POLYGON((128.9999986 -14.4290140, 128.9999714 -14.8798443, ...))"^^geosparql:wktLiteral → → → 5 ] ; 6 # instead... 7 blake.regalia@gmail.com
  • 19. 1 osm:relation_2316598 2 rdfs:label "Western Australia"@en ; 3 geosparql:hasGeometry [ 4 geosparql:asWKT ‘<http://www.opengis.net/def/crs/EPSG/0/4326> POLYGON((128.9999986 -14.4290140, 128.9999714 -14.8798443, ...)) geosparql:wktLiteral → → → 5 ] ; 6 # instead... get rid of blank node and use URI 7 ago:hasGeometry ex:WesternAustraliaPolygon ; blake.regalia@gmail.com
  • 20. Our approach (nicknamed AGO) is to: • Eliminate the need to store human-readable representations of geometries beyond simple points and bounding boxes • Require each geometry is represented using an IRI (i.e., an RDF Named Node) • Those IRIs should be URLs that support content-negotiation to dereference a geometry for either (a) it’s RDF metadata or (b) the raw geometry data in a format requested by the client and supported by the server. blake.regalia@gmail.com
  • 21. Additionally, we recommend: • Precompute (static) geometric attributes and materialize them for each geometry (area, perimeter, centroid, etc.) • Leverage geometry data to aid in precomputing topological relations between features (using OWA) and materialize them on the features blake.regalia@gmail.com
  • 22. Attributes of a geometry are easily accessible: blake.regalia@gmail.com
  • 23. What a query looks like using AGO: blake.regalia@gmail.com
  • 24. Compatibility: • Allow geometry data to persist in a spatially-indexed geodatabase, accessible to triplestore for on-demand spatial querying (e.g., for computing distances, unions, intersections, etc.) and accessible to client by virtue of dereferencing server • Remains compatible with GeoSPARQL query functions in practice! blake.regalia@gmail.com
  • 29. Content-Negotiation for Geometry Data Dereference the IRI of a geometry to fetch its data in a variety of formats. curl "https://usgs.link/geometry/feature?id=42" -H "Accept:$MIME_TYPE" MIME Type Description Returns text/html Web interface <!DOCTYPE html><html lang="en">... text/plain Well-Known Text POLYGON((113.1016 -38.062 ...)) application/gml+xml GML <gml:Polygon><gml:Exterior>... application/vnd.geo+json GeoJSON {"type":"Polygon","coordinates":...} application/octet-stream Well-Known Binary 01 06 00 00 20 E6 10 00 00 01... Client web applications may choose a format that streamlines downloading and rendering geometries on a map. blake.regalia@gmail.com
  • 30. In summary, a comparison of strategies to storing and using geometry: Trait GeoSPARQL NeoGeo AGO Efficient geometry storage Geometry can persist externally 1 Content-negotiation for geometry format Uniform RDF structure Composite geometries Determine geometry type 2 Access bounding box 2 Access raw geometry 2 1 = Geometry can persist in a local geodatabase or even on a remote system and without copies. 2 = From the triples’ RDF data alone (e.g., without using SPARQL). blake.regalia@gmail.com
  • 31. Publications Regalia, B., Janowicz, K., and McKenzie, G. (2019) Computing and Querying Strict, Approximate, and Metrically-Refined Topological Relations in Linked Geographic Data. Transactions in GIS, 2019 Regalia, B., Janowicz, K., Mai, G., Varanka, D., and Usery, L. E.. (2018) GNIS-LD: Serving and Visualizing the Geographic Names Information System As Linked Data. ESWC 2018 Regalia, B., Janowicz, K., and McKenzie, G. (2017) Revisiting the Representation of and Need for Raw Geometries on the Linked Data Web. LDOW@ WWW 2017 Regalia, B., Janowicz, K. and Gao, S. (2016) VOLT: A Provenance Producing, Transparent SPARQL Proxy for the On-Demand Computation of Linked Data and its Applications to Spatiotemporally Dependent Data. ESWC 2016 blake.regalia@gmail.com