Open access and open data: international trends and strategic context
Boyan brodaric environmentalinfrastructure
1. GIN: A Cyberinfrastructure and GeoPortal
for Canadian Groundwater Data
Boyan Brodaric
Geological Survey of Canada
Natural Resources Canada
B. Brodaric—GIN
Cyberra Summit 2010 1
Banff, 22 Sept. 2010
2. Themes
Data Cyberinfrastructure (CI)
web-based resources for data interoperability
Spatial Data (cyber)Infrastructure (SDI)
open standards for geographically located features and observations
Groundwater Information Network (GIN)
Canadian network for groundwater data
B. Brodaric—GIN
Cyberra Summit 2010 2
Banff, 22 Sept. 2010
3. GW data in Canada
Distributed, Uncoordinated data
Feds (< 10), provs & terrs (<50), municipalities (100s?),
watershed authorities (100s?)
Heterogeneous data
Data use, content, structure, systems (dbs, sensors)
Use
Variable Volume Budgets
Use (e.g. extraction, vulnerability): ?
Budgets (e.g. regional recharge): 10s? Reservoirs
Reservoirs (e.g. aquifers): 100s
Observations (e.g. wells, monitoring): 1Ms-10Ms Observations
Variable Quality
Completeness, consistency, location
B. Brodaric—GIN
Cyberra Summit 2010 3
Banff, 22 Sept. 2010
4. GW data in Canada
Ontario & Quebec
schematic and semantic heterogeneity
in water-well data
Quebec rock type
Ontario rock type
B. Brodaric—GIN
Cyberra Summit 2010 4
Banff, 22 Sept. 2010
5. Recent calls for action
GW Data Access
More online access
Consolidate access
Better data quality
More data (use, monitoring)
GW Data Management
B. Brodaric—GIN
Cyberra Summit 2010 5
Banff, 22 Sept. 2010
6. Approach
Groundwater Information Network (GIN)
NRCan, 9 prov/terr (YK, BC, AB, SK, MB, ON, QC, NS, NL), USGS
Seamless access to GW information
Start with water well databases then sensors
GeoConnections seed funding Jan2008-Mar2009
Principles
Distributed: data stays with owners
Seamless: acts as one virtual database
Multi-access: multiple portals, tools
Standards-based: nat’l CGDI & int’l OGC/ISO standards
e.g. Groundwater ML (GWML)
WaterML
GeoSciML
B. Brodaric—GIN
Cyberra Summit 2010 6
Banff, 22 Sept. 2010
8. Approach: data interoperability
Overcome levels of data heterogeneity
pragmatic GW Practices (data usage)
semantic GW Ontology (data content)
schema GWML, WaterML (data structure) Groundwater
OGC
syntax GML (data language)
system WFS, WMS,… (data systems)
B. Brodaric—GIN
Cyberra Summit 2010 8
Banff, 22 Sept. 2010
9. Approach: interop architectures
Catalog Warehouse Network
central registry central database central mediator, registry
unconsolidated access consolidated access consolidated access
common standards common standards common standards
fragmented data duplicate, delayed data virtual, real-time data
e.g. US-CUAHSI e.g. AU-AWRIS, EU-WISE e.g. GIN
OGC OGC
OGC OGC
registry mediator registry
ON registry QC
OGC OGC OGC OGC
ON QC ON QC
B. Brodaric—GIN
Cyberra Summit 2010 9
Banff, 22 Sept. 2010
10. Approach: design
Groundwater Information Network
GIN Advanced:
3D, analysis
GML GWML WaterML
WMS, WFS, SOS
GWML GML-BC GML-AB GML-SK GML-ON GWML GML
GML
WMS, WFS, SOS
B. Brodaric—GIN
Cyberra Summit 2010 10
Banff, 22 Sept. 2010
11. Typical mediator architecture
Ontology!
reasoner"
matcher"
Client! Wrapper
!
“find all water wells with global
" ON
unconsolidated materials”
!
sand
clay
Mediator! local
"
soil
<RockMaterial>
<geneticCategory>
<CGI_TermValue> global" Wrapper
!
<value…>Sedimentary</value> QC
</CGI_TermValue>
</geneticCategory> global
" SABL
<lithology>
… ARGL
<name…>Sand</name> TERR
</lithology>
Registry!
metadata" local
"
send query distribute query translate query (globallocal)
receive results integrate results translate results (localglobal)
distribute results
B. Brodaric—GIN
Cyberra Summit 2010 11
Banff, 22 Sept. 2010
18. Conclusions
Groundwater data interoperability achieved
for water well information and preliminarily sensors
Dynamic mediation effective and efficient
modest data volumes are realistic within wait-times
Open geospatial standards for schemas and
systems are essential
B. Brodaric—GIN
Cyberra Summit 2010 18
Banff, 22 Sept. 2010
21. Semantics: types of ontologies
Global Ontology!
general concepts
Upper-Level ontology !
(DOLCE ʻamount-of-matterʼ)"
Domain ontology ! public schema
public vocabulary
(GeoSciML ʻlithologyʼ, "
GeoSciML ʻsandʼ)"
local schema
local vocabulary
Application Application
ontology ! ontology !
(ON ʻmaterial1ʼ, (QC ʻmatprimʼ,
ON ʻsandʼ)" QC ʻSABLʼ)"
sand SABL
clay ARGL
soil TERR
B. Brodaric—GIN
Cyberra Summit 2010 21
Banff, 22 Sept. 2010
22. Schematics: GWML example
standard
structure
<gsml:lithology> standard
<gsml:ControlledConcept gml:id="gin.cc.2d-2"> content
<gsml:identifier codeSpace="urn:ietf:rfc:2141">urn:x-ngwd:vocabulary:gin:2d-2"</gsml:identifier>
<gsml:name codeSpace="urn:x-ngwd:classifierScheme:GIN:Lithology:2008" xml:lang="fr">Argile</gsml:name>
<gsml:name codeSpace="urn:x-ngwd:classifierScheme:GIN:Lithology:2008" xml:lang="eng">Clay</gsml:name>
<gml:description>A naturally occurring material composed primarily of fine-grained minerals.
It is generally plastic at appropriate water contents and will harden when
dried of fired (Neuendorf et al. 2005)</gml:description>
</gsml:lithology>
<gsml:material>
<gsml:UnconsolidatedMaterial>
<gsml:consolidationDegree>
<gsml:CGI_TermValue>
<gsml:value codeSpace="urn:cgi:classifierScheme:BGS:consolidationTerms">UNCONSOLIDATED</gsml:value>
</gsml:CGI_TermValue>
</gsml:consolidationDegree>
<gsml:physicalProperty>
<gwml:HydrogeologicDescription>
<gwml:hydraulicConductivity>
<gsml:CGI_NumericValue>
<gsml:qualifier>approximate</gsml:qualifier>
<gsml:principalValue uom="y_K_md-1">0.001</gsml:principalValue>
</gsml:CGI_NumericValue>
</gwml:hydraulicConductivity>
</gwml:HydrogeologicDescription>
</gsml:physicalProperty>
</gsml:UnconsolidatedMaterial>
</gsml:material>
B. Brodaric—GIN
Cyberra Summit 2010 22
Banff, 22 Sept. 2010
31. Next Steps
More geographic coverage
other Canadian partners
Higher quality data
time-indexed data: water levels, flow rates, quality… SOS
More types of data
aquifers, geology, 3D,… WCS
More tools
3D Modeling,…
More infrastructure
CWS, OWL Reasoner/Service!
B. Brodaric—GIN
Cyberra Summit 2010 31
Banff, 22 Sept. 2010