Ecoinformatics International Technical Collaboration Partnership
International Web Meeting - Linked Open Data and Environmental Information
Day 1 – December 6, 2010

Geospatial Topic – Dave Smith


December 6, 2010



                                                                                           Dave Smith
                                                                              USEPA/OEI/OIC/IESD/ISSB
                                                                                 smith.davidg@epa.gov
                                                                                          202-566-0797


Document Change History
  Revision   Date        Author           Description
  1.0        12/6/2010   David G. Smith   Initial Version



FRS as a Linked Open Data Pilot - Background

EPA maintains a database of facilities, the Facility Registry System (FRS), which is aggregated from a
variety of sources: 32 federal databases (mostly EPA, along with a few others such as the Energy
Information Administration) and 57 state and tribal databases. Information about each facility is
conflated from these sources and includes the facility name; geographic location (spatial feature type
such as point or polygon, latitude, longitude, coordinate reference system, and collection metadata);
physical and mailing addresses; points of contact; activities conducted at the location (via North
American Industry Classification System, or NAICS, codes and those of its predecessor, the Standard
Industrial Classification, or SIC); and any associated program identifiers, permit numbers, and other
related items.
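
To make the shape of such a conflated record concrete, the following is a minimal sketch in Python. It is not the FRS schema: every class, field, and function name here is hypothetical, and the merge rule (first source wins on scalar fields, sets are unioned) is an assumption chosen only for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Location:
    feature_type: str                        # e.g. "point" or "polygon"
    latitude: float
    longitude: float
    crs: str                                 # coordinate reference system, e.g. "NAD83"
    collection_method: Optional[str] = None  # collection metadata


@dataclass
class FacilityRecord:
    registry_id: str                         # hypothetical shared identifier
    name: str
    location: Location
    physical_address: str = ""
    mailing_address: str = ""
    contacts: List[str] = field(default_factory=list)
    naics_codes: List[str] = field(default_factory=list)
    sic_codes: List[str] = field(default_factory=list)
    # Maps a program system name to that program's identifier for the facility.
    program_ids: Dict[str, str] = field(default_factory=dict)


def conflate(primary: FacilityRecord, other: FacilityRecord) -> FacilityRecord:
    """Fold a second source record for the same facility into the primary one.

    Assumed rule: the primary record wins on scalar fields; list fields are
    unioned; program identifiers from both sources are kept.
    """
    if primary.registry_id != other.registry_id:
        raise ValueError("records describe different facilities")
    return FacilityRecord(
        registry_id=primary.registry_id,
        name=primary.name,
        location=primary.location,
        physical_address=primary.physical_address or other.physical_address,
        mailing_address=primary.mailing_address or other.mailing_address,
        contacts=sorted(set(primary.contacts) | set(other.contacts)),
        naics_codes=sorted(set(primary.naics_codes) | set(other.naics_codes)),
        sic_codes=sorted(set(primary.sic_codes) | set(other.sic_codes)),
        program_ids={**other.program_ids, **primary.program_ids},
    )
```

In practice the matching step (deciding that two source records describe the same facility) is the hard part; this sketch assumes it has already been done and a shared identifier exists.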

This database in turn serves as a geospatial foundation for some of EPA’s reporting and mapping tools,
such as Envirofacts and MyEnvironment, allowing parametric data and reports from a variety of programs
to be linked to facilities.

Currently this integration is done via traditional means, i.e. Relational Database Management System
(RDBMS) queries. Web services and APIs are also limited, so integration opportunities are generally
limited to what we can do within the Agency.
EcoInformatics – Geospatial Discussion
                              November 11, 2010                          December 6, 2010


Opportunity

Linked Open Data approaches offer the potential to publish this facilities data framework in a way that
allows analysis across other agencies as well: for example, against Occupational Safety and Health
Administration (OSHA) or Mine Safety and Health Administration (MSHA) enforcement histories, or against
offshore platforms using Bureau of Ocean Energy Management, Regulation and Enforcement (BOEMRE) data.
Other cross-cutting, government-wide approaches become possible as more Linked Open Data assets become
available.

Initial Efforts

EPA is still in the planning stages. We have published some initial FRS data as RDF via Data.gov, and
we are now working to iteratively refine our Linked Open Data (LOD) publishing approach through a
“cookbook” that we hope to apply to a number of EPA datasets. The cookbook is intended to establish a
framework of consistent methodologies and approaches for publishing Linked Open Data agencywide. Part
of this work will be to leverage existing agency investments in metadata, data dictionaries,
terminologies and ontologies to further contextualize agency data assets.

For FRS, we hope to contextualize the various facets of the data, e.g. corporate/organizational entity,
points of contact, activities and other aspects.
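
As a rough illustration of what publishing one facility as RDF could involve, the sketch below writes a few N-Triples statements using only the Python standard library. The `example.org` base URI, the `Facility`, `naicsCode`, and `programIdentifier` terms, and the function names are all hypothetical placeholders, not the vocabulary used for the Data.gov release; only the `rdf:type` and `rdfs:label` URIs are standard.

```python
from urllib.parse import quote

RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"
BASE = "http://example.org/frs/facility/"   # hypothetical base URI
VOCAB = "http://example.org/frs/def#"       # hypothetical vocabulary


def uri(value):
    """Wrap an absolute URI in N-Triples angle brackets."""
    return f"<{value}>"


def literal(value):
    """Quote a plain string literal, escaping the characters N-Triples requires."""
    escaped = (
        value.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )
    return f'"{escaped}"'


def facility_triples(registry_id, name, naics_codes, program_ids):
    """Return N-Triples lines describing one facility."""
    subject = uri(BASE + quote(registry_id, safe=""))
    lines = [
        f"{subject} {uri(RDF_TYPE)} {uri(VOCAB + 'Facility')} .",
        f"{subject} {uri(RDFS_LABEL)} {literal(name)} .",
    ]
    for code in naics_codes:
        lines.append(f"{subject} {uri(VOCAB + 'naicsCode')} {literal(code)} .")
    for program, identifier in sorted(program_ids.items()):
        value = literal(f"{program}:{identifier}")
        lines.append(f"{subject} {uri(VOCAB + 'programIdentifier')} {value} .")
    return lines


if __name__ == "__main__":
    for line in facility_triples("F-1", "Example Plant", ["212111"], {"PROG_A": "A-1"}):
        print(line)
```

A fuller approach would replace the string-valued codes and identifiers with links to shared URIs, which is where the contextualization described above would come in.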

Geospatial Enablement

There are multiple aspects to geo-enablement via Linked Open Data. One is how to represent features in
a manner that works for mapping: points, lines, polygons and their associated topologies and
coordinates, along with metadata describing such things as coordinate reference systems and locational
accuracy estimates.

For the geospatial feature component of FRS, we hope to look at current Open Geospatial Consortium
(OGC) standards and efforts, such as the GeoSemantics SWG, as well as emergent GeoSPARQL efforts, and
to collaborate with the Spatial Ontology Community of Practice (SOCOP). We will need to examine the
most effective means of representing features, such as GeoRSS, and of handling current coordinate
reference systems (e.g. NAD83), to support interoperability and geospatial analysis.
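
One candidate representation is a Well-Known Text (WKT) literal that carries its coordinate reference system with it, in the form used by GeoSPARQL's `wktLiteral` datatype. The sketch below is only an illustration under stated assumptions: the function name and the CRS lookup table are hypothetical, and the coordinates are written latitude first on the assumption that axis order follows the definition of the EPSG codes used here.

```python
GEO_WKT_LITERAL = "http://www.opengis.net/ont/geosparql#wktLiteral"

# Assumed mapping from a short CRS name to an OGC CRS URI.
# EPSG:4269 is geographic NAD83; EPSG:4326 is geographic WGS 84.
CRS_URIS = {
    "NAD83": "http://www.opengis.net/def/crs/EPSG/0/4269",
    "WGS84": "http://www.opengis.net/def/crs/EPSG/0/4326",
}


def point_wkt_literal(latitude, longitude, crs="NAD83"):
    """Format a point as a typed WKT literal that names its CRS.

    Coordinates are emitted latitude first, which is assumed to be the
    axis order defined for both EPSG codes in CRS_URIS.
    """
    if not -90.0 <= latitude <= 90.0:
        raise ValueError(f"latitude out of range: {latitude}")
    if not -180.0 <= longitude <= 180.0:
        raise ValueError(f"longitude out of range: {longitude}")
    if crs not in CRS_URIS:
        raise ValueError(f"unknown CRS: {crs}")
    return f'"<{CRS_URIS[crs]}> POINT({latitude} {longitude})"^^<{GEO_WKT_LITERAL}>'


if __name__ == "__main__":
    print(point_wkt_literal(38.9, -77.0))
```

Keeping the CRS inside the literal matters for FRS because a consumer that silently assumes WGS 84 for NAD83 coordinates introduces a small positional error. Locational accuracy estimates would still need to be carried as separate metadata.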

Another aspect concerns the geography of interest: relating the facility attribute ontology to the
surrounding terrain ontology to provide context. For example, if we are dealing with a mining facility,
can one relate the facility interest to other datasets such as geology, stratigraphy, and other
mining-related data?

These may require some tuning in how we collect and model data. For example, most of our data has
historically been program-specific, and some of these subtler nuances are currently reachable only
through imperfect derivation from things like NAICS codes.
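
To show why such derivation is imperfect, here is a minimal sketch of inferring "mining facility" from NAICS codes alone. The function name is hypothetical; the only fact relied on is that NAICS sector 21 covers Mining, Quarrying, and Oil and Gas Extraction.

```python
# NAICS sector 21: Mining, Quarrying, and Oil and Gas Extraction.
MINING_SECTOR_PREFIX = "21"


def looks_like_mining(naics_codes):
    """Guess whether a facility is mining-related from its NAICS codes.

    This is a heuristic: it only sees the activity codes recorded for the
    facility, so a mine-site facility reported under a different sector
    is missed, and sector 21 also includes oil and gas extraction.
    """
    return any(code.strip().startswith(MINING_SECTOR_PREFIX) for code in naics_codes)


if __name__ == "__main__":
    print(looks_like_mining(["212111"]))   # code in sector 21
    print(looks_like_mining(["221112"]))   # code in sector 22
```

An explicit facility-type attribute linked to a shared ontology would remove the need for this kind of prefix matching.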

Next Steps


We hope to collaborate with our counterparts in other agencies on best practices and lessons learned.
In the case of EPA’s Facility Registry System, there are direct, tangible, and implementable pieces
which we can put into motion, and there is an opportunity to develop a more robust Linked Open Data
approach, an effort which has already kicked off.




