Publishing EPA Data as Linked Data
A brief by Michael Pendleton
EPA Office of Environmental Information
pendleton.michael@epa.gov
What is driving us?
“We’re moving from managing documents to managing discrete pieces of open data
and content which can be tagged, shared, secured, mashed up and presented in the
way that is most useful for the consumer of that information.”

-- Report on Digital Government: Building a 21st Century Platform to Better Serve the American People
Goal: Make Open Data, Content, and Web APIs the New Default
Linked Data
What’s It All About?

 • Speak the Language of the Web
 • Just as you surf web pages, linked data lets you surf
   data.
 • SOAP was about making the web try to work like
   applications; REST was about making applications
   work like the web.
 • Linked Data is about making your DATA work like the
   web.


  Slide Credit: David G. Smith, U.S. Environmental Protection Agency, Aug 16, 2011 presentation
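As a rough illustration of "surfing data", the sketch below walks a tiny in-memory graph by following every value that is itself a URI. The example.org URIs, the WEB dictionary, and the surf function are all hypothetical names made up for this sketch; a real client would dereference each URI over HTTP rather than look it up in a dictionary.

```python
# Hypothetical in-memory "web of data": each URI maps to (predicate, object) pairs.
# Every example.org URI below is invented for illustration only.
WEB = {
    "http://example.org/facility/1": [
        ("http://example.org/vocab/name", "Acme Plant"),
        ("http://example.org/vocab/emits", "http://example.org/substance/benzene"),
    ],
    "http://example.org/substance/benzene": [
        ("http://example.org/vocab/name", "Benzene"),
    ],
}


def surf(start, web=WEB):
    """Follow links breadth-first and return the URIs visited, in order."""
    seen, queue = [], [start]
    while queue:
        uri = queue.pop(0)
        if uri in seen or uri not in web:
            continue
        seen.append(uri)
        for _predicate, obj in web[uri]:
            # An object that is a known URI is a link we can follow.
            if obj in web:
                queue.append(obj)
    return seen


visited = surf("http://example.org/facility/1")
```

Starting from the facility, the walk reaches the substance it links to, much as a browser follows a hyperlink from one page to the next.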
RDF is a lingua franca for data exchange
Linked Data
Basics
• Tim Berners-Lee: 5-Star model for publishing data

Slide Credit: David G. Smith, U.S. Environmental Protection Agency
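The slide names the 5-Star model without listing its criteria. As commonly stated, the stars are cumulative: (1) on the Web under an open license, (2) machine-readable structured data, (3) a non-proprietary format, (4) URIs and W3C standards to identify things, (5) links to other people's data. A minimal sketch of that scoring, with a hypothetical function name and parameter names chosen for this example:

```python
def star_rating(on_web_open_license, structured, open_format, uses_uris, links_out):
    """Return 0-5 stars; each star is only earned if all earlier ones are met."""
    stars = 0
    for met in (on_web_open_license, structured, open_format, uses_uris, links_out):
        if not met:
            break
        stars += 1
    return stars


# A CSV file published under an open license: structured and non-proprietary,
# but no URIs and no outbound links yet.
csv_stars = star_rating(True, True, True, False, False)
```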
• Linked Data is about
 publishing and
 consuming data
 using international
 data standards
• Based on a 20-year-old
 idea (the Web)
• A system of linked
 information systems
Global requirements
• Comprehensively link
  legislation & regulations
  for more effective
  government

• Explain context, source,
  version & publication
  date with the data itself

• We need global
  standards for metadata
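One way to carry source, version, and publication date "with the data itself" is to state them as triples about the dataset. The sketch below emits N-Triples using the Dublin Core terms `source` and `issued` and the OWL `versionInfo` property; the function name, the example.org URIs, and the sample values are assumptions made for illustration, not anything the EPA pilot is known to use.

```python
from datetime import date

DCT = "http://purl.org/dc/terms/"
OWL = "http://www.w3.org/2002/07/owl#"
XSD_DATE = "http://www.w3.org/2001/XMLSchema#date"


def describe_dataset(dataset_uri, source_uri, version, issued):
    """Return N-Triples lines stating source, version, and publication date.

    Assumes `version` is a plain string with no quotes or backslashes.
    """
    return [
        f"<{dataset_uri}> <{DCT}source> <{source_uri}> .",
        f'<{dataset_uri}> <{OWL}versionInfo> "{version}" .',
        f'<{dataset_uri}> <{DCT}issued> "{issued.isoformat()}"^^<{XSD_DATE}> .',
    ]


# Hypothetical dataset and source URIs, invented for this example.
lines = describe_dataset(
    "http://example.org/dataset/facilities",
    "http://example.org/source/facilities-export",
    "2012-06",
    date(2012, 6, 1),
)
```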
The mission of the Government Linked
Data (GLD) Working Group is to provide
standards and other information which
help governments around the world
publish their data as effective and usable
Linked Data using Semantic Web
technologies.
Best Practices

Vocabulary Guidance

Community Building
US EPA publishes lots of CSV files ...
And now, Linked Open Data ...
•   A proof-of-concept launched in 2011 with 5 Star Linked Data

•   Publication of 1.3M facilities (FRS) and the substances (SRS)
    regulated by the EPA

•   TRI program links to 25 years of data on major polluters

•   Additional pilots in 2012 incorporating EPA and anonymized
    electronic medical records (EMR) data from Sentara
    Healthcare

•   5 Star Linked Open Data to be hosted & accessible on an EPA
    production Web site in summer 2012
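To make the move from CSV files to Linked Data concrete, here is a minimal sketch that turns each CSV row into N-Triples. The base URI, the vocabulary URI, the column names, and the sample row are all made up for this example and are not the actual FRS or SRS data or URIs; column names are assumed to be URI-safe.

```python
import csv
import io

BASE = "http://example.org/facility/"   # hypothetical URI base
VOCAB = "http://example.org/vocab/"     # hypothetical vocabulary namespace


def escape(text):
    """Escape backslashes, quotes, and line breaks for an N-Triples literal."""
    return (text.replace("\\", "\\\\").replace('"', '\\"')
                .replace("\n", "\\n").replace("\r", "\\r"))


def csv_to_ntriples(csv_text, id_column):
    """Yield one triple per non-empty cell; the id column mints the subject URI."""
    for row in csv.DictReader(io.StringIO(csv_text)):
        subject = f"<{BASE}{row[id_column]}>"
        for column, value in row.items():
            if column == id_column or not value:
                continue
            yield f'{subject} <{VOCAB}{column}> "{escape(value)}" .'


# Invented sample data, for illustration only.
sample = "id,name,state\n110000123,Acme Plant,VA\n"
triples = list(csv_to_ntriples(sample, "id"))
```

Every value stays a plain literal here; a fuller conversion would map known columns to shared vocabularies and turn identifiers of other things (such as substances) into links.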
Increase re-use by publishing
        Linked Data
  •   Empower users to create their own views of data to
      satisfy different applications

  •   Build a community around the data in which users help
      each other to curate and connect as needed

  •   Skip the supermodel - Leave data in the multiple “best
      of breed” systems; wrap and expose on the Web of Data
There is a Process


Identify → Model → Name → Describe → Convert → Publish → Maintain
7 steps to publishing Linked Data
•   Identify a dataset others are likely to want to re-use
•   Modeling
    •   Onsite modeling session (half day)
    •   Linked Data modeling supported by experts
    •   Validate the model with data owners/stewards
•   Publish data on the Web (opendata.epa.gov) per Best Practices
•   Produce automated scripts to maintain current data
•   Announce Linked Open Data sets *
•   Review usage reports to support relevance & user feedback


             * Pending EPA Systems Security Plan approval
Open Data Platforms
•   We’re using Callimachus, a Web
    platform for data-driven applications
    based on Linked Data principles.

•   It is hosted on Amazon EC2 and we
    have 24x7x365 data & application
    support.

•   There are other data platforms; we
    selected this one because it is fully
    W3C standards compliant, with no vendor
    “lock-in”

•   It’s Open Source (Apache 2.0)
Recommendations
• Linked Data promotes goals of transparency &
  economic development during times of fiscal
  austerity
 •  Publish in reusable format (RDF family of
    standards)
 •  Use OPEN rather than proprietary data formats
 •  Define a URI Policy and Strategy
 •  Use best practices and vocabularies that already exist;
    don’t reinvent the wheel
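A URI policy is easier to follow when it is encoded in one small function that every publishing script calls. The sketch below assumes a hypothetical pattern of `{base}/id/{kind}/{local-id}` with lowercase, hyphenated identifiers; the pattern, the function name, and the example.org base are illustrative assumptions, not an EPA policy.

```python
import re
from urllib.parse import quote


def mint_uri(base, kind, local_id):
    """Build a URI of the form {base}/id/{kind}/{slug} from a raw identifier."""
    # Lowercase, then collapse every run of non-alphanumerics into one hyphen.
    slug = re.sub(r"[^a-z0-9]+", "-", local_id.strip().lower()).strip("-")
    if not slug:
        raise ValueError("local_id has no usable characters")
    return f"{base.rstrip('/')}/id/{quote(kind)}/{slug}"


uri = mint_uri("http://example.org/", "facility", " Acme Plant #7 ")
```

Keeping the rule in one place means the same thing always gets the same name, which is what lets other datasets link to it.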
Publishing Linked Data
will require continual
nurturing but the
rewards are worth it
Resources
•   VisibleGovernment.ca Website http://visiblegovernment.ca
•   Hack, Mash and Peer: Crowdsourcing Government Transparency, Jerry Brito, George
    Mason University, http://papers.ssrn.com/sol3/papers.cfm?abstract_id=1023485
•   Blog on UK Environment Agency Water Quality, see
    http://data.southampton.ac.uk/datasets.html
•   Southampton Open Data Service, see http://data.southampton.ac.uk/datasets.html
•   Blog post on Clean Energy data from Reegle, see
    http://blog.semantic-web.at/2012/04/13/reegle-info-linked-open-energy-data-cloud/
•   Blog post on Publishing Linked Open Data in Tight Economic Times, 30-Jan-2012,
    http://3roundstones.com/2012/01/30/publishing-linked-open-data-makes-good-sense-in-tight-economic-times/
•   Blog post on HealthData.gov from US Health & Human Services, 4-June-2012,
    http://www.healthdata.gov/blog/welcome-new-healthdatagov
•   Blog post on US HHS Domain Challenge 1: Metadata, 2-June-2012,
    http://www.healthdata.gov/blog/domain-challenge-1-metadata
Coming soon ...
•   Best Practices for Publishing Linked Data (Editor’s Draft
    20-Apr-2012), see
    https://dvcs.w3.org/hg/gld/raw-file/default/bp/index.html

•   Linked Data Cookbook, see
    http://www.w3.org/2011/gld/wiki/Linked_Data_Cookbook

•   Linked Data Directory, see http://dir.w3.org

•   Attend the 2012 International Open Government Data
    Conference co-sponsored by data.gov & The World Bank
    10-12 July 2012, Washington DC, see
    http://www.data.gov/communities/conference
This work is Copyright © 2011-2012 3 Round Stones Inc.
It is licensed under the Creative Commons Attribution 3.0 Unported License
Full details at: http://creativecommons.org/licenses/by/3.0/

You are free:

       to Share — to copy, distribute and transmit the work



       to Remix — to adapt the work



Under the following conditions:
       Attribution. You must attribute the work in the manner specified by the
       author or licensor (but not in any way that suggests that they endorse you
       or your use of the work).

       Share Alike. If you alter, transform, or build upon this work, you may
       distribute the resulting work only under the same or similar license to this
       one.
Credits
•   Jennifer Bell, VisibleGovernment.ca (CC-BY-SA), http://www.slideshare.net/jenniferbell
•   1-5 Star Linked Data image, http://lab.linkeddata.deri.ie/2010/star-scheme-by-example/
•   LOD Cloud Diagrams, Richard Cyganiak, Anja Jentzsch (CC-BY-SA), http://lod-cloud.net/

Book covers © their respective owners and used under Fair Use for educational purposes



© 2012 Bernadette Hyland, released under a CC-BY-SA license

Linked Data Overview - structured data on the web for US EPA 20140203
 
Data Transparency 2013 - OrgPedia by 3 Round Stones
Data Transparency 2013 - OrgPedia by 3 Round StonesData Transparency 2013 - OrgPedia by 3 Round Stones
Data Transparency 2013 - OrgPedia by 3 Round Stones
 
ORGpedia: The Open Organizational Data Project
ORGpedia: The Open Organizational Data ProjectORGpedia: The Open Organizational Data Project
ORGpedia: The Open Organizational Data Project
 
Linked Data: The Jargon-free Primer on Integrating Data on the Web
Linked Data: The Jargon-free Primer on Integrating Data on the WebLinked Data: The Jargon-free Primer on Integrating Data on the Web
Linked Data: The Jargon-free Primer on Integrating Data on the Web
 
Delivering on Standards for Publishing Government Linked Data
Delivering on Standards for Publishing Government Linked DataDelivering on Standards for Publishing Government Linked Data
Delivering on Standards for Publishing Government Linked Data
 
The Power of Linked Data for Government & Healthcare Information Integration
The Power of Linked Data for Government & Healthcare Information IntegrationThe Power of Linked Data for Government & Healthcare Information Integration
The Power of Linked Data for Government & Healthcare Information Integration
 
MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
 
Sharing data on the web (2013)
Sharing data on the web (2013)Sharing data on the web (2013)
Sharing data on the web (2013)
 
New York City and Baltimore Semantic Web Meetups 20130221/20120226
New York City and Baltimore Semantic Web Meetups 20130221/20120226New York City and Baltimore Semantic Web Meetups 20130221/20120226
New York City and Baltimore Semantic Web Meetups 20130221/20120226
 
US National Archives & Open Government Data
US National Archives & Open Government DataUS National Archives & Open Government Data
US National Archives & Open Government Data
 
US EPA OSWER Linked Data Workshop 1-Feb-2013
US EPA OSWER Linked Data Workshop 1-Feb-2013US EPA OSWER Linked Data Workshop 1-Feb-2013
US EPA OSWER Linked Data Workshop 1-Feb-2013
 
Linked Data Book: DC Semantic Web Meetup 20130129
Linked Data Book: DC Semantic Web Meetup 20130129Linked Data Book: DC Semantic Web Meetup 20130129
Linked Data Book: DC Semantic Web Meetup 20130129
 

Dernier

Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 

Dernier (20)

Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 

EPA OEI Linked Data Process

  • 12. US EPA publishes lots of CSV files ...
  • 13. And now, Linked Open Data ... • A proof-of-concept launched in 2011 with 5 Star Linked Data • Publication of 1.3M facilities (FRS) and the substances (SRS) regulated by the EPA • TRI program links to 25 years of data on major polluters • Additional pilots in 2012 incorporating EPA data and anonymized electronic medical records (EMR) data from Sentara Healthcare • 5 Star Linked Open Data to be hosted & accessible on an EPA production Web site in summer 2012
  • 14. Increase re-use by publishing Linked Data • Empower users to create their own views of data to satisfy different applications • Build a community around the data in which users help each other to curate and connect as needed • Skip the supermodel - Leave data in the multiple “best of breed” systems; wrap and expose on the Web of Data
  • 15. There is a Process: Identify → Model → Name → Describe → Convert → Publish → Maintain
  • 19. 7 steps to publishing Linked Data • Identify a dataset others are likely to want to re-use • Modeling • Onsite modeling session (half day) • Linked Data modeling supported by experts • Validate the model with data owners/stewards • Publish data on the Web (opendata.epa.gov) per Best Practices • Produce automated scripts to maintain current data • Announce Linked Open Data sets * • Review usage reports to support relevance & user feedback * Pending EPA Systems Security Plan approval
  • 20. Open Data Platforms • We’re using Callimachus, a Web platform for data-driven applications based on Linked Data principles. • It is hosted on Amazon EC2 and we have 24x7x365 data & application support. • There are other data platforms; we selected this one because it is fully W3C standards compliant, with no vendor “lock in” • It’s Open Source (Apache 2.0)
  • 31. Recommendations • Linked Data promotes goals of transparency & economic development during times of fiscal austerity • Publish in a reusable format (RDF family of standards) • Use OPEN rather than proprietary data formats • Define a URI Policy and Strategy • Use the best practices and vocabularies that already exist -- don’t reinvent the wheel
  • 32. Publishing Linked Data will require continual nurturing but the rewards are worth it
  • 33. Resources • VisibleGovernment.ca Website http://visiblegovernment.ca • Hack, Mash and Peer: Crowdsourcing Government Transparency, Jerry Brito, George Mason University, http://papers.ssrn.com/sol3/papers.cfm?abstract_id=1023485 • Blog on UK Environment Agency Water Quality, see http://data.southampton.ac.uk/datasets.html • Southampton Open Data Service, see http://data.southampton.ac.uk/datasets.html • Blog post on Clean Energy data from Reegle, see http://blog.semantic-web.at/2012/04/13/reegle-info-linked-open-energy-data-cloud/ • Blog post on Publishing Linked Open Data in Tight Economic Times, 30-Jan-2012, http://3roundstones.com/2012/01/30/publishing-linked-open-data-makes-good-sense-in-tight-economic-times/ • Blog post on HealthData.gov from US Health & Human Services, 4-June-2012, http://www.healthdata.gov/blog/welcome-new-healthdatagov • Blog post on US HHS Domain Challenge 1: Metadata, 2-June-2012, http://www.healthdata.gov/blog/domain-challenge-1-metadata
  • 34. Coming soon ... • Best Practices for Publishing Linked Data (Editor’s Draft 20-Apr-2012), see https://dvcs.w3.org/hg/gld/raw-file/default/bp/index.html • Linked Data Cookbook, see http://www.w3.org/2011/gld/wiki/Linked_Data_Cookbook • Linked Data Directory, see http://dir.w3.org • Attend the 2012 International Open Government Data Conference co-sponsored by data.gov & The World Bank 10-12 July 2012, Washington DC, see http://www.data.gov/communities/conference
  • 35. This work is Copyright © 2011-2012 3 Round Stones Inc. It is licensed under the Creative Commons Attribution 3.0 Unported License Full details at: http://creativecommons.org/licenses/by/3.0/ You are free: to Share — to copy, distribute and transmit the work to Remix — to adapt the work Under the following conditions: Attribution. You must attribute the work in the manner specified by the author or licensor (but not in any way that suggests that they endorse you or your use of the work). Share Alike. If you alter, transform, or build upon this work, you may distribute the resulting work only under the same or similar license to this one.
  • 36. Credits • Jennifer Bell, VisibleGovernment.ca, http://www.slideshare.net/jenniferbell (CC-BY-SA) • 1-5 Star Linked Data image, http://lab.linkeddata.deri.ie/2010/star-scheme-by-example/ • LOD Cloud Diagrams, Richard Cyganiak, Anja Jentzsch, http://lod-cloud.net/ (CC-BY-SA) • Book covers © their respective owners and used under Fair Use for educational purposes • © 2012 Bernadette Hyland, released under a CC-BY-SA license
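
The recommendation to define a URI policy and strategy can be illustrated with a small sketch. The Python below shows one way a documented policy might be enforced in code, so that every record of a given kind gets a predictable, stable URI. The base URI `http://example.org/id` and the `/{kind}/{identifier}` pattern are assumptions made for illustration; the slides do not state EPA's actual URI policy.

```python
import re
from urllib.parse import quote

# Hypothetical base URI; a real policy would name the publisher's own domain.
BASE = "http://example.org/id"


def slugify(value: str) -> str:
    """Lowercase the value and collapse runs of non-alphanumerics into hyphens."""
    slug = re.sub(r"[^a-z0-9]+", "-", value.strip().lower()).strip("-")
    if not slug:
        raise ValueError("value has no usable characters")
    return slug


def mint_uri(kind: str, identifier: str) -> str:
    """Build a URI of the assumed form BASE/{kind}/{identifier}.

    The identifier is percent-encoded rather than rewritten, so the
    source system's key stays recoverable from the URI.
    """
    return f"{BASE}/{slugify(kind)}/{quote(identifier.strip(), safe='')}"


if __name__ == "__main__":
    print(mint_uri("Facility", "110000123456"))
```

Keeping the minting rule in a single function means the conversion scripts and any later maintenance runs produce the same URI for the same record.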

Editor's Notes

  1. The report recently published by the White House describes the information, platform and presentation layers of the digital services that agencies are to provide. The EPA joins government authorities around the world that are defining plans based on open data and open APIs.
  2. A lot of people in governments around the world are publishing data on the Web of Data. We’re familiar with portals such as data.gov.uk and data.gov. Often this is in the form of CSV files, but an increasing amount is available as well-modeled LINKED DATA. We just participated in the International Open Data Conference, which showed that the open (government) data community is really thriving: 450 in-person participants from over 50 countries, 4000 online participants, over 2000 tweets & 162 speakers.
  3. Not all Open Government content is Linked Data. But a growing number of data sets are available as 4-5 star linked data. Use of structured data is actively promoted by international standards groups like the W3C and by the major search engines: Google, Yahoo!, Bing and Yandex.
  4. This presentation discusses the increasing number of high value data sets being published by the EPA as 5 star Linked Data. This means data is published on the Web in both human & machine readable formats. A human can read the nicely formatted content AND a machine can find, access and re-use the machine readable format if it is published in the Web’s data exchange format, RDF.
  5. There are a growing number of resources on this topic. Several have been authored by EPA’s Linked Data contractors, Dr. David Wood and Bernadette Hyland, and their colleague Dr. Tom Heath. Links to all the projects described in this talk are included at the end of this presentation.
  6. Data formats and standards sometimes sound like alphabet soup to many people. The EPA is a member of the W3C Government Linked Data Working Group. We have a practical focus on removing the friction from the Web publishing process and, specifically, are working to make it easier for government authorities to publish DATA on the WEB.
  7. The GLD working group works with leading academics, and has guests from the private sector & non-profits who define use cases for open government data. They describe the need for government agencies to publish content that describes the relative authority of a piece of data, for example, case law and regulation.
  8. The EPA is a member of the W3C and is active on the Government Linked Data Working Group, along with our colleague George Thomas from HHS. We are one year into the working group’s two year charter. Our mission is ...
  9. The GLD WG is on track to publish BEST PRACTICES and Vocabulary Guidance as W3C Recommendations, which are the standards of the World Wide Web. We’ve also produced a Linked Data Directory of projects, products and service providers, and a Government Linked Data Cookbook describing a step by step approach for developers.
  10. So where are we today? The EPA already publishes a huge amount of information as CSV files and through portals like Envirofacts. Unfortunately, that data is often hard to find and lacks context. Furthermore, it’s written from a regulatory perspective. It is not re-usable by other scientists and the public without significant re-structuring.
  11. Our goals are to broaden access and re-use of this important data that taxpayers have paid us to collect, and to reduce the burden of compliance for regulated entities.
  12. So here is the exact process: Identify the data, model exemplar records -- what you are going to carry forward. Name all of the NOUNs. Turn the records into URIs. Next, describe RESOURCES with vocabularies. Write a script or process to convert from, say, CSV to RDF. Automate it so it is easy to maintain. This is routinely done in a 30-60 day sprint, with the involvement of the EPA data steward, a project manager and 2 Linked Data experts, part time.
  13. We draw “ball and stick” diagrams that describe how all the data is RELATED to each other. That is all there is to Linked Data: it is a view of data and its relationships to other pieces of information. Other people can come along and add more relationships and information they have.
  14. Then we produce scripts that convert CSV to RDF. These scripts can be run ANYTIME there is an update to the underlying CSV extract from the relational database that today stores the data.
  15. So let’s review the entire process for producing Linked Open Data, and we’ll show you what the UI looks like next... OEI has followed this process with 3 different data sets of varying complexity, size and data quality. Each data set was published within 60 days on an interim cloud server on Amazon EC2, with part time involvement by several EPA staff and a couple of contractors. See http://usepa.3roundstones.net We expect the System Security Plan for the production data platform to be approved this summer & we’ll host as much Linked Data as EPA produces.
  16. The data platform landscape is emerging. Data.gov is using Socrata for 1-3 star data. We felt it was important to avoid vendor lock-in from proprietary formats & ETL processes, so we chose a Web Standards compliant, open source platform specializing in 4 & 5 star data. It’s commercially supported and available via the cloud.
  17. Once we had the data modeled and validated with SMEs, we converted it & loaded it into Callimachus. We spent about 1 hour creating templates to view the data in Callimachus. So here is the power of LOD in action -- within one hour, we could view the data, navigate through the data and verify the contents without being a DBA or Java developer!
  18. A designer with CSS skills can help us make it look pretty with a nice CSS theme. Thus, Web developers with HTML, CSS and RDFa / SPARQL skills can create data driven Web applications. No understanding of semantics or deep RDF knowledge is required.
  19. Callimachus’ forms driven interface allows authorized users to modify the underlying triples in the database -- we are round tripping create/modify/delete to a triple store via a Web page!
  20. This is an example of an application that was created in less than 3 days by a Web developer using Callimachus. The data sources included EPA FRS, SRS and TRI Linked Data, spreadsheet data from ABT Associates on corporate ownership (as CSV), Open Street Maps content from the Web (Linked Data Cloud).
  21. If you have permissions, you can edit the underlying data stored in the database (an RDF triple store). Several different triple stores are supported by Callimachus. A triple store is effectively just a “library” to Callimachus -- as long as it supports the data standards (RDF, SPARQL), it doesn’t matter which one is used.
  22. Note the fixed name and added comment.
  23. A history of changes is kept. Note the change to the name and the added comment, along with the time/date and name of the user who made the edit.
  24. If you’re interested in the maturity of the RDF family of standards, here is the technology “layer cake”. The data exchange standards are mature and well defined. The world’s leading technology companies support RDF in their products, including Oracle 11g, IBM DB2 and EMC. The world’s leading search engines, including Google, Yahoo!, Bing (Microsoft) and Yandex, are displaying content with RDF (RDFa & RDFa Lite). That is why we’re joining leading governments worldwide to publish our valuable content as LOD.
  25. Your ability to move into the future will be ensured by publishing data to the Web. Use data exchange standards. Define URI policy, document it and help people to comply. Leverage existing vocabularies. Despite what you might think, you are probably talking about many of the same objects as someone else (people, organizations, assets, scientific terms, etc.), so use a shared vocabulary to realize the benefits of Linked Data.
  26. This presentation is licensed under a Creative Commons BY-SA license, allowing you to share and remix its contents as long as you give us attribution and share alike.
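
The notes above describe scripts that convert a CSV extract to RDF and can be re-run whenever the extract changes. A minimal sketch of such a script in Python follows, using only the standard library and emitting N-Triples. The column names (`id`, `name`), the base URI and the `Facility` class URI are hypothetical; the actual EPA models and scripts are not shown in the source. The `rdf:type` and `rdfs:label` properties are standard W3C vocabulary terms.

```python
import csv
import io

# Hypothetical URIs for illustration only; not EPA's published model.
BASE = "http://example.org/id/facility/"
FACILITY_CLASS = "http://example.org/def/Facility"
# Standard W3C vocabulary terms.
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"


def escape_literal(text):
    """Escape backslashes, quotes and line breaks for an N-Triples literal."""
    return (
        text.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )


def csv_to_ntriples(csv_text):
    """Turn each CSV row into two triples: a type statement and a label.

    Assumes an 'id' column whose values are safe to place in a URI
    and a 'name' column holding the human-readable label.
    """
    triples = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        subject = f"<{BASE}{row['id'].strip()}>"
        triples.append(f"{subject} <{RDF_TYPE}> <{FACILITY_CLASS}> .")
        label = escape_literal(row["name"].strip())
        triples.append(f'{subject} <{RDFS_LABEL}> "{label}" .')
    return triples


if __name__ == "__main__":
    sample = 'id,name\n1001,"Acme ""North"" Plant"\n'
    for line in csv_to_ntriples(sample):
        print(line)
```

Because the function is a pure transformation from CSV text to triples, it can be scheduled to run against each new extract without manual steps, which is the maintenance property the notes emphasize.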