SlideShare une entreprise Scribd logo
1  sur  44
Semantic web and linked data
     for data set publication
         Dave Reynolds, Epimorphics Ltd
                                @der42
Outline
   Background on linked data
   Roles in data set publishing
   Case study: Environment Agency
   Lessons
Linked data background
Linked data ...

    publishing data on the web ...

   ... to enable integration, linking and reuse
       across silos
Linked data
Apply the principles to the web to publication of data
The linked data web:
     is a global network of things
     each identified by a URI
     fetching a URI gives a set of statements   in RDF
     things connected by typed links
     open, anyone can say anything about anything else


Linked data is “data you can click on”
Example schools information
         http://education.data.gov.uk/id/school/401874
Example schools information
                 http://education.data.gov.uk/id/school/401874   a        School


                  label                              phase
                                  district                           “Secondary”
“Cardiff High School”

                                 “Cardiff”
Example schools information
                  http://education.data.gov.uk/id/school/401874               a        school:School



                                                          phase
                       label
                                      district                                school:PhaseOfEducation_Secondary
“Cardiff High School”

             http://statistics.data.gov.uk/id/local-authority-district/00PT   label          “Cardiff”
Example schools information
                  http://education.data.gov.uk/id/school/401874               rdf:type      school:School


                  rdfs:label                       school:phase

                                    school:district                                school:PhaseOfEducation_Secondary
“Cardiff High School”

             http://statistics.data.gov.uk/id/local-authority-district/00PT        rdfs:label     “Cardiff”
Example schools information
                  http://education.data.gov.uk/id/school/401874               rdf:type      school:School


                  rdfs:label                       school:phase

                                    school:district                                school:PhaseOfEducation_Secondary
“Cardiff High School”

             http://statistics.data.gov.uk/id/local-authority-district/00PT        label          “Cardiff”



             http://data.ordnancesurvey.co.uk/id/7000000000025484


     admingeo:ward
                                                       spatial:extent

                      admingeo:parish
                                                              GML: 310499.4 184176.6 310476.5 ...
Example schools information
                  http://education.data.gov.uk/id/school/401874               rdf:type      school:School


                  rdfs:label                       school:phase

                                    school:district                                school:PhaseOfEducation_Secondary
“Cardiff High School”

             http://statistics.data.gov.uk/id/local-authority-district/00PT        label          “Cardiff”

                                    owl:sameAs

             http://data.ordnancesurvey.co.uk/id/7000000000025484


     admingeo:ward
                                                       spatial:extent

                      admingeo:parish
                                                              GML: 310499.4 184176.6 310476.5 ...
Role in data set publication
   well suited to describing things
       schools, companies, animal species, music tracks, tv programmes ...

   what about datasets?
       environmental measurements, experimental results, statistical analyses ...
Approach 1 : Data catalogues
   treat the dataset as a single resource, identify with a URI
   provide metadata as linked data
       descriptive
       categorical
       technical and structural


Benefits?
       separate of metadata from resource & repository
       easy aggregation of metadata into catalogues
       schema-less enables use-specific annotations and links
       use of sharable category schemes and reference data
=> support for discovery
Approach 2 : Fine grain publication
   publish the data set itself as linked data
       entities, terms, individual records in data identified by URIs
       data set structure and ontologies linked from data
       still include dataset metadata


Benefits?
       all benefits of approach 1 to support discovery
       self-describing
       data slices addressable (trace back, provenance, annotation)
       integration across sets - reuse of terms for dimensions, units, values
       fine grained access
=> integration, comparison, context, data as a service
bathing water quality

                                              what we do...

                            start of season

                                  15th May                                  Press interest




                                               bathing season
what information                                                20-22 samples in 22weeks
is relevant to the public
about beaches
                                30th Sept
                            annual report
                    what       November
                    we do

                               December
how linkable data helps
             Tenby
             Tourist Information Centre
             Unit 2 , The Gateway Complex
             Tenby. Wales , SA70 7LT
             Tel: 01834 842 402
             Fax: 01834 845 439
             Email: tenby.tic@pembrokeshire.gov.uk




                                                     Photo by Skellig2008 (flickr)
Publishing the Bathing Water Quality data set

                             Bathing           Sampling          Zones Of              Assessment
  Vocabularies
                             Waters             Points           Influence                 s


                                                                          e.g. http://location.data.gov.uk/def/ef/SampingPoint



       URI Set
                             Bathing           Sampling               Zone Of
Reference Data               Waters             Points               Influence

                                                                e.g. http://location.data.gov.uk/so/ef/SamplingPoint/bwsp.eaew



                                            Assessme            http://environment.data.gov.uk/data/bathing-water-quality
 Observation
                                               nt
   Datasets
                              void:subset              void:subset

                                                       In-season
                                Annual
                                                         Weekly
            .../compliance     Complianc                               .../in-season
                                                       Assessme
                                  e
                                                           nt
Data cube vocabulary
   collaborative development
    sponsored by data.gov.uk
   simple, flexible vocabulary
   mirrors core information models from:
        SDMX (Statistical Data and Metadata eXchange)
        DDI (Data Documentation Initiative)
   extension to SCOVO vocabulary




image: dullhunk @ flickr
Data cube model
    A set of observations
     indexed by dimensions
     describing measures
     interpreted according to attributes
(e.g. region)
 dimension




                                measure(s)    attributes


                              • population   unit of measure = count
                                = 32,567     status = preliminary
                                             ...



                dimension
                (e.g. time)
Data cube vocabulary
1. Top level
   DataSet                        qb:DataStructureDefinition
                                                                         qb:component

       provenance and metadata                                         qb:sliceKey

       structure                   qb:structure

                                   qb:DataSet                      qb:SliceKey
                                                      qb:slice
                                                                 qb:sliceStructure
                                    qb:dataset
                                                      qb:Slice

                                                                    qb:subSlice



                                                       qb:observation

                                  qb:Observation
                                   dimension values
                                   measure value(s)
                                   attribute values
Data cube vocabulary
1. Top level
   DataSet                               qb:DataStructureDefinition
                                                                                qb:component

       provenance and metadata                                                qb:sliceKey

       structure                          qb:structure

   Observation                           qb:DataSet                      qb:SliceKey

       measured values, at dimensions                       qb:slice
                                                                        qb:sliceStructure
                                           qb:dataset
        with attributes                                      qb:Slice

       direct link to DataSet                                             qb:subSlice



                                                              qb:observation

                                         qb:Observation
                                          dimension values
                                          measure value(s)
                                          attribute values
Data cube vocabulary
1. Top level
   DataSet                               qb:DataStructureDefinition
                                                                                qb:component

       provenance and metadata                                                qb:sliceKey

       structure                          qb:structure

   Observation                           qb:DataSet                      qb:SliceKey

       measured values, at dimensions                       qb:slice
                                                                        qb:sliceStructure
                                           qb:dataset
        with attributes                                      qb:Slice

       direct link to DataSet                                             qb:subSlice


   Slice                                                     qb:observation

                                         qb:Observation
       optional grouping by fixing
        dimensions                        dimension values
                                          measure value(s)
                                          attribute values
       guide to presentation
       allows for abbreviated data
Data cube vocabulary
2. Data Structure Definition
   explicit definition of cube
                                       qb:DataSet
    structure, inline in the data                            qb:structure


   enables                         qb:DataStructureDefinition
       validation                                               qb:component

       visualization
       discovery                   qb:ComponentSpecification

       abbreviation                                      qb:componentRequired
                                                          qb:componentAttachment
                                                          qb:order

                                           qb:dimension

                                           qb:measure

                                           qb:attribute
Bathing Water Quality cubes
   measures
       total coliform count, entero virus count, ...
       sample classification
   dimensions
       sampling point
       sampling week
       sampling year
   attributes
       abnormal weather
Everything has a URI
                      Selected Lists and
                       Individual Bathing Waters
                      Lists and Individual
                       Assessments
                          In-Season or Annual
                           Compliance
                      Vocabulary Terms
                      Datasets (and subsets)
                      Presented as:
                          HTML, (for people)
                          JSON, XML, RDF and CSV
                           (for programs)
Data Platform and Applications


  Web of Linked Data




                       http://environment.data.gov.uk/lab/bwq-os.html
Outcomes
   bathing water quality information available
       as both data set and set of web APIs
       updated weekly (in season)
   third party applications to use and combine the data
   seed a web of environmental and location data
       reference identifiers can be reused for related information
       URI patterns designed to be compatible with INSPIRE
Wrapping up




image: erika g. @ flickr.com
Lessons
   importance of reference identifiers
   developer accessibility
       linked data API
   publish once, consume many ways
   importance of maintenance and QoS expectation
   reusable patterns:
       reusable vocabularies - Data Cube, org ...
       URI patterns
       provenance – OMPV and specializations
   incremental approach
Acknowledgements
   Alex Coley (Environment Agency)
       for slides 17, 18, and for sponsoring the bathing water quality
        data publication
   Stuart Williams
       developer of the bathing water application and slides 19,27,28
   John Sheridan (The National Archive)
       for sponsoring the development of data cube
   Richard Cyganiak, Jeni Tennison
       co-developers of the data cube vocabulary
fin.
                                         fin.




image: Christian Haugen @ flickr.com
Spare
Linked data principles
   Use URIs as names for things
   Use HTTP URIs so that people can look up those names
   When someone looks up a URI, provide useful
    information, using the standards (RDF*, SPARQL)
   Include links to other URIs, so that they can discover
    more things



             Pattern of application of semantic
                         web stack
Linked open data cloud: 2007




Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/
Linked open data cloud: 2009




Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/
Linked open data cloud: 2010




Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/
Accessing all this data
   link following
       HTTP GET, follow links, aggregate relevant statements
   query
       SPARQL
SPARQL
   core idea is pattern matching
       graph patterns with variables
       any subgraph which matches yields row of bindings
                  ont:districtAdministrative          rdfs:label
        ?school                                []                    “Cardiff”

   syntax based on Turtle syntax for RDF
   web API endpoints
   lots of power
       filters                    sub-queries           federated query
       optionals                  property chains       update
       named graphs               aggregation           construct
Accessing all this data
   link following
       HTTP GET, follow links, aggregate relevant statements
   query
       SPARQL
   linked data API
       RESTful API onto linked data resources
       simple query, usable without RDF stack, web dev friendly
       easy to layer visualizations and UIs on top
   third parties
       search engines and aggregators e.g. Sindice, sameAs.org
Semantic web layer cake
Data.gov.uk
visualizations on top of linked data
Data.gov.uk – linked datasets and APIs

Contenu connexe

Similaire à Using linked data for dataset publication

Learning Analytics & Linked Data – Opportunities, Challenges, Examples
Learning Analytics & Linked Data – Opportunities, Challenges, ExamplesLearning Analytics & Linked Data – Opportunities, Challenges, Examples
Learning Analytics & Linked Data – Opportunities, Challenges, ExamplesStefan Dietze
 
Putting Intelligence in Open Data - With examples in education
Putting Intelligence in Open Data - With examples in educationPutting Intelligence in Open Data - With examples in education
Putting Intelligence in Open Data - With examples in educationMathieu d'Aquin
 
Linked Data Hypercubes - Semtech London
Linked Data Hypercubes - Semtech LondonLinked Data Hypercubes - Semtech London
Linked Data Hypercubes - Semtech LondonDave Reynolds
 
Environmental Linked Data - Semtech Biz London
Environmental Linked Data - Semtech Biz LondonEnvironmental Linked Data - Semtech Biz London
Environmental Linked Data - Semtech Biz LondonAlex Coley
 
Introduction to linked data and the semantic web
Introduction to linked data and the semantic webIntroduction to linked data and the semantic web
Introduction to linked data and the semantic webDave Reynolds
 
Linked Data Hypercubes
Linked Data HypercubesLinked Data Hypercubes
Linked Data HypercubesDave Reynolds
 
KnowEscape workshop, OKCon 2013
KnowEscape workshop, OKCon 2013KnowEscape workshop, OKCon 2013
KnowEscape workshop, OKCon 2013Stefan Dietze
 
Linked Data for Federation of OER Data & Repositories
Linked Data for Federation of OER Data & RepositoriesLinked Data for Federation of OER Data & Repositories
Linked Data for Federation of OER Data & RepositoriesStefan Dietze
 
Modeling Data Life Cycles with PROV
Modeling Data Life Cycles with PROVModeling Data Life Cycles with PROV
Modeling Data Life Cycles with PROVEUDAT
 
Linked Data at the OU - the story so far
Linked Data at the OU - the story so farLinked Data at the OU - the story so far
Linked Data at the OU - the story so farEnrico Daga
 
Linked data introduction w exempel
Linked data introduction w exempelLinked data introduction w exempel
Linked data introduction w exempelKerstin Forsberg
 
Data Warehousing and Mining Data from Library and University Systems for Asse...
Data Warehousing and Mining Data from Library and University Systems for Asse...Data Warehousing and Mining Data from Library and University Systems for Asse...
Data Warehousing and Mining Data from Library and University Systems for Asse...Ray Schwartz
 
Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...
Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...
Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...Laurent Lefort
 
Information Intermediaries
Information IntermediariesInformation Intermediaries
Information IntermediariesDave Reynolds
 
Boundless Opportunity
Boundless OpportunityBoundless Opportunity
Boundless OpportunityRachel Frick
 
Using linked data and the semantic web - "powered by INSPIRE" conference pres...
Using linked data and the semantic web - "powered by INSPIRE" conference pres...Using linked data and the semantic web - "powered by INSPIRE" conference pres...
Using linked data and the semantic web - "powered by INSPIRE" conference pres...Alex Coley
 
Creating Visualizations with Linked Open Data
Creating Visualizations with Linked Open DataCreating Visualizations with Linked Open Data
Creating Visualizations with Linked Open DataAlvaro Graves
 
Linked Data at the Open University: From Technical Challenges to Organization...
Linked Data at the Open University: From Technical Challenges to Organization...Linked Data at the Open University: From Technical Challenges to Organization...
Linked Data at the Open University: From Technical Challenges to Organization...Mathieu d'Aquin
 
WWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
WWW2014 Tutorial: Online Learning & Linked Data - Lessons LearnedWWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
WWW2014 Tutorial: Online Learning & Linked Data - Lessons LearnedStefan Dietze
 

Similaire à Using linked data for dataset publication (20)

Learning Analytics & Linked Data – Opportunities, Challenges, Examples
Learning Analytics & Linked Data – Opportunities, Challenges, ExamplesLearning Analytics & Linked Data – Opportunities, Challenges, Examples
Learning Analytics & Linked Data – Opportunities, Challenges, Examples
 
Putting Intelligence in Open Data - With examples in education
Putting Intelligence in Open Data - With examples in educationPutting Intelligence in Open Data - With examples in education
Putting Intelligence in Open Data - With examples in education
 
Linked Data Hypercubes - Semtech London
Linked Data Hypercubes - Semtech LondonLinked Data Hypercubes - Semtech London
Linked Data Hypercubes - Semtech London
 
Environmental Linked Data - Semtech Biz London
Environmental Linked Data - Semtech Biz LondonEnvironmental Linked Data - Semtech Biz London
Environmental Linked Data - Semtech Biz London
 
Introduction to linked data and the semantic web
Introduction to linked data and the semantic webIntroduction to linked data and the semantic web
Introduction to linked data and the semantic web
 
Linked Data Hypercubes
Linked Data HypercubesLinked Data Hypercubes
Linked Data Hypercubes
 
KnowEscape workshop, OKCon 2013
KnowEscape workshop, OKCon 2013KnowEscape workshop, OKCon 2013
KnowEscape workshop, OKCon 2013
 
Linked Data for Federation of OER Data & Repositories
Linked Data for Federation of OER Data & RepositoriesLinked Data for Federation of OER Data & Repositories
Linked Data for Federation of OER Data & Repositories
 
Modeling Data Life Cycles with PROV
Modeling Data Life Cycles with PROVModeling Data Life Cycles with PROV
Modeling Data Life Cycles with PROV
 
Linked Data at the OU - the story so far
Linked Data at the OU - the story so farLinked Data at the OU - the story so far
Linked Data at the OU - the story so far
 
Icm sem tech_master
Icm sem tech_masterIcm sem tech_master
Icm sem tech_master
 
Linked data introduction w exempel
Linked data introduction w exempelLinked data introduction w exempel
Linked data introduction w exempel
 
Data Warehousing and Mining Data from Library and University Systems for Asse...
Data Warehousing and Mining Data from Library and University Systems for Asse...Data Warehousing and Mining Data from Library and University Systems for Asse...
Data Warehousing and Mining Data from Library and University Systems for Asse...
 
Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...
Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...
Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...
 
Information Intermediaries
Information IntermediariesInformation Intermediaries
Information Intermediaries
 
Boundless Opportunity
Boundless OpportunityBoundless Opportunity
Boundless Opportunity
 
Using linked data and the semantic web - "powered by INSPIRE" conference pres...
Using linked data and the semantic web - "powered by INSPIRE" conference pres...Using linked data and the semantic web - "powered by INSPIRE" conference pres...
Using linked data and the semantic web - "powered by INSPIRE" conference pres...
 
Creating Visualizations with Linked Open Data
Creating Visualizations with Linked Open DataCreating Visualizations with Linked Open Data
Creating Visualizations with Linked Open Data
 
Linked Data at the Open University: From Technical Challenges to Organization...
Linked Data at the Open University: From Technical Challenges to Organization...Linked Data at the Open University: From Technical Challenges to Organization...
Linked Data at the Open University: From Technical Challenges to Organization...
 
WWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
WWW2014 Tutorial: Online Learning & Linked Data - Lessons LearnedWWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
WWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
 

Dernier

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 

Dernier (20)

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 

Using linked data for dataset publication

  • 1. Semantic web and linked data for data set publication Dave Reynolds, Epimorphics Ltd @der42
  • 2. Outline  Background on linked data  Roles in data set publishing  Case study: Environment Agency  Lessons
  • 4. Linked data ... publishing data on the web ... ... to enable integration, linking and reuse across silos
  • 5. Linked data Apply the principles to the web to publication of data The linked data web:  is a global network of things  each identified by a URI  fetching a URI gives a set of statements in RDF  things connected by typed links  open, anyone can say anything about anything else Linked data is “data you can click on”
  • 6. Example schools information http://education.data.gov.uk/id/school/401874
  • 7. Example schools information http://education.data.gov.uk/id/school/401874 a School label phase district “Secondary” “Cardiff High School” “Cardiff”
  • 8. Example schools information http://education.data.gov.uk/id/school/401874 a school:School phase label district school:PhaseOfEducation_Secondary “Cardiff High School” http://statistics.data.gov.uk/id/local-authority-district/00PT label “Cardiff”
  • 9. Example schools information http://education.data.gov.uk/id/school/401874 rdf:type school:School rdfs:label school:phase school:district school:PhaseOfEducation_Secondary “Cardiff High School” http://statistics.data.gov.uk/id/local-authority-district/00PT rdfs:label “Cardiff”
  • 10. Example schools information http://education.data.gov.uk/id/school/401874 rdf:type school:School rdfs:label school:phase school:district school:PhaseOfEducation_Secondary “Cardiff High School” http://statistics.data.gov.uk/id/local-authority-district/00PT label “Cardiff” http://data.ordnancesurvey.co.uk/id/7000000000025484 admingeo:ward spatial:extent admingeo:parish GML: 310499.4 184176.6 310476.5 ...
  • 11. Example schools information http://education.data.gov.uk/id/school/401874 rdf:type school:School rdfs:label school:phase school:district school:PhaseOfEducation_Secondary “Cardiff High School” http://statistics.data.gov.uk/id/local-authority-district/00PT label “Cardiff” owl:sameAs http://data.ordnancesurvey.co.uk/id/7000000000025484 admingeo:ward spatial:extent admingeo:parish GML: 310499.4 184176.6 310476.5 ...
  • 12.
  • 13. Role in data set publication  well suited to describing things  schools, companies, animal species, music tracks, tv programmes ...  what about datasets?  environmental measurements, experimental results, statistical analyses ...
  • 14. Approach 1 : Data catalogues  treat the dataset as a single resource, identify with a URI  provide metadata as linked data  descriptive  categorical  technical and structural Benefits?  separate of metadata from resource & repository  easy aggregation of metadata into catalogues  schema-less enables use-specific annotations and links  use of sharable category schemes and reference data => support for discovery
  • 15. Approach 2 : Fine grain publication  publish the data set itself as linked data  entities, terms, individual records in data identified by URIs  data set structure and ontologies linked from data  still include dataset metadata Benefits?  all benefits of approach 1 to support discovery  self-describing  data slices addressable (trace back, provenance, annotation)  integration across sets - reuse of terms for dimensions, units, values  fine grained access => integration, comparison, context, data as a service
  • 16.
  • 17. bathing water quality what we do... start of season 15th May Press interest bathing season what information 20-22 samples in 22weeks is relevant to the public about beaches 30th Sept annual report what November we do December
  • 18. how linkable data helps Tenby Tourist Information Centre Unit 2 , The Gateway Complex Tenby. Wales , SA70 7LT Tel: 01834 842 402 Fax: 01834 845 439 Email: tenby.tic@pembrokeshire.gov.uk Photo by Skellig2008 (flickr)
  • 19. Publishing the Bathing Water Quality data set Bathing Sampling Zones Of Assessment Vocabularies Waters Points Influence s e.g. http://location.data.gov.uk/def/ef/SampingPoint URI Set Bathing Sampling Zone Of Reference Data Waters Points Influence e.g. http://location.data.gov.uk/so/ef/SamplingPoint/bwsp.eaew Assessme http://environment.data.gov.uk/data/bathing-water-quality Observation nt Datasets void:subset void:subset In-season Annual Weekly .../compliance Complianc .../in-season Assessme e nt
  • 20. Data cube vocabulary  collaborative development sponsored by data.gov.uk  simple, flexible vocabulary  mirrors core information models from:  SDMX (Statistical Data and Metadata eXchange)  DDI (Data Documentation Initiative)  extension to SCOVO vocabulary image: dullhunk @ flickr
  • 21. Data cube model A set of observations  indexed by dimensions  describing measures  interpreted according to attributes (e.g. region) dimension measure(s) attributes • population unit of measure = count = 32,567 status = preliminary ... dimension (e.g. time)
  • 22. Data cube vocabulary 1. Top level  DataSet qb:DataStructureDefinition qb:component  provenance and metadata qb:sliceKey  structure qb:structure qb:DataSet qb:SliceKey qb:slice qb:sliceStructure qb:dataset qb:Slice qb:subSlice qb:observation qb:Observation dimension values measure value(s) attribute values
  • 23. Data cube vocabulary 1. Top level  DataSet qb:DataStructureDefinition qb:component  provenance and metadata qb:sliceKey  structure qb:structure  Observation qb:DataSet qb:SliceKey  measured values, at dimensions qb:slice qb:sliceStructure qb:dataset with attributes qb:Slice  direct link to DataSet qb:subSlice qb:observation qb:Observation dimension values measure value(s) attribute values
  • 24. Data cube vocabulary 1. Top level  DataSet qb:DataStructureDefinition qb:component  provenance and metadata qb:sliceKey  structure qb:structure  Observation qb:DataSet qb:SliceKey  measured values, at dimensions qb:slice qb:sliceStructure qb:dataset with attributes qb:Slice  direct link to DataSet qb:subSlice  Slice qb:observation qb:Observation  optional grouping by fixing dimensions dimension values measure value(s) attribute values  guide to presentation  allows for abbreviated data
  • 25. Data cube vocabulary 2. Data Structure Definition  explicit definition of cube qb:DataSet structure, inline in the data qb:structure  enables qb:DataStructureDefinition  validation qb:component  visualization  discovery qb:ComponentSpecification  abbreviation qb:componentRequired qb:componentAttachment qb:order qb:dimension qb:measure qb:attribute
  • 26. Bathing Water Quality cubes  measures  total coliform count, entero virus count, ...  sample classification  dimensions  sampling point  sampling week  sampling year  attributes  abnormal weather
  • 27. Everything has a URI  Selected Lists and Individual Bathing Waters  Lists and Individual Assessments  In-Season or Annual Compliance  Vocabulary Terms  Datasets (and subsets)  Presented as:  HTML, (for people)  JSON, XML, RDF and CSV (for programs)
  • 28. Data Platform and Applications Web of Linked Data http://environment.data.gov.uk/lab/bwq-os.html
  • 29. Outcomes  bathing water quality information available  as both data set and set of web APIs  updated weekly (in season)  third party applications to use and combine the data  seed a web of environmental and location data  reference identifiers can be reused for related information  URI patterns designed to be compatible with INSPIRE
  • 30. Wrapping up image: erika g. @ flickr.com
  • 31. Lessons  importance of reference identifiers  developer accessibility  linked data API  publish once, consume many ways  importance of maintenance and QoS expectation  reusable patterns:  reusable vocabularies - Data Cube, org ...  URI patterns  provenance – OMPV and specializations  incremental approach
  • 32. Acknowledgements  Alex Coley (Environment Agency)  for slides 17, 18, and for sponsoring the bathing water quality data publication  Stuart Williams  developer of the bathing water application and slides 19,27,28  John Sheridan (The National Archive)  for sponsoring the development of data cube  Richard Cyganiak, Jeni Tennison  co-developers of the data cube vocabulary
  • 33. fin. fin. image: Christian Haugen @ flickr.com
  • 34. Spare
  • 35. Linked data principles  Use URIs as names for things  Use HTTP URIs so that people can look up those names  When someone looks up a URI, provide useful information, using the standards (RDF*, SPARQL)  Include links to other URIs, so that they can discover more things Pattern of application of semantic web stack
  • 36. Linked open data cloud: 2007 Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/
  • 37. Linked open data cloud: 2009 Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/
  • 38. Linked open data cloud: 2010 Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/
  • 39. Accessing all this data  link following  HTTP GET, follow links, aggregate relevant statements  query  SPARQL
  • 40. SPARQL  core idea is pattern matching  graph patterns with variables  any subgraph which matches yields row of bindings ont:districtAdministrative rdfs:label ?school [] “Cardiff”  syntax based on Turtle syntax for RDF  web API endpoints  lots of power  filters  sub-queries  federated query  optionals  property chains  update  named graphs  aggregation  construct
  • 41. Accessing all this data  link following  HTTP GET, follow links, aggregate relevant statements  query  SPARQL  linked data API  RESTful API onto linked data resources  simple query, usable without RDF stack, web dev friendly  easy to layer visualizations and UIs on top  third parties  search engines and aggregators e.g. Sindice, sameAs.org
  • 44. Data.gov.uk – linked datasets and APIs

Notes de l'éditeur

  1. Context about bathing water quality
  2. context