SlideShare une entreprise Scribd logo
1  sur  25
How RDFa works
How does RDFa work
   Session 2/4
An RDFa basics tutorial by Manu Sporny




http://www.youtube.com/watch?v=ldl0m-5zLz4&feature=player_embedded
Real metadata




Resource description framework
  ‣ RDF is a generic "way" of using definitive metadata with web resources.
  ‣ RDF describes "things" (entities, defined by uniform resource identifiers,
      URIs) by assigning properties and corresponding values—statements are
      known as "triples" consisting of [subject] [predicate] [object].
  ‣   The predicate URI usually references a term in a standard metadata
      vocabulary, resulting in unambiguous meaning.
  ‣   Any part of the triple can be a URI and URIs can point to other URIs that can
      be read using HTTP and extended (or related) in other web resources, thus a
      scalable model and very flexible.




             http://www.w3.org/TR/rdf-primer/   http://www.w3.org/RDF/
RDFa distiller




Extract RDF from HTML + RDFa
W3C   service to identify and list RDF from a web page
  ‣   using web address, local file or direct text inputs
  ‣   provides “clean” view of data hierarchy
  ‣   enables simple check on markup validation *and* intended meaning




                       http://www.w3.org/2007/08/pyRdfa/
Distilled RDFa page




<?xml version="1.0" encoding="utf-8"?>
<rdf:RDF
  xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
  xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
  xmlns:v="http://rdf.data-vocabulary.org/#"
  xmlns:xhv="http://www.w3.org/1999/xhtml/vocab#"
  xmlns:xml="http://www.w3.org/XML/1998/namespace"
>
  <v:Person>
    <v:nickname>mungeo</v:nickname>
    <v:address>
      <v:Address>
        <v:region>Ulster</v:region>
        <v:locality>Donegal</v:locality>
      </v:Address>
    </v:address>
    <v:url rdf:resource="http://www.platypusconsultancy.com"/>
    <v:name>George Munroe</v:name>
    <v:title>consultant trainer</v:title>
    <v:affiliation>Netskills</v:affiliation>
  </v:Person>
</rdf:RDF>




                                     RDFa
Validating the RDF extracted from an RDFa
file and exploring the data structure
  ‣ obtain a tabular description of entity classes and corresponding properties
  ‣ list inherent RDF triples
  ‣ visualise relationships with simple graphical representations




                        http://www.w3.org/RDF/Validator/
View RDF triples within RDF file




              RDFa
View graph of entities and properties




                RDFa
Practical exercise




Extracting RDF from RDFa
  ‣   open the W3C RDFa distiller service with your web browser
  ‣   use the “Distill by file upload” option, select your sample RDFa file and “Go!”
  ‣   view the page source of the output file and note the RDF tagging
  ‣   copy all of the page source of the output file (RDF)
  ‣   open the W3C RDF validation service with your web browser
  ‣   use the “Check by direct input” option and paste in your RDF source
  ‣   select “Triples and graph” from the “Display result options” section
  ‣   click the “Parse RDF” button and explore the RDF triples and graph produced
      by selecting “Grid view” and “SVG graph”
  ‣   if time permits repeat the above process with the W3C RDF primer sample
      file (http://www.w3.org/2006/07/SWD/RDFa/primer/alice-example.html)
      only use the validator directly by selecting the “Check by URI” option and
      supplying the distiller service address with the sample RDFa file in a query
      string as follows:
      http://www.w3.org/2007/08/pyRdfa/extract?uri=http://www.w3.org/2006/07/SWD/
      RDFa/primer/alice-example.html
Metadata vocabularies




The importance of shared metadata
Everyone (and every computer) must have a common understanding
of what particular entities and properties actually are
    ‣ concept of XML namespaces, used to make descriptions of entity classes and
        corresponding properties available via the web
    ‣   these descriptions supplied as RDF (or RDFa) files with a URL (URI)
And there’s more to it than just a flat list of entities and properties
   ‣ a real understanding involves being aware of the relationships between
        entity classes as well as what properties are associated with an entity
    ‣   possibly a very complex “ontology” but built very simply from triples where
        the object of one triple may be the subject of another




                                         RDFa
Namespaces




These namespaces are used to describe
entities in X number of documents
Namespaces (819 known namespaces)                   Number of documents

http://xmlns.com/foaf/0.1/                          1,261,416
http://purl.org/goodrelations/v1#                   645,210
http://blogs.yandex.ru/schema/foaf/                 584,781
http://www.w3.org/2002/07/owl#                      451,113
http://sites.wiwiss.fu-berlin.de/suhl/bi...         252,361
http://rdfs.org/sioc/ns#                            183,228
http://www.w3.org/2003/01/geo/wgs84_pos#            157,659
http://rdfs.org/sioc/types#                         117,621
http://semantic-mediawiki.org/swivt/1.0#            64,281
http://xmlns.com/foaf/spec/                         50,514




                              http://pingthesemanticweb.com/
Entities




The number of each entity type known by
pingthesemanticweb.com
Entity types (10,500 known entity types)              Count

http://xmlns.com/foaf/0.1/Person                      19,015,352
http://xmlns.com/foaf/0.1/Document                    5,115,311
http://purl.org/goodrelations/v1#Product...           1,418,899
http://purl.org/goodrelations/v1#UnitPri...           999,893
http://purl.org/goodrelations/v1#Busines...           753,812
http://purl.org/goodrelations/v1#Offerin...           644,652
http://purl.org/goodrelations/v1#TypeAnd...           626,684
http://blogs.yandex.ru/schema/foaf/Posts              585,188
http://www.w3.org/2002/07/owl#Ontology                446,664
http://purl.org/goodrelations/v1#Product...           374,456




                             http://pingthesemanticweb.com/
Exploring vocabularies




Commonly used metadata vocabularies
Google (person, organisation, review, event, recipe)
   ‣ http://rdf.data-vocabulary.org/
FOAF (Friend Of A Friend)
   ‣ http://xmlns.com/foaf/0.1/
GoodRelations (ecommerce)
   ‣ http://www.heppnetz.de/ontologies/goodrelations/v1.owl
Dublin Core (generic document)
   ‣ http://dublincore.org/2008/01/14/dcelements.rdf
Creative Commons (licensing)
   ‣ http://creativecommons.org/ns
SKOS (Simple Knowledge Organisation System)
   ‣ http://www.w3.org/2009/08/skos-reference/skos.rdf




                                     RDFa
Viewing complex vocabularies—ontologies
Use the OpenLink Data Explorer extension in Firefox




                        http://ode.openlinksw.com/
Exploring vocabularies




Digging into metadata vocabularies
Use the Firefox browser with OpenLink Data Explorer extension and/
or W3C RDF validator and grapher online service to view the following
vocabularies:
    ‣ Dublin Core at: http://dublincore.org/2008/01/14/dcelements.rdf
    ‣ GoodRelations at: http://www.heppnetz.de/ontologies/goodrelations/v1.owl
Use the pyRdfa distiller service to extract and view the RDF
vocabulary information from the following descriptions:
   ‣ FOAF at: http://xmlns.com/foaf/0.1/
   ‣ Creative Commons at: http://creativecommons.org/ns
RDFa tools




Summary of useful RDFa related tools
RDFa distiller (extract pure RDF from HTML + RDFa)
   ‣ http://www.w3.org/2007/08/pyRdfa/
   ‣ get RDF directly from http://example.com/sample.html using single address
      http://www.w3.org/2007/08/pyRdfa/extract?uri=http://example.com/sample.html

RDF validator and grapher
   ‣ http://www.w3.org/RDF/Validator/
OpenLink Data Explorer extension for Firefox
   ‣ https://addons.mozilla.org/en-US/firefox/addon/8062
List global namespaces and entities
    ‣ http://pingthesemanticweb.com/




                                       RDFa
How RDFa works
How RDFa works

Contenu connexe

Tendances

Linked data: spreading data over the web
Linked data: spreading data over the webLinked data: spreading data over the web
Linked data: spreading data over the web
shellac
 
Creating web applications with LODSPeaKr
Creating web applications with LODSPeaKrCreating web applications with LODSPeaKr
Creating web applications with LODSPeaKr
Alvaro Graves
 
Linked data for librarians
Linked data for librariansLinked data for librarians
Linked data for librarians
trevorthornton
 

Tendances (20)

GDG Meets U event - Big data & Wikidata - no lies codelab
GDG Meets U event - Big data & Wikidata -  no lies codelabGDG Meets U event - Big data & Wikidata -  no lies codelab
GDG Meets U event - Big data & Wikidata - no lies codelab
 
#sod14 - ok, è un endpoint SPARQL non facciamoci prendere dal panico
#sod14 - ok, è un endpoint SPARQL non facciamoci prendere dal panico#sod14 - ok, è un endpoint SPARQL non facciamoci prendere dal panico
#sod14 - ok, è un endpoint SPARQL non facciamoci prendere dal panico
 
쉽게 이해하는 LOD
쉽게 이해하는 LOD쉽게 이해하는 LOD
쉽게 이해하는 LOD
 
RDF Views of SQL Data Power Point Presentation - 1
RDF Views of SQL Data Power Point Presentation - 1RDF Views of SQL Data Power Point Presentation - 1
RDF Views of SQL Data Power Point Presentation - 1
 
Virtuoso Sponger - RDFizer Middleware for creating RDF from non RDF Data Sources
Virtuoso Sponger - RDFizer Middleware for creating RDF from non RDF Data SourcesVirtuoso Sponger - RDFizer Middleware for creating RDF from non RDF Data Sources
Virtuoso Sponger - RDFizer Middleware for creating RDF from non RDF Data Sources
 
Introduction to RDFa
Introduction to RDFaIntroduction to RDFa
Introduction to RDFa
 
Linked data: spreading data over the web
Linked data: spreading data over the webLinked data: spreading data over the web
Linked data: spreading data over the web
 
FOAF
FOAFFOAF
FOAF
 
Poster - Completeness Statements about RDF Data Sources and Their Use for Qu...
Poster - Completeness Statements about RDF Data Sources and Their Use for Qu...Poster - Completeness Statements about RDF Data Sources and Their Use for Qu...
Poster - Completeness Statements about RDF Data Sources and Their Use for Qu...
 
Saveface - Save your Facebook content as RDF data
Saveface - Save your Facebook content as RDF dataSaveface - Save your Facebook content as RDF data
Saveface - Save your Facebook content as RDF data
 
Introducing JSONpedia
Introducing JSONpediaIntroducing JSONpedia
Introducing JSONpedia
 
RDF Transformations
RDF TransformationsRDF Transformations
RDF Transformations
 
Creating web applications with LODSPeaKr
Creating web applications with LODSPeaKrCreating web applications with LODSPeaKr
Creating web applications with LODSPeaKr
 
4 sw architectures and sparql
4 sw architectures and sparql4 sw architectures and sparql
4 sw architectures and sparql
 
A Little SPARQL in your Analytics
A Little SPARQL in your AnalyticsA Little SPARQL in your Analytics
A Little SPARQL in your Analytics
 
Programming with LOD
Programming with LODProgramming with LOD
Programming with LOD
 
DHWI Linked Open Data - Show and Tell
DHWI Linked Open Data - Show and TellDHWI Linked Open Data - Show and Tell
DHWI Linked Open Data - Show and Tell
 
XSPARQL Tutorial
XSPARQL TutorialXSPARQL Tutorial
XSPARQL Tutorial
 
Linked data for librarians
Linked data for librariansLinked data for librarians
Linked data for librarians
 
Exploring and using the Semantic Web - SSSW09 tutorial
Exploring and using the Semantic Web - SSSW09 tutorialExploring and using the Semantic Web - SSSW09 tutorial
Exploring and using the Semantic Web - SSSW09 tutorial
 

En vedette

Digital storytelling for public engagement
Digital storytelling for public engagementDigital storytelling for public engagement
Digital storytelling for public engagement
JISC Netskills
 

En vedette (9)

A Tale of Two Cities (Neil Currant)
A Tale of Two Cities (Neil Currant)A Tale of Two Cities (Neil Currant)
A Tale of Two Cities (Neil Currant)
 
Introducing digital storytelling
Introducing digital storytellingIntroducing digital storytelling
Introducing digital storytelling
 
Web tools for administrators
Web tools for administratorsWeb tools for administrators
Web tools for administrators
 
Lisa Gray (JISC) ePortfolios - October 2012
Lisa Gray (JISC) ePortfolios - October 2012Lisa Gray (JISC) ePortfolios - October 2012
Lisa Gray (JISC) ePortfolios - October 2012
 
A quick guide to taking awesome photos
A quick guide to taking awesome photosA quick guide to taking awesome photos
A quick guide to taking awesome photos
 
Digital storytelling for public engagement
Digital storytelling for public engagementDigital storytelling for public engagement
Digital storytelling for public engagement
 
Digital maps & data mash ups
Digital maps & data mash upsDigital maps & data mash ups
Digital maps & data mash ups
 
Digital Storytelling (Changing the Learner Landscape event)
Digital Storytelling (Changing the Learner Landscape event)Digital Storytelling (Changing the Learner Landscape event)
Digital Storytelling (Changing the Learner Landscape event)
 
Engage 2014 going dragon hunting
Engage 2014   going dragon huntingEngage 2014   going dragon hunting
Engage 2014 going dragon hunting
 

Similaire à How RDFa works

SemanticWeb Nuts 'n Bolts
SemanticWeb Nuts 'n BoltsSemanticWeb Nuts 'n Bolts
SemanticWeb Nuts 'n Bolts
Rinke Hoekstra
 
LITA 2010: The Linked Library Data Cloud: it's time to stop think and start l...
LITA 2010: The Linked Library Data Cloud: it's time to stop think and start l...LITA 2010: The Linked Library Data Cloud: it's time to stop think and start l...
LITA 2010: The Linked Library Data Cloud: it's time to stop think and start l...
Ross Singer
 
Linked data and rdf
Linked  data and rdfLinked  data and rdf
Linked data and rdf
Daniel Nüst
 
Deploying PHP applications using Virtuoso as Application Server
Deploying PHP applications using Virtuoso as Application ServerDeploying PHP applications using Virtuoso as Application Server
Deploying PHP applications using Virtuoso as Application Server
webhostingguy
 
Consuming Linked Data 4/5 Semtech2011
Consuming Linked Data 4/5 Semtech2011Consuming Linked Data 4/5 Semtech2011
Consuming Linked Data 4/5 Semtech2011
Juan Sequeda
 
Publishing Linked Data 3/5 Semtech2011
Publishing Linked Data 3/5 Semtech2011Publishing Linked Data 3/5 Semtech2011
Publishing Linked Data 3/5 Semtech2011
Juan Sequeda
 
Linked Open Data Fundamentals for Libraries, Archives and Museums
Linked Open Data Fundamentals for Libraries, Archives and MuseumsLinked Open Data Fundamentals for Libraries, Archives and Museums
Linked Open Data Fundamentals for Libraries, Archives and Museums
trevorthornton
 

Similaire à How RDFa works (20)

SemanticWeb Nuts 'n Bolts
SemanticWeb Nuts 'n BoltsSemanticWeb Nuts 'n Bolts
SemanticWeb Nuts 'n Bolts
 
SemWeb Fundamentals - Info Linking & Layering in Practice
SemWeb Fundamentals - Info Linking & Layering in PracticeSemWeb Fundamentals - Info Linking & Layering in Practice
SemWeb Fundamentals - Info Linking & Layering in Practice
 
RDFauthor (EKAW)
RDFauthor (EKAW)RDFauthor (EKAW)
RDFauthor (EKAW)
 
LITA 2010: The Linked Library Data Cloud: it's time to stop think and start l...
LITA 2010: The Linked Library Data Cloud: it's time to stop think and start l...LITA 2010: The Linked Library Data Cloud: it's time to stop think and start l...
LITA 2010: The Linked Library Data Cloud: it's time to stop think and start l...
 
Data in RDF
Data in RDFData in RDF
Data in RDF
 
Getting Started With The Talis Platform
Getting Started With The Talis PlatformGetting Started With The Talis Platform
Getting Started With The Talis Platform
 
Linked data HHS 2015
Linked data HHS 2015Linked data HHS 2015
Linked data HHS 2015
 
SUMMER SCHOOL LEX 2014 - RDF + SPARQL querying the web of (lex)data
SUMMER SCHOOL LEX 2014 - RDF + SPARQL querying the web of (lex)dataSUMMER SCHOOL LEX 2014 - RDF + SPARQL querying the web of (lex)data
SUMMER SCHOOL LEX 2014 - RDF + SPARQL querying the web of (lex)data
 
Hacia la Internet del Futuro: Web Semántica y Open Linked Data, Parte 2
Hacia la Internet del Futuro: Web Semántica y Open Linked Data, Parte 2Hacia la Internet del Futuro: Web Semántica y Open Linked Data, Parte 2
Hacia la Internet del Futuro: Web Semántica y Open Linked Data, Parte 2
 
Biodiversity Informatics on the Semantic Web
Biodiversity Informatics on the Semantic WebBiodiversity Informatics on the Semantic Web
Biodiversity Informatics on the Semantic Web
 
Danbri Drupalcon Export
Danbri Drupalcon ExportDanbri Drupalcon Export
Danbri Drupalcon Export
 
Linked data and rdf
Linked  data and rdfLinked  data and rdf
Linked data and rdf
 
Sparql a simple knowledge query
Sparql  a simple knowledge querySparql  a simple knowledge query
Sparql a simple knowledge query
 
Deploying PHP applications using Virtuoso as Application Server
Deploying PHP applications using Virtuoso as Application ServerDeploying PHP applications using Virtuoso as Application Server
Deploying PHP applications using Virtuoso as Application Server
 
Consuming Linked Data 4/5 Semtech2011
Consuming Linked Data 4/5 Semtech2011Consuming Linked Data 4/5 Semtech2011
Consuming Linked Data 4/5 Semtech2011
 
Quick Introduction to the Semantic Web, RDFa & Microformats
Quick Introduction to the Semantic Web, RDFa & MicroformatsQuick Introduction to the Semantic Web, RDFa & Microformats
Quick Introduction to the Semantic Web, RDFa & Microformats
 
Adventures in Linked Data Land (presentation by Richard Light)
Adventures in Linked Data Land (presentation by Richard Light)Adventures in Linked Data Land (presentation by Richard Light)
Adventures in Linked Data Land (presentation by Richard Light)
 
Publishing Linked Data 3/5 Semtech2011
Publishing Linked Data 3/5 Semtech2011Publishing Linked Data 3/5 Semtech2011
Publishing Linked Data 3/5 Semtech2011
 
Semantic web and Drupal: an introduction
Semantic web and Drupal: an introductionSemantic web and Drupal: an introduction
Semantic web and Drupal: an introduction
 
Linked Open Data Fundamentals for Libraries, Archives and Museums
Linked Open Data Fundamentals for Libraries, Archives and MuseumsLinked Open Data Fundamentals for Libraries, Archives and Museums
Linked Open Data Fundamentals for Libraries, Archives and Museums
 

Plus de JISC Netskills

QR codes - the virtual poking into the real
QR codes - the virtual poking into the realQR codes - the virtual poking into the real
QR codes - the virtual poking into the real
JISC Netskills
 
Green ICT in London HEIs
Green ICT in London HEIsGreen ICT in London HEIs
Green ICT in London HEIs
JISC Netskills
 

Plus de JISC Netskills (20)

Embedding Mahara at Birmingham City University
Embedding Mahara at Birmingham City UniversityEmbedding Mahara at Birmingham City University
Embedding Mahara at Birmingham City University
 
Digital storytelling for the flipped classroom
Digital storytelling for the flipped classroomDigital storytelling for the flipped classroom
Digital storytelling for the flipped classroom
 
Using free web based tools for collaborative e learning
Using free web based tools for collaborative e learningUsing free web based tools for collaborative e learning
Using free web based tools for collaborative e learning
 
QR codes - the virtual poking into the real
QR codes - the virtual poking into the realQR codes - the virtual poking into the real
QR codes - the virtual poking into the real
 
Greenevents2
Greenevents2Greenevents2
Greenevents2
 
Greeningthedatacentre
GreeningthedatacentreGreeningthedatacentre
Greeningthedatacentre
 
Suste tech
Suste techSuste tech
Suste tech
 
Proco2
Proco2Proco2
Proco2
 
Kit catalogue
Kit catalogueKit catalogue
Kit catalogue
 
Virtually sustainable
Virtually sustainableVirtually sustainable
Virtually sustainable
 
Ucare
UcareUcare
Ucare
 
Stor c
Stor cStor c
Stor c
 
Recso
RecsoRecso
Recso
 
Paws
PawsPaws
Paws
 
Open to-change
Open to-changeOpen to-change
Open to-change
 
Ibuilding
IbuildingIbuilding
Ibuilding
 
Heatandlight
HeatandlightHeatandlight
Heatandlight
 
Green in-silico
Green in-silicoGreen in-silico
Green in-silico
 
Green ICT in London HEIs
Green ICT in London HEIsGreen ICT in London HEIs
Green ICT in London HEIs
 
ER4SeR
ER4SeRER4SeR
ER4SeR
 

Dernier

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Dernier (20)

🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 

How RDFa works

  • 2. How does RDFa work Session 2/4
  • 3. An RDFa basics tutorial by Manu Sporny http://www.youtube.com/watch?v=ldl0m-5zLz4&feature=player_embedded
  • 4. Real metadata Resource description framework ‣ RDF is a generic "way" of using definitive metadata with web resources. ‣ RDF describes "things" (entities, defined by uniform resource identifiers, URIs) by assigning properties and corresponding values—statements are known as "triples" consisting of [subject] [predicate] [object]. ‣ The predicate URI usually references a term in a standard metadata vocabulary, resulting in unambiguous meaning. ‣ Any part of the triple can be a URI and URIs can point to other URIs that can be read using HTTP and extended (or related) in other web resources, thus a scalable model and very flexible. http://www.w3.org/TR/rdf-primer/ http://www.w3.org/RDF/
  • 5. RDFa distiller Extract RDF from HTML + RDFa W3C service to identify and list RDF from a web page ‣ using web address, local file or direct text inputs ‣ provides “clean” view of data hierarchy ‣ enables simple check on markup validation *and* intended meaning http://www.w3.org/2007/08/pyRdfa/
  • 6. Distilled RDFa page <?xml version="1.0" encoding="utf-8"?> <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" xmlns:v="http://rdf.data-vocabulary.org/#" xmlns:xhv="http://www.w3.org/1999/xhtml/vocab#" xmlns:xml="http://www.w3.org/XML/1998/namespace" > <v:Person> <v:nickname>mungeo</v:nickname> <v:address> <v:Address> <v:region>Ulster</v:region> <v:locality>Donegal</v:locality> </v:Address> </v:address> <v:url rdf:resource="http://www.platypusconsultancy.com"/> <v:name>George Munroe</v:name> <v:title>consultant trainer</v:title> <v:affiliation>Netskills</v:affiliation> </v:Person> </rdf:RDF> RDFa
  • 7. Validating the RDF extracted from an RDFa file and exploring the data structure ‣ obtain a tabular description of entity classes and corresponding properties ‣ list inherent RDF triples ‣ visualise relationships with simple graphical representations http://www.w3.org/RDF/Validator/
  • 8. View RDF triples within RDF file RDFa
  • 9. View graph of entities and properties RDFa
  • 10. Practical exercise Extracting RDF from RDFa ‣ open the W3C RDFa distiller service with your web browser ‣ use the “Distill by file upload” option, select your sample RDFa file and “Go!” ‣ view the page source of the output file and note the RDF tagging ‣ copy all of the page source of the output file (RDF) ‣ open the W3C RDF validation service with your web browser ‣ use the “Check by direct input” option and paste in your RDF source ‣ select “Triples and graph” from the “Display result options” section ‣ click the “Parse RDF” button and explore the RDF triples and graph produced by selecting “Grid view” and “SVG graph” ‣ if time permits repeat the above process with the W3C RDF primer sample file (http://www.w3.org/2006/07/SWD/RDFa/primer/alice-example.html) only use the validator directly by selecting the “Check by URI” option and supplying the distiller service address with the sample RDFa file in a query string as follows: http://www.w3.org/2007/08/pyRdfa/extract?uri=http://www.w3.org/2006/07/SWD/ RDFa/primer/alice-example.html
  • 11. Metadata vocabularies The importance of shared metadata Everyone (and every computer) must have a common understanding of what particular entities and properties actually are ‣ concept of XML namespaces, used to make descriptions of entity classes and corresponding properties available via the web ‣ these descriptions supplied as RDF (or RDFa) files with a URL (URI) And there’s more to it than just a flat list of entities and properties ‣ a real understanding involves being aware of the relationships between entity classes as well as what properties are associated with an entity ‣ possibly a very complex “ontology” but built very simply from triples where the object of one triple may be the subject of another RDFa
  • 12. Namespaces These namespaces are used to describe entities in X number of documents Namespaces (819 known namespaces) Number of documents http://xmlns.com/foaf/0.1/ 1,261,416 http://purl.org/goodrelations/v1# 645,210 http://blogs.yandex.ru/schema/foaf/ 584,781 http://www.w3.org/2002/07/owl# 451,113 http://sites.wiwiss.fu-berlin.de/suhl/bi... 252,361 http://rdfs.org/sioc/ns# 183,228 http://www.w3.org/2003/01/geo/wgs84_pos# 157,659 http://rdfs.org/sioc/types# 117,621 http://semantic-mediawiki.org/swivt/1.0# 64,281 http://xmlns.com/foaf/spec/ 50,514 http://pingthesemanticweb.com/
  • 13. Entities The number of each entity type known by pingthesemanticweb.com Entity types (10,500 known entity types) Count http://xmlns.com/foaf/0.1/Person 19,015,352 http://xmlns.com/foaf/0.1/Document 5,115,311 http://purl.org/goodrelations/v1#Product... 1,418,899 http://purl.org/goodrelations/v1#UnitPri... 999,893 http://purl.org/goodrelations/v1#Busines... 753,812 http://purl.org/goodrelations/v1#Offerin... 644,652 http://purl.org/goodrelations/v1#TypeAnd... 626,684 http://blogs.yandex.ru/schema/foaf/Posts 585,188 http://www.w3.org/2002/07/owl#Ontology 446,664 http://purl.org/goodrelations/v1#Product... 374,456 http://pingthesemanticweb.com/
  • 14. Exploring vocabularies Commonly used metadata vocabularies Google (person, organisation, review, event, recipe) ‣ http://rdf.data-vocabulary.org/ FOAF (Friend Of A Friend) ‣ http://xmlns.com/foaf/0.1/ GoodRelations (ecommerce) ‣ http://www.heppnetz.de/ontologies/goodrelations/v1.owl Dublin Core (generic document) ‣ http://dublincore.org/2008/01/14/dcelements.rdf Creative Commons (licensing) ‣ http://creativecommons.org/ns SKOS (Simple Knowledge Organisation System) ‣ http://www.w3.org/2009/08/skos-reference/skos.rdf RDFa
  • 15. Viewing complex vocabularies—ontologies Use the OpenLink Data Explorer extension in Firefox http://ode.openlinksw.com/
  • 16.
  • 17.
  • 18.
  • 19.
  • 20.
  • 21.
  • 22. Exploring vocabularies Digging into metadata vocabularies Use the Firefox browser with OpenLink Data Explorer extension and/ or W3C RDF validator and grapher online service to view the following vocabularies: ‣ Dublin Core at: http://dublincore.org/2008/01/14/dcelements.rdf ‣ GoodRelations at: http://www.heppnetz.de/ontologies/goodrelations/v1.owl Use the pyRdfa distiller service to extract and view the RDF vocabulary information from the following descriptions: ‣ FOAF at: http://xmlns.com/foaf/0.1/ ‣ Creative Commons at: http://creativecommons.org/ns
  • 23. RDFa tools Summary of useful RDFa related tools RDFa distiller (extract pure RDF from HTML + RDFa) ‣ http://www.w3.org/2007/08/pyRdfa/ ‣ get RDF directly from http://example.com/sample.html using single address http://www.w3.org/2007/08/pyRdfa/extract?uri=http://example.com/sample.html RDF validator and grapher ‣ http://www.w3.org/RDF/Validator/ OpenLink Data Explorer extension for Firefox ‣ https://addons.mozilla.org/en-US/firefox/addon/8062 List global namespaces and entities ‣ http://pingthesemanticweb.com/ RDFa

Notes de l'éditeur