vision

txt2rdf

grounding

Can Documents be Linked Data?
Kate Byrne, School of Informatics, University of Edinburgh
CIGS LOD Workshop

18th November 2013

1  vision: The semantic web vision
2  txt2rdf: Extracting structured knowledge from free text
3  grounding: Respect for authority, or, Why we need ontologies

The semantic web vision

W3C RDF Concepts, 2002 draft
“RDF ... allows anyone to say anything about anything.”

Tim Berners-Lee, 2006
“The day-to-day mechanisms of trade, bureaucracy and our daily
lives will be handled by machines talking to machines, leaving
humans to provide the inspiration and intuition.”

Tim Berners-Lee, 2009
“The web as I envisaged it, we have not seen it yet.”

Simple declarative sentences
“In a hole in the ground there lived a hobbit. Not a nasty, dirty,
wet hole, filled with the ends of worms and an oozy smell, nor yet
a dry, bare, sandy hole with nothing in it to sit down on or to eat:
it was a hobbit-hole, and that means comfort.”

hobbit − lives in − hole
hole − located in − the ground
hole − does not have − nastiness
hole − has type − hobbit hole
hobbit hole − has characteristic − comfort
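
The hobbit statements can be held as plain subject, predicate, object tuples. The sketch below is only an illustration: the choice of Python tuples and the helper name `describe` are assumptions, and so is the reading that "comfort" attaches to "hobbit hole" rather than to "hole".

```python
# Each statement is a (subject, predicate, object) tuple.
# Attaching "comfort" to "hobbit hole" is an assumed reading of the diagram.
triples = [
    ("hobbit", "lives in", "hole"),
    ("hole", "located in", "the ground"),
    ("hole", "does not have", "nastiness"),
    ("hole", "has type", "hobbit hole"),
    ("hobbit hole", "has characteristic", "comfort"),
]


def describe(node, triples):
    """Return every (predicate, object) pair whose subject is `node`."""
    return [(p, o) for s, p, o in triples if s == node]


if __name__ == "__main__":
    for predicate, obj in describe("hole", triples):
        print("hole", predicate, obj)
```

Because an object in one tuple can be the subject of another, following the tuples from "hobbit" walks the same graph the slide draws.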

A lot of information is in textual form!

Nouns and verbs
subject − predicate − object
Subjects and objects are nouns (hobbit, hole, the ground); predicates are verbs (lives in, located in).

Extracting structured knowledge from free text

fancy NLP processing and RDFisation

Natural Language Processing pipeline

Text documents
Pre-processing: tokenise; sentence and para split; POS tag
Named Entity Recognition: multi-word tokens and features; trained NER model; list of NEs and classes; attach siteids
Relation Extraction: set of NE pairs and features; trained RE model; list of relations and classes; remove unwanted relations
RDF translation: generate triples
Graph of triples
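
The stages of the pipeline can be sketched in a few lines of Python. This is a toy under loud assumptions: the trained NER and RE models are replaced by small hand-written lookup tables (`GAZETTEER`, `RELATION_FOR`), POS tagging and siteid attachment are left out, and every function name is hypothetical. The sample sentence is a shortened version of a sentence from the site 20 record.

```python
import re

# Hypothetical lookup standing in for the trained NER model.
GAZETTEER = {"Sands of Breckon": "PLACE", "excavation": "EVENT"}

# Hypothetical table standing in for the trained RE model:
# which predicate links a pair of entity classes.
RELATION_FOR = {("EVENT", "PLACE"): "hasLocation", ("EVENT", "DATE"): "hasDate"}


def split_sentences(text):
    """Pre-processing: naive sentence split on end punctuation."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokenise(sentence):
    """Pre-processing: split into word and punctuation tokens."""
    return re.findall(r"\w+|[^\w\s]", sentence)


def recognise_entities(sentence):
    """NER stand-in: gazetteer lookup plus four-digit years as dates."""
    lowered = sentence.lower()
    found = [(name, cls) for name, cls in GAZETTEER.items() if name.lower() in lowered]
    found += [(tok, "DATE") for tok in tokenise(sentence)
              if re.fullmatch(r"1\d{3}|20\d{2}", tok)]
    return found


def extract_relations(entities):
    """RE stand-in: pair up entities and keep pairs the table knows."""
    relations = []
    for subj, subj_cls in entities:
        for obj, obj_cls in entities:
            predicate = RELATION_FOR.get((subj_cls, obj_cls))
            if predicate:
                relations.append((subj, predicate, obj))
    return relations


def pipeline(text):
    """Run every sentence through NER and RE and collect the triples."""
    triples = []
    for sentence in split_sentences(text):
        triples.extend(extract_relations(recognise_entities(sentence)))
    return triples


text = ("Field survey and excavation was carried out at the "
        "Sands of Breckon between 1982 and 1983.")

if __name__ == "__main__":
    for triple in pipeline(text):
        print(triple)
```

A real system would learn both lookup tables from annotated text; the point here is only the order of the stages and the shape of the data passed between them.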

Named entities and relations

site 20
Evidence of a quartz knapping site was found within the confines of the stone
circle, and in conjunction with several structures within the inner ring,
strongly suggests a domestic site.
Besides the quartz implements and corresponding waste, several other artifacts of local
origin occurred including a split pebble axe of greenstone with Shetland Early
Bronze Age affinities. B Beveridge, 1972.
Field survey and excavation, as a response to continual wind and marine
erosion, was carried out at the Sands of Breckon between
1982 and 1983.
HP50NW 11.00 was recorded as a stone settings surrounded by
occupational debris (Site 22). Excavation revealed midden deposits of an
early Iron Age date and a surface scatter of artefacts of mixed dates. The
stone settings were tentatively interpreted as the basal stones of long
cists.
Historic Scotland Archive Project (SW) 2002.

Converting text relations to RDF – 1
site 20

site20 − hasEvent − excavationX
excavationX − hasLocation − SandsOfBreckon
excavationX − hasDate − 1982
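
As a rough illustration, the three relations can be written out as N-Triples with nothing but string formatting. The namespace `http://example.org/` and the function name `to_ntriples` are assumptions made for this sketch, and the year is treated as a resource rather than a literal, which is also an assumption.

```python
BASE = "http://example.org/"  # hypothetical namespace, not from the slides

triples = [
    ("site20", "hasEvent", "excavationX"),
    ("excavationX", "hasLocation", "SandsOfBreckon"),
    ("excavationX", "hasDate", "1982"),
]


def to_ntriples(triples, base=BASE):
    """Serialise (s, p, o) names as N-Triples lines, one statement per line."""
    return "\n".join(f"<{base}{s}> <{base}{p}> <{base}{o}> ." for s, p, o in triples)


if __name__ == "__main__":
    print(to_ntriples(triples))
```

Each output line is one complete statement, so the file can be concatenated with other N-Triples files without any further merging step.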

Converting text relations to RDF – 2
site20 − hasEvent − excavationX
excavationX − hasLocation − SandsOfBreckon
excavationX − hasDate − 1982

[Graph: these relations expressed as RDF.
Nodes: siteid:site20, event:excavation20w158, event:excavation, date:1982, sitetype:stone+settings20w179, sitename:sands+of+breckon, address:hp50nw+11.00, address:breckon.
Edge labels: rdf:type, :hasEvent, :hasPeriod, :hasClassn, :hasLocation (three edges).]
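
The node names in the graph look like lowercased labels with spaces replaced by "+" and a prefix naming the kind of thing. Assuming that pattern holds (it is inferred from the node names, not stated in the talk), a local name could be minted as follows; the function name `mint` is hypothetical.

```python
import re


def mint(prefix, label):
    """Build a local node name: lowercase the label, join words with '+'."""
    local = re.sub(r"\s+", "+", label.strip().lower())
    return f"{prefix}:{local}"


if __name__ == "__main__":
    print(mint("sitename", "Sands of Breckon"))
    print(mint("address", "HP50NW 11.00"))
```

Names minted this way are only unique within one dataset, which is part of why linking them to other graphs is not automatic.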
Let’s remind ourselves what’s the point of Linked Data

archaeological site archive
siteid: 47919
sitename: Cairnpapple
classification: Cairn, henge
site number: NS97SE 16
A complex site on the summit of Cairnpapple Hill excavated by Piggot in 1947...

museum database
objectid: X.EP 167
find spot: Cairnpapple
This stone flake from the cutting edge of a ground stone axehead was found at Cairnpapple in West Lothian. The stone is from...

[Graph: RDF graph built from the two records.
Nodes: :Siteid#site47919, Classn/Sitetype#cairn%20+henge, Id#ns97se+16, :Loc/Sitename#cairnpapple, :Event#excavated47919w10, :Agent/Person#piggot, :Time/Date#1947, :Loc/Place#cairnpapple+hill, :Objectid#x.ep+167, :Classn/Objtype#axe+flake, :Loc/Place#west+lothian.
Edge labels: :hasClassn (two edges), :hasId, :hasLocation (three edges), :hasEvent, :hasAgent, :hasPeriod, :hasFindSpot.]

But linking Linked Data is actually pretty hard

Direct link means spotting identical node in separate graph
How? String matching? Clues from context?
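
To make the string-matching option concrete, here is a minimal sketch that compares labels from the two sources with Python's standard `difflib`. The label lists, the threshold of 0.8 and the function name `candidate_links` are all assumptions chosen for illustration; the talk does not say which matching method was used.

```python
from difflib import SequenceMatcher

# Labels taken from the two example records; the grouping is illustrative.
archive_labels = ["Cairnpapple", "Cairnpapple Hill", "Cairn, henge"]
museum_labels = ["Cairnpapple", "West Lothian", "axe flake"]


def candidate_links(labels_a, labels_b, threshold=0.8):
    """Return (a, b, score) for label pairs whose similarity meets the threshold."""
    pairs = []
    for a in labels_a:
        for b in labels_b:
            score = SequenceMatcher(None, a.lower(), b.lower()).ratio()
            if score >= threshold:
                pairs.append((a, b, round(score, 2)))
    # Highest-scoring candidates first.
    return sorted(pairs, key=lambda pair: -pair[2])


if __name__ == "__main__":
    for a, b, score in candidate_links(archive_labels, museum_labels):
        print(f"{a!r} ~ {b!r}: {score}")
```

Depending on the threshold, near matches such as "Cairnpapple Hill" can surface next to the exact match, which is where clues from context would have to decide.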
Using LOD cloud “Authority Nodes” as intermediaries

Grounding local URIs against “authority” nodes is the next big challenge!

Grounding site20 against Monument Thesaurus
[Graph: the site20 graph linked to Monument Thesaurus concepts.
Thesaurus nodes: sitetype:, sitetype:religious+ritual+and+funerary, sitetype:standing+stone, sitetype:stone+circle, sitetype:stone+row, sitetype:stone+setting.
Literals: "stone setting", "An arrangement of two or more standing stones".
Local nodes: siteid:site20, sitetype:stone+settings20w179, event:excavation20w158, event:excavation, date:1982, sitename:sands+of+breckon, address:hp50nw+11.01+hp+5304+0519, address:breckon.
Edge labels: skos:broader, skos:related, skos:scopeNote, rdfs:label, rdfs:subClassOf, rdf:type (two edges), :hasEvent, :hasPeriod, :hasClassn, :hasLocation (three edges).]
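
One way to picture the grounding step is a label lookup that tolerates the plural "stone settings" and the instance suffix "20w179". The sketch below is an assumption-heavy illustration: the mini-thesaurus is a hand-copied subset of the concept names in the graph, the naive plural stripping is not a real stemmer, and linking with rdf:type is only one reading of the diagram.

```python
import re

# Hypothetical mini-thesaurus: concept node -> label.
THESAURUS = {
    "sitetype:stone+setting": "stone setting",
    "sitetype:stone+circle": "stone circle",
    "sitetype:stone+row": "stone row",
    "sitetype:standing+stone": "standing stone",
}


def local_label(node):
    """Recover a readable label from a local node such as sitetype:stone+settings20w179."""
    name = node.split(":", 1)[1]
    name = re.sub(r"\d+w\d+$", "", name)  # drop an instance suffix such as 20w179
    return name.replace("+", " ")


def singular(word):
    """Naive plural stripping; an assumption, not a real stemmer."""
    return word[:-1] if word.endswith("s") and not word.endswith("ss") else word


def normalise(label):
    return " ".join(singular(word) for word in label.lower().split())


def ground(node, thesaurus=THESAURUS):
    """Return a (node, 'rdf:type', concept) triple, or None if nothing matches."""
    target = normalise(local_label(node))
    for concept, label in thesaurus.items():
        if normalise(label) == target:
            return (node, "rdf:type", concept)
    return None


if __name__ == "__main__":
    print(ground("sitetype:stone+settings20w179"))
```

Once the local node points at a thesaurus concept, anything else grounded against the same concept becomes reachable without comparing the two local graphs directly.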

Grounding against various authorities/ontologies

Placename authorities: Geonames, OS gazetteer, Pleiades
Period: EH draft ontology
Monument classifications: Seneschal project
Bibliographic: LCSH, FRBR
...hundreds of LOD datasets in the cloud

Informatics projects
Edina “Unlock” service – spatial and temporal grounding
GAP projects – grounding against maps of the ancient world

Unlock Text – find placenames and plot on map

http://unlock.edina.ac.uk/
GapVis interface

http://nrabinowitz.github.com/gapvis/

Questions?

What can linked data do for me? / Janet Aucock (University of St Andrews)
 

Recently uploaded

Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)cama23
 
Integumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.pptIntegumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.pptshraddhaparab530
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfTechSoup
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Mark Reed
 
ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxVanesaIglesias10
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17Celine George
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONHumphrey A Beña
 
Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4JOYLYNSAMANIEGO
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...JhezDiaz1
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptxAUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptxiammrhaywood
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for BeginnersSabitha Banu
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxAshokKarra1
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Celine George
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parentsnavabharathschool99
 
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfErwinPantujan2
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYKayeClaireEstoconing
 
Activity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationActivity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationRosabel UA
 

Recently uploaded (20)

Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)
 
Integumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.pptIntegumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.ppt
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)
 
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptxYOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
 
ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptx
 
Raw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptxRaw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptx
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
 
Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptxAUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for Beginners
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptx
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parents
 
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
 
Activity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationActivity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translation
 

Can documents be Linked Data? / Kate Byrne, School of Informatics, University of Edinburgh, CIGS LOD Workshop

  • 1. vision txt2rdf grounding Can Documents be Linked Data? Kate Byrne, School of Informatics, University of Edinburgh CIGS LOD Workshop 18th November 2013 1
  • 2. Outline: (1) The semantic web vision; (2) Extracting structured knowledge from free text; (3) Respect for authority, or, Why we need ontologies
  • 3. The semantic web vision. W3C RDF Concepts, 2002 draft: “RDF ... allows anyone to say anything about anything.” Tim Berners-Lee, 2006: “The day-to-day mechanisms of trade, bureaucracy and our daily lives will be handled by machines talking to machines, leaving humans to provide the inspiration and intuition.” Tim Berners-Lee, 2009: “The web as I envisaged it, we have not seen it yet.”
  • 10. Simple declarative sentences: “In a hole in the ground there lived a hobbit. Not a nasty, dirty, wet hole, filled with the ends of worms and an oozy smell, nor yet a dry, bare, sandy hole with nothing in it to sit down on or to eat: it was a hobbit-hole, and that means comfort.” Extracted graph (node and edge labels as drawn): hobbit → lives in → hole → located in → the ground; does not have → nastiness; has type → hobbit hole; has characteristic → comfort
  • 11. A lot of information is in textual form!
  • 20. Nouns and verbs: the hobbit graph annotated with the labels subject, predicate and object. The nodes (hobbit, hole, the ground, nastiness, hobbit hole, comfort) are marked as nouns; the edges (lives in, located in, does not have, has type, has characteristic) are marked as verbs.
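The subject, predicate, object reading of the hobbit graph can be sketched as plain tuples. This is a minimal illustration rather than anything shown in the talk: the `Triple` type and the helper names are hypothetical, and attaching "does not have" and "has type" to "hole" is an assumption about how the flattened diagram was drawn.

```python
from typing import List, NamedTuple


class Triple(NamedTuple):
    """One statement: subject and object are nouns, predicate is a verb phrase."""
    subject: str
    predicate: str
    obj: str


# Triples read off the hobbit slide. The subjects of the third and
# fourth statements are assumed, since the diagram layout was lost.
TRIPLES: List[Triple] = [
    Triple("hobbit", "lives in", "hole"),
    Triple("hole", "located in", "the ground"),
    Triple("hole", "does not have", "nastiness"),   # assumed subject
    Triple("hole", "has type", "hobbit hole"),      # assumed subject
    Triple("hobbit hole", "has characteristic", "comfort"),
]


def nouns(triples: List[Triple]) -> List[str]:
    """Every node of the graph: anything used as a subject or an object."""
    return sorted({t.subject for t in triples} | {t.obj for t in triples})


def verbs(triples: List[Triple]) -> List[str]:
    """Every edge label of the graph: the predicates."""
    return sorted({t.predicate for t in triples})


if __name__ == "__main__":
    print("nouns:", nouns(TRIPLES))
    print("verbs:", verbs(TRIPLES))
```

Keeping nodes and edge labels in separate sets mirrors the slide's point: the nouns become RDF resources and the verbs become properties.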
  • 22. Extracting structured knowledge from free text: fancy NLP processing and RDFisation
  • 23. Natural Language Processing pipeline [diagram]: Text documents → Pre-processing (tokenise; sentence and para split; POS tag; multi-word tokens and features) → Named Entity Recognition (trained NER model; list of NEs and classes) → Relation Extraction (set of NE pairs and features; trained RE model; list of relations and classes) → RDF translation (attach siteids; remove unwanted relations; generate triples) → Graph of triples
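The shape of the pipeline on this slide can be sketched in a few lines. This is a toy under stated assumptions, not the system described in the talk: a small hand-written gazetteer stands in for the trained NER model, a lookup table on entity classes stands in for the trained RE model, and every name below (`GAZETTEER`, `RELATION_FOR`, `pipeline`, and so on) is hypothetical. The sample sentence is taken from the site 20 record quoted later in the deck.

```python
import re
from itertools import permutations

# Stand-in for the trained NER model: surface string -> entity class.
GAZETTEER = {
    "Sands of Breckon": "PLACE",
    "excavation": "EVENT",
}
YEAR = re.compile(r"\b(1[0-9]{3}|20[0-9]{2})\b")

# Stand-in for the trained RE model: (subject class, object class) -> relation.
RELATION_FOR = {
    ("EVENT", "PLACE"): "hasLocation",
    ("EVENT", "DATE"): "hasDate",
}

SAMPLE = (
    "Field survey and excavation, as a response to continual wind and "
    "marine erosion, was carried out at the Sands of Breckon between "
    "1982 and 1983."
)


def split_sentences(text):
    """Pre-processing: naive sentence split on end punctuation."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def recognise_entities(sentence):
    """NER: gazetteer lookup plus a regular expression for years."""
    lowered = sentence.lower()
    entities = [(surface, cls) for surface, cls in GAZETTEER.items()
                if surface.lower() in lowered]
    entities += [(year, "DATE") for year in YEAR.findall(sentence)]
    return entities


def extract_relations(entities):
    """RE: propose a relation for each ordered pair of entities in a sentence."""
    relations = []
    for (subj, subj_cls), (obj, obj_cls) in permutations(entities, 2):
        predicate = RELATION_FOR.get((subj_cls, obj_cls))
        if predicate:
            relations.append((subj, predicate, obj))
    return relations


def pipeline(text, site_id):
    """Run all stages and attach the site id to every event found."""
    triples = []
    for sentence in split_sentences(text):
        entities = recognise_entities(sentence)
        triples.extend(extract_relations(entities))
        triples.extend((site_id, "hasEvent", surface)
                       for surface, cls in entities if cls == "EVENT")
    return triples


if __name__ == "__main__":
    for triple in pipeline(SAMPLE, "site20"):
        print(triple)
```

Pairing entities only within a sentence is a simplification chosen to keep the sketch short; a trained relation extractor would score candidate pairs using features rather than a fixed table.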
  • 24. Named entities and relations. Site 20: Evidence of a quartz knapping site was found within the confines of the stone circle, and in conjunction with several structures within the inner ring, strongly suggests a domestic site. Besides the quartz implements and corresponding waste, several other artifacts of local origin occurred including a split pebble axe of greenstone with Shetland Early Bronze Age affinities. B Beveridge, 1972. Field survey and excavation, as a response to continual wind and marine erosion, was carried out at the Sands of Breckon between 1982 and 1983. HP50NW 11.00 was recorded as a stone settings surrounded by occupational debris (Site 22). Excavation revealed midden deposits of an early Iron Age date and a surface scatter of artefacts of mixed dates. The stone settings were tentatively interpreted as the basal stones of long cists. Historic Scotland Archive Project (SW) 2002.
  • 26. Converting text relations to RDF – 1 (site 20): site20 − hasEvent − excavationX; excavationX − hasLocation − SandsOfBreckon; excavationX − hasDate − 1982
  • 27. Converting text relations to RDF – 2: site20 − hasEvent − excavationX; excavationX − hasLocation − SandsOfBreckon; excavationX − hasDate − 1982. [Graph diagram. Nodes: siteid:site20, event:excavation20w158, event:excavation, date:1982, sitetype:stone+settings20w179, sitename:sands+of+breckon, address:hp50nw+11.00, address:breckon. Edge labels: :hasEvent, :hasLocation, :hasPeriod, :hasClassn, rdf:type.]
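The step from text relations to RDF amounts to minting a URI for each entity and writing one statement per relation. The sketch below serialises the three relations from the first conversion slide as N-Triples. It is an illustration only: the `http://example.org/` namespace, the `rel/` path for properties, and the function names are assumptions, and the talk's own URI scheme is only partly visible in the diagram.

```python
from urllib.parse import quote_plus

BASE = "http://example.org/"  # hypothetical namespace, not from the talk


def mint_uri(kind, label):
    """Build a URI such as <.../sitename/sands+of+breckon> from a text label."""
    return f"<{BASE}{kind}/{quote_plus(label.lower())}>"


def to_ntriple(subject, predicate, obj):
    """Format one statement as a line of N-Triples."""
    return f"{subject} <{BASE}rel/{predicate}> {obj} ."


site = mint_uri("siteid", "site20")
event = mint_uri("event", "excavation20w158")

LINES = [
    to_ntriple(site, "hasEvent", event),
    to_ntriple(event, "hasLocation", mint_uri("sitename", "Sands of Breckon")),
    to_ntriple(event, "hasDate", mint_uri("date", "1982")),
]

if __name__ == "__main__":
    print("\n".join(LINES))
```

Lower-casing the label and encoding spaces as `+` reproduces the look of the node names on the slide, such as `sitename:sands+of+breckon`.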
  • 30. Let’s remind ourselves what’s the point of Linked Data. Archaeological site archive record: siteid: 47919; sitename: Cairnpapple; classification: Cairn, henge; site number: NS97SE 16; “A complex site on the summit of Cairnpapple Hill excavated by Piggot in 1947...” Museum database record: objectid: X.EP 167; find spot: Cairnpapple; “This stone flake from the cutting edge of a ground stone axehead was found at Cairnpapple in West Lothian. The stone is from...” [Graph diagram. Nodes: :Siteid#site47919, Id#ns97se+16, Classn/Sitetype#cairn%20+henge, :Loc/Sitename#cairnpapple, :Loc/Place#cairnpapple+hill, :Loc/Place#west+lothian, :Event#excavated47919w10, :Agent/Person#piggot, :Time/Date#1947, :Objectid#x.ep+167, :Classn/Objtype#axe+flake. Edge labels: :hasId, :hasClassn, :hasLocation, :hasEvent, :hasAgent, :hasPeriod, :hasFindSpot.]
  • 31. But linking Linked Data is actually pretty hard. [Same archive record, museum record and graph as the previous slide.] Direct link means spotting identical node in separate graph. How? String matching? Clues from context?
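The "string matching?" option raised on this slide can be made concrete with a small sketch. It normalises node labels and scores pairs with `difflib.SequenceMatcher` from the Python standard library. The labels come from the Cairnpapple example; the function names and the 0.8 threshold are assumptions chosen for illustration, and nothing here is presented as the method used in the talk.

```python
from difflib import SequenceMatcher


def normalise(label):
    """Lower-case a label and turn '+' separators back into single spaces."""
    return " ".join(label.lower().replace("+", " ").split())


def similarity(a, b):
    """Similarity in [0, 1] between two labels after normalisation."""
    return SequenceMatcher(None, normalise(a), normalise(b)).ratio()


def candidate_links(local_labels, other_labels, threshold=0.8):
    """Pairs of labels from two graphs that look like the same node."""
    pairs = []
    for local in local_labels:
        for other in other_labels:
            score = similarity(local, other)
            if score >= threshold:
                pairs.append((local, other, round(score, 3)))
    return pairs


# Labels from the site archive graph and the museum graph on the slide.
ARCHIVE = ["cairnpapple", "cairnpapple+hill"]
MUSEUM = ["Cairnpapple", "west+lothian"]

if __name__ == "__main__":
    for pair in candidate_links(ARCHIVE, MUSEUM):
        print(pair)
```

Matching on strings alone links "cairnpapple+hill" to "Cairnpapple" as readily as the exact match, which illustrates why the slide also asks about clues from context.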
  • 34. Using LOD cloud “Authority Nodes” as intermediaries: grounding local URIs against "authority" nodes is the next big challenge!
  • 35. Grounding site20 against Monument Thesaurus. [Graph diagram. Local nodes: siteid:site20, event:excavation20w158, event:excavation, date:1982, sitetype:stone+settings20w179, sitename:sands+of+breckon, address:hp50nw+11.01+hp+5304+0519, address:breckon. Thesaurus nodes: sitetype:, sitetype:stone+setting, sitetype:stone+circle, sitetype:stone+row, sitetype:standing+stone, sitetype:religious+ritual+and+funerary. Literals: "stone setting", "An arrangement of two or more standing stones". Edge labels: :hasEvent, :hasLocation, :hasPeriod, :hasClassn, rdf:type, rdfs:label, rdfs:subClassOf, skos:broader, skos:related, skos:scopeNote.]
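Grounding a local classification node against a thesaurus can be sketched as a label lookup. The example below is a simplified illustration, not the talk's implementation. It assumes that the `20w179`-style suffix on a local node is an instance identifier that can be stripped, uses a naive plural-stripping rule, and fills in thesaurus labels that are not shown on the slide (only "stone setting" appears there as a literal). All function names are hypothetical.

```python
import re

# Thesaurus node -> label. Only "stone setting" is given on the slide;
# the other labels are assumed from the node names.
THESAURUS = {
    "sitetype:stone+setting": "stone setting",
    "sitetype:stone+circle": "stone circle",
    "sitetype:stone+row": "stone row",
    "sitetype:standing+stone": "standing stone",
}

# Assumed shape of the instance suffix on local nodes, e.g. "20w179".
INSTANCE_SUFFIX = re.compile(r"\d+w\d+$")


def local_label(node):
    """Recover a text label from a local node such as sitetype:stone+settings20w179."""
    name = node.split(":", 1)[1]
    return INSTANCE_SUFFIX.sub("", name).replace("+", " ")


def normalise(label):
    """Lower-case and strip a trailing plural 's' from longer words (naive)."""
    words = label.lower().split()
    return " ".join(w[:-1] if w.endswith("s") and len(w) > 3 else w for w in words)


def ground(node, thesaurus=THESAURUS):
    """Return the thesaurus node whose label matches the local node, or None."""
    index = {normalise(label): uri for uri, label in thesaurus.items()}
    return index.get(normalise(local_label(node)))


if __name__ == "__main__":
    print(ground("sitetype:stone+settings20w179"))
```

The function returns only the matching authority node. Which property should join the local node to it (the diagram shows both `rdf:type` and `rdfs:subClassOf` among its edge labels) is left open here.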
  • 37. Grounding against various authorities/ontologies. Placename authorities: Geonames, OS gazetteer, Pleiades. Period: EH draft ontology. Monument classifications: Seneschal project. Bibliographic: LCSH, FRBR. ...hundreds of LOD datasets in the cloud. Informatics projects: Edina “Unlock” service – spatial and temporal grounding; GAP projects – grounding against maps of the ancient world.
  • 39. Unlock Text – find placenames and plot on map: http://unlock.edina.ac.uk/