SlideShare une entreprise Scribd logo
1  sur  49
Télécharger pour lire hors ligne
NERD: an open source
platform for extracting and
disambiguating named entities
in very diverse documents
Raphaël Troncy <raphael.troncy@eurecom.fr>
Giuseppe Rizzo <giuseppe.rizzo@eurecom.fr>
What is a Named Entity recognition task?
 A task that aims to locate and classify the name of a
person or an organization, a location, a brand, a
product, a numeric expression including time, date,
money and percent in a textual document

22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

-2
Example
 “ I want to book a room in an hotel located in
the heart of Paris, just a stone’s throw from the
Eiffel Tower ”

Eric Charton, “Named Entity Detection and Entity Linking in the
Context of Semantic Web: Exploring the ambiguity question”

22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

-3
Part of Speech
I
want
to
book
a
room
in
…
Paris

PRP
VBP
TO
VB
DT
NN
IN
…
NNP

NER: What is Paris?
NEL: Which Paris are we
talking about?

Giuseppe Rizzo, “Learning with the Web: Structuring data to
ease machine understanding”

22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

-4
What is Paris? Type Ambiguity

dbpedia-owl:Asteroid

schema:City

schema:Movie
dbpedia-owl:Film

Giuseppe Rizzo, “Learning with the Web: Structuring data to
ease machine understanding”

22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

-5
Named Entity Recognition (NER)
I
want
to
book
a
room
in
…
Paris

PRP
VBP
TO
VB
DT
NN
IN
…
NNP

O
O
O
O
O
O
O
…
LOC

Giuseppe Rizzo, “Learning with the Web: Structuring data to
ease machine understanding”

22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

-6
What is Paris? Name Ambiguity

Paris, Kentucky

Paris, France

Paris, Maine

Paris, Idaho

Paris, Tennessee

Paris, Ontario

Giuseppe Rizzo, “Learning with the Web: Structuring data to
ease machine understanding”
22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

-7
Named Entity Linking (NEL)
I
want
to
book
a
room
in
…
Paris

PRP
VBP
TO
VB
DT
NN
IN
…
NNP

O
O
O
O
O
O
O
…
LOC

O
O
O
O
O
O
O
…
http://dbpedia.org/resource/Paris

Giuseppe Rizzo, “Learning with the Web: Structuring data to
ease machine understanding”

22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

-8
NER Tools and Web APIs
 Standalone software
 GATE
 Stanford CoreNLP
 Temis

http://nerd.eurecom.fr/

 Web APIs

22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

-9
NERD: Named Entity Recognition and
Disambiguation
 Compare performances of
NER and NEL tools
 Understand strengths and weaknesses of different Web APIs
 Adapt NER processing to different context

 (Learn how to) Combine NER (/ NEL) tools
 Participate in various benchmarks

22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

- 10
What is NERD?
ontology1

REST API2
UI3

1

http://nerd.eurecom.fr/ontology
2 http://nerd.eurecom.fr/api/application.wadl
3 http://nerd.eurecom.fr

22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

- 11
Factual comparison of 10 Web NER tools
Alchemy
API

DBpedia
Spotlight

Evri

Extractiv

Lupedia

Open
Calais

Saplo

Wikimeta

Yahoo!

Zemanta

Language

EN,FR,
GR,IT,
PT,RU,
SP,SW

EN
GR*
PT*
SP*

EN,I
T

EN

EN,FR,
IT

EN,FR
SP

EN,
SW

EN,FR
SP

EN

EN

Granularity

OEN

OEN

OED

OEN

OEN

OEN

OED

OEN

OEN

OED

Entity
position

N/A

char
offset

N/A

word
offset

range of
chars

char
offset

N/A

POS
offset

range
of
chars

N/A

Alchemy

DBpedia
FreeBase
Scema.or
g

Evri

DBpedia

DBpedia
LinkedM
DB

Open
Calais

N/A

ESTER

Yahoo

FreeBase

Number of
classes

324

320

5

34

319

95

5

7

13

81

Response
Format

JSON
MicroF
XML
RDF

HTML
JSON
RDF
XML

HTM
L
JSO
N
RDF

HTML
JSON
RDF
XML

HTML
JSON
RDFa
XML

JSON
MicroF
ormat

JSON

JSON
XML

JSON
XML

XML
JSON
RDF

Quota
(calls/day)

30000

unl

300
3000
unl
50000
NLP&DBpedia International Workshop, Sydney, October 2013
0

1333

unl

5000

10000

Classification
schema

22/10/2013 -

12/15
NERD Ontology

Aligned the taxonomies used by
the extractors
22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

- 13
NERD type

Building the NERD Ontology

Occurrence

Person

10

Organization

10

Country
Company

6

Continent

5

City

5

RadioStation

5

Album

5

Product

5

...

NLP&DBpedia International Workshop, Sydney, October 2013

6

Location

22/10/2013 -

6

...

- 14
NERD REST API
RDF
/document
/user
/annotation/{extractor}
/extraction
/evaluation
...

GET,
POST,
PUT,
DELETE

JSON
“entities” : [{
“entity”: “Tim Berners-Lee” ,
“type”: “Person” ,
“uri”: "http://dbpedia.org/resource/Tim_berners_lee",
“nerdType”: "http://nerd.eurecom.fr/ontology#Person",
“startChar”: 30,
“endChar”: 45,
“confidence”: 1,
“relevance”: 0.5
}]

Rizzo G., Troncy R. (2012), NERD: A Framework for Unifying Named Entity Recognition and Disambiguation Web Extraction
Tools. In: European chapter of the Association for Computational Linguistics (EACL'12), Avignon, France.

22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

- 15
NERD meets NIF
Model documents through a
set of strings deferencable on
the Web
: offset_23107_ 23110 a str:String ;
str:referenceContext :offset_0_26546 .

Map string to entity
: offset_23107_ 23110 sso:oen dbpedia:W3C.

Classification
dbpedia:W3C

rdf:type

nerd:Organization .

Rizzo G, Troncy R., Hellmann S. and Bruemmer M. (2012), NERD meets NIF: Lifting NLP Extraction Results to the Linked
Data Cloud. In: (LDOW'12) Linked Data on the Web (WWW'12), Lyon, France.
22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

- 16
NERD User Dashboard

22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

- 17
NERD User Interface

22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

- 18
History of NER benchmarks
 CoNLL 2003 and CoNLL 2005
 schema (4 types): person, organization, location and miscellaneous

 ACE 2004, ACE 2005 and ACE 2007
 schema (7 types): person, organization, location, facility, weapon,
vehicle and geo-political entity
 entity recognition, co-ref, find relationships among entities extracted

 TAC 2009 (Knowledge Base Track)
 schema (3 types): person, organization and location
 create a knowledge base from the named entities extracted

 ETAPE 2012 (Named Entity Task)
 schema: Quaero (7 main types, 32 sub-types)

 MSM 2013: tweet corpus !
 schema (4 types): person, organization, location, miscellaneous
22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

- 19
ETAPE 2012 challenge
genre

train

dev

test

TV news

7h 40m

1h 40m

1h 40m

BFM Story, Top QUestions (LCP)

TV debates

10h 30m

5h 10m

5h 10m

Pile et Face, Ca vous regarde,
Entre les lignes (LCP)

1h 05m

1h 05m

La place du village (TV8)

TV amusements -

sources

Train

Dev

Eval

Item length

26h

10h 55m

10h 55m

Nb files

44

15

15

Nb words

290517

91656

115511

Nb Named Entities

46763

14398

13055

Nb unique categories

33

33

33

22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

- 20
NERD @ ETAPE (naïve combined strategy)
extraction

(eA1,tA1,URIA1,siA1,eiA1) ...
(eA2,tA2,URIA2,siA2,eiA2)
(eA3,tA3,URIA3,siA3,eiA3)

...

...

cleaning

fusion
`

22/10/2013 -

(eN1,tN1,URIN1,siN1,eiN1)
(eN2,tN2,URIN2,siN2,eiN2)

When at least 2 extractors classify the
same entity with a different type then
we apply a preferred selection order
(empirically defined): Wikimeta,
AlchemyAPI, OpenCalais, Lupedia

NLP&DBpedia International Workshop, Sydney, October 2013

- 21
Participation at ETAPE (combined+ strategy)
ETAPE
Train & Dev

...
Learned model

POS tagger

Created
static rules

Apply rules

(eA1,tA1,URIA1,siA1,eA1
)
(eA2,tA2,URIA2,siA2,eiA2
)

(e1,t1,URI1,si1,ei1)

fusion
Conflicts handled by
priority selection: own,
Wikimeta,AlchemyAPI,
OpenCalais,Lupedia

(eN1,tN1,URIN1,sN1,eN1)
`(e ,t ,URI ,s ,e )
N2 N2
N2 N2 N2
22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

- 22
NERD Global results

SLR

Precision

Recall

F-measure

%correct

combined

86.85%

35.31%

17.69%

23.44%

17.69%

combined+

188.81%

15.13%

28.40%

19.45%

28.40%

Combined+ : Eval corpus differs substantially from the Train & Dev
corpora. The static rules do not fit well the Eval corpora and they
introduce classification noise.

22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

- 23
Per-extractor results
SLR

Precision

Recall

F-measure

%correct

alchemyapi

37.71%

47.95%

5.45%

9.68%

5.45%

lupedia

39.49%

22.87%

1.56%

2.91%

1.56%

opencalais

37.47%

41.69%

3.53%

6.49%

3.53%

wikimeta

36.67%

19.40%

4.25%

6.95%

4.25%

combined
(nerd)

86.85%

35.31%

17.69%

23.44%

17.69%

combined+
(nerd+)

188.81%

15.13%

28.40%

19.45%

28.40%

22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

- 24
22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

- 25
Learning How to Combine NER Extractors

22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

- 26
NERD on CoNLL 2003 (NER task)

22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

- 27
NERD on MSM 2013 (NER task)

22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

- 28
NERD on MSM 2013 (NEL task)

22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

- 29
Media Fragment Enricher:
http://mfe.synote.org/mfe/

22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

- 30
Linking pieces of knowledge

22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

- 31
Linking pieces of knowledge

22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

- 32
Named Entities for Video Classification

22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

- 33
Workflow
5:Timed Text
6: NEs with time
alignment
(json)

2: Metadata
7: RDFize (ttl)
Media Fragment Enricher Services
Metadata &
timed-text

1: Video
URL

NERD
Client

3: metadata

RDFizator

9: SPARQL query
4:NERDify

Video and
metadata preview

Categorization

Triple Store

8: Generate
Category

Video replay with subtitles and
aligned NEs

Media Fragment Enricher UI

22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

- 34
Channel signature based on NE distribution

22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

- 35
22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

- 36
LinkedTV: automatic annotations ...

22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

- 37
... and enrichment for hypervideos

CONCEPT IN
PLAYER
Cubism

Expressionism

Fauvism

FACETS / PROPERTIES OF CONCEPT
22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

CONTENT ENRICHMENT
- 38
Media Fragments and Annotations

http://data.linkedtv.eu/medi
a/e2899e7f#t=840,900

nerd:Location
Casablanca

nerd:Location
Cafe Rick

nerd:Person
H. Bogart

nerd:Person
I. Bergman

 Media Fragment URI 1.0





22/10/2013 -

Chapters
Scenes
Shots
etc…

NLP&DBpedia International Workshop, Sydney, October 2013

- 39
Enrichment and Hypervideos

nerd:Location
Casablanca

nerd:Location
Cafe Rick

nerd:Person
H. Bogart

Nerd:Person
E. Tierney

22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

nerd:Person
I. Bergman
nerd:Location
China

- 40
Media Fragment + Open Annotation + NERD
Locator

MediaResource
OffsetBasedString

Annotation

MediaFragment

Entity
Type

URL (hyperlink)

22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

- 41
Towards a Linked Media Layer
 Enriching media with media from a closed collection
(e.g. BBC archive)
 The MediaEval scenario (~ 1697 hours of archived BBC video)
http://www.multimediaeval.org/mediaeval2013/hyper2013/

 Enriching media with content from the open web
 LinkedTV scenarios: white listed web sites for each program
 Media Collector for Social Media
22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

- 42
Seed video enriched with web content
rbbaktuell_20120809

nerd:Location
Brandenburg
oa
Enrichments are Annotations too

22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

- 44
Media Finder (named entities clustering)

22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

- 45
Media Finder (zooming in a cluster)

22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

- 46
Media Finder: http://mediafinder.eurecom.fr/
 Live Topic Generation from Event Streams
 WWW 2013 Demo Session
 http://www.youtube.com/watch?v=8iRiwz7cDYY

22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

- 47
Credits
 Giuseppe Rizzo, Vuk Milicic,
José Luis Redondo Garcia (EURECOM)
 Thomas Steiner (Google Inc.)
 Marieke van Erp (Free University of Amsterdam)
 Yunjia Li (University of Southampton)
 … and many other students

22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

- 48
http://www.slideshare.net/troncy
22/10/2013 -

NLP&DBpedia International Workshop, Sydney, October 2013

- 49

Contenu connexe

Tendances

Linked Open Data to support content based Recommender Systems
Linked Open Data to support content based Recommender SystemsLinked Open Data to support content based Recommender Systems
Linked Open Data to support content based Recommender SystemsVito Ostuni
 
Revealing digital documents - concealed structures in data
Revealing digital documents - concealed structures in dataRevealing digital documents - concealed structures in data
Revealing digital documents - concealed structures in dataJakob .
 
Semantic Integration Patterns
Semantic Integration PatternsSemantic Integration Patterns
Semantic Integration PatternsOptum
 
Ci2004-10.doc
Ci2004-10.docCi2004-10.doc
Ci2004-10.docbutest
 
Introduction to question answering for linked data & big data
Introduction to question answering for linked data & big dataIntroduction to question answering for linked data & big data
Introduction to question answering for linked data & big dataAndre Freitas
 
External controlled vocabularies support in Dataverse
External controlled vocabularies support in DataverseExternal controlled vocabularies support in Dataverse
External controlled vocabularies support in Dataversevty
 
Controlled vocabularies and ontologies in Dataverse data repository
Controlled vocabularies and ontologies in Dataverse data repositoryControlled vocabularies and ontologies in Dataverse data repository
Controlled vocabularies and ontologies in Dataverse data repositoryvty
 
Application Modeling with Graph Databases - Relationships are cool
Application Modeling with Graph Databases - Relationships are coolApplication Modeling with Graph Databases - Relationships are cool
Application Modeling with Graph Databases - Relationships are coolLars Martin
 
Information Technology in Industry(ITII) - November Issue 2018
Information Technology in Industry(ITII) - November Issue 2018Information Technology in Industry(ITII) - November Issue 2018
Information Technology in Industry(ITII) - November Issue 2018ITIIIndustries
 
Extracting, Aligning, and Linking Data to Build Knowledge Graphs
Extracting, Aligning, and Linking Data to Build Knowledge GraphsExtracting, Aligning, and Linking Data to Build Knowledge Graphs
Extracting, Aligning, and Linking Data to Build Knowledge GraphsCraig Knoblock
 
SemTecBiz 2012: Corporate Semantic Web
SemTecBiz 2012: Corporate Semantic WebSemTecBiz 2012: Corporate Semantic Web
SemTecBiz 2012: Corporate Semantic WebAdrian Paschke
 
Entity-Centric Data Management
Entity-Centric Data ManagementEntity-Centric Data Management
Entity-Centric Data ManagementeXascale Infolab
 
Borders of Decidability in Verification of Data-Centric Dynamic Systems
Borders of Decidability in Verification of Data-Centric Dynamic SystemsBorders of Decidability in Verification of Data-Centric Dynamic Systems
Borders of Decidability in Verification of Data-Centric Dynamic Systemsnet2-project
 
Lecture 02 Machine Learning For Data Mining
Lecture 02 Machine Learning For Data MiningLecture 02 Machine Learning For Data Mining
Lecture 02 Machine Learning For Data MiningPier Luca Lanzi
 
Semantic Meta-Mining of Knowledge Discovery Processes
Semantic Meta-Mining of Knowledge Discovery ProcessesSemantic Meta-Mining of Knowledge Discovery Processes
Semantic Meta-Mining of Knowledge Discovery ProcessesAgnieszka Ławrynowicz
 
Setting up Dataverse repository for research data
Setting up Dataverse repository for research dataSetting up Dataverse repository for research data
Setting up Dataverse repository for research datavty
 

Tendances (18)

Presentation at MTSR 2012
Presentation at MTSR 2012Presentation at MTSR 2012
Presentation at MTSR 2012
 
Linked Open Data to support content based Recommender Systems
Linked Open Data to support content based Recommender SystemsLinked Open Data to support content based Recommender Systems
Linked Open Data to support content based Recommender Systems
 
P-Plan
P-PlanP-Plan
P-Plan
 
Revealing digital documents - concealed structures in data
Revealing digital documents - concealed structures in dataRevealing digital documents - concealed structures in data
Revealing digital documents - concealed structures in data
 
Semantic Integration Patterns
Semantic Integration PatternsSemantic Integration Patterns
Semantic Integration Patterns
 
Ci2004-10.doc
Ci2004-10.docCi2004-10.doc
Ci2004-10.doc
 
Introduction to question answering for linked data & big data
Introduction to question answering for linked data & big dataIntroduction to question answering for linked data & big data
Introduction to question answering for linked data & big data
 
External controlled vocabularies support in Dataverse
External controlled vocabularies support in DataverseExternal controlled vocabularies support in Dataverse
External controlled vocabularies support in Dataverse
 
Controlled vocabularies and ontologies in Dataverse data repository
Controlled vocabularies and ontologies in Dataverse data repositoryControlled vocabularies and ontologies in Dataverse data repository
Controlled vocabularies and ontologies in Dataverse data repository
 
Application Modeling with Graph Databases - Relationships are cool
Application Modeling with Graph Databases - Relationships are coolApplication Modeling with Graph Databases - Relationships are cool
Application Modeling with Graph Databases - Relationships are cool
 
Information Technology in Industry(ITII) - November Issue 2018
Information Technology in Industry(ITII) - November Issue 2018Information Technology in Industry(ITII) - November Issue 2018
Information Technology in Industry(ITII) - November Issue 2018
 
Extracting, Aligning, and Linking Data to Build Knowledge Graphs
Extracting, Aligning, and Linking Data to Build Knowledge GraphsExtracting, Aligning, and Linking Data to Build Knowledge Graphs
Extracting, Aligning, and Linking Data to Build Knowledge Graphs
 
SemTecBiz 2012: Corporate Semantic Web
SemTecBiz 2012: Corporate Semantic WebSemTecBiz 2012: Corporate Semantic Web
SemTecBiz 2012: Corporate Semantic Web
 
Entity-Centric Data Management
Entity-Centric Data ManagementEntity-Centric Data Management
Entity-Centric Data Management
 
Borders of Decidability in Verification of Data-Centric Dynamic Systems
Borders of Decidability in Verification of Data-Centric Dynamic SystemsBorders of Decidability in Verification of Data-Centric Dynamic Systems
Borders of Decidability in Verification of Data-Centric Dynamic Systems
 
Lecture 02 Machine Learning For Data Mining
Lecture 02 Machine Learning For Data MiningLecture 02 Machine Learning For Data Mining
Lecture 02 Machine Learning For Data Mining
 
Semantic Meta-Mining of Knowledge Discovery Processes
Semantic Meta-Mining of Knowledge Discovery ProcessesSemantic Meta-Mining of Knowledge Discovery Processes
Semantic Meta-Mining of Knowledge Discovery Processes
 
Setting up Dataverse repository for research data
Setting up Dataverse repository for research dataSetting up Dataverse repository for research data
Setting up Dataverse repository for research data
 

Similaire à NERD: an open source platform for extracting and disambiguating named entities in very diverse documents NLP-DBpedia 2013

Semantics at the multimedia fragment level SSSW 2013
Semantics at the multimedia fragment level SSSW 2013Semantics at the multimedia fragment level SSSW 2013
Semantics at the multimedia fragment level SSSW 2013Raphael Troncy
 
Inductive Entity Typing Alignment
Inductive Entity Typing AlignmentInductive Entity Typing Alignment
Inductive Entity Typing AlignmentGiuseppe Rizzo
 
Hala skafkeynote@conferencedata2021
Hala skafkeynote@conferencedata2021Hala skafkeynote@conferencedata2021
Hala skafkeynote@conferencedata2021hala Skaf
 
An Empirical Comparison of Knowledge Graph Embeddings for Item Recommendation
An Empirical Comparison of Knowledge Graph Embeddings for Item RecommendationAn Empirical Comparison of Knowledge Graph Embeddings for Item Recommendation
An Empirical Comparison of Knowledge Graph Embeddings for Item RecommendationEnrico Palumbo
 
3rd 3DDRESD: DRESD Future Plan 0809
3rd 3DDRESD: DRESD Future Plan 08093rd 3DDRESD: DRESD Future Plan 0809
3rd 3DDRESD: DRESD Future Plan 0809Marco Santambrogio
 
Linked Open Data Visualization
Linked Open Data VisualizationLinked Open Data Visualization
Linked Open Data VisualizationLaura Po
 
Knowledge Graph Embeddings for Recommender Systems
Knowledge Graph Embeddings for Recommender SystemsKnowledge Graph Embeddings for Recommender Systems
Knowledge Graph Embeddings for Recommender SystemsEnrico Palumbo
 
M.Sc. Thesis Topics and Proposals @ Polimi Data Science Lab - 2024 - prof. Br...
M.Sc. Thesis Topics and Proposals @ Polimi Data Science Lab - 2024 - prof. Br...M.Sc. Thesis Topics and Proposals @ Polimi Data Science Lab - 2024 - prof. Br...
M.Sc. Thesis Topics and Proposals @ Polimi Data Science Lab - 2024 - prof. Br...Marco Brambilla
 
SCAM 2012 Keynote Slides on Cooperative Testing and Analysis by Tao Xie
SCAM 2012 Keynote Slides on Cooperative Testing and Analysis by Tao XieSCAM 2012 Keynote Slides on Cooperative Testing and Analysis by Tao Xie
SCAM 2012 Keynote Slides on Cooperative Testing and Analysis by Tao XieTao Xie
 
OER World Map Project
OER World Map Project OER World Map Project
OER World Map Project Robert Farrow
 
Semantic technologies for the Internet of Things
Semantic technologies for the Internet of Things Semantic technologies for the Internet of Things
Semantic technologies for the Internet of Things PayamBarnaghi
 
“Publishing and Consuming Linked Data. (Lessons learnt when using LOD in an a...
“Publishing and Consuming Linked Data. (Lessons learnt when using LOD in an a...“Publishing and Consuming Linked Data. (Lessons learnt when using LOD in an a...
“Publishing and Consuming Linked Data. (Lessons learnt when using LOD in an a...Marta Villegas
 
GATE, HLT and Machine Learning, Sheffield, July 2003
GATE, HLT and Machine Learning, Sheffield, July 2003GATE, HLT and Machine Learning, Sheffield, July 2003
GATE, HLT and Machine Learning, Sheffield, July 2003butest
 
Toward universal information access on the digital object cloud
Toward universal information access on the digital object cloudToward universal information access on the digital object cloud
Toward universal information access on the digital object cloudNational Institute of Informatics
 
Reading Group 2013 (DERI NUIG)
Reading Group 2013 (DERI NUIG)Reading Group 2013 (DERI NUIG)
Reading Group 2013 (DERI NUIG)Bianca Pereira
 

Similaire à NERD: an open source platform for extracting and disambiguating named entities in very diverse documents NLP-DBpedia 2013 (20)

Semantics at the multimedia fragment level SSSW 2013
Semantics at the multimedia fragment level SSSW 2013Semantics at the multimedia fragment level SSSW 2013
Semantics at the multimedia fragment level SSSW 2013
 
Inductive Entity Typing Alignment
Inductive Entity Typing AlignmentInductive Entity Typing Alignment
Inductive Entity Typing Alignment
 
Hala skafkeynote@conferencedata2021
Hala skafkeynote@conferencedata2021Hala skafkeynote@conferencedata2021
Hala skafkeynote@conferencedata2021
 
An Empirical Comparison of Knowledge Graph Embeddings for Item Recommendation
An Empirical Comparison of Knowledge Graph Embeddings for Item RecommendationAn Empirical Comparison of Knowledge Graph Embeddings for Item Recommendation
An Empirical Comparison of Knowledge Graph Embeddings for Item Recommendation
 
3rd 3DDRESD: DRESD Future Plan 0809
3rd 3DDRESD: DRESD Future Plan 08093rd 3DDRESD: DRESD Future Plan 0809
3rd 3DDRESD: DRESD Future Plan 0809
 
Linked Open Data Visualization
Linked Open Data VisualizationLinked Open Data Visualization
Linked Open Data Visualization
 
Knowledge Graph Embeddings for Recommender Systems
Knowledge Graph Embeddings for Recommender SystemsKnowledge Graph Embeddings for Recommender Systems
Knowledge Graph Embeddings for Recommender Systems
 
M.Sc. Thesis Topics and Proposals @ Polimi Data Science Lab - 2024 - prof. Br...
M.Sc. Thesis Topics and Proposals @ Polimi Data Science Lab - 2024 - prof. Br...M.Sc. Thesis Topics and Proposals @ Polimi Data Science Lab - 2024 - prof. Br...
M.Sc. Thesis Topics and Proposals @ Polimi Data Science Lab - 2024 - prof. Br...
 
SCAM 2012 Keynote Slides on Cooperative Testing and Analysis by Tao Xie
SCAM 2012 Keynote Slides on Cooperative Testing and Analysis by Tao XieSCAM 2012 Keynote Slides on Cooperative Testing and Analysis by Tao Xie
SCAM 2012 Keynote Slides on Cooperative Testing and Analysis by Tao Xie
 
JAVA PROGRAMMING
JAVA PROGRAMMING JAVA PROGRAMMING
JAVA PROGRAMMING
 
Birasa 1
Birasa 1Birasa 1
Birasa 1
 
JAVA PROGRAMMINGD
JAVA PROGRAMMINGDJAVA PROGRAMMINGD
JAVA PROGRAMMINGD
 
OER World Map Project
OER World Map Project OER World Map Project
OER World Map Project
 
Semantic technologies for the Internet of Things
Semantic technologies for the Internet of Things Semantic technologies for the Internet of Things
Semantic technologies for the Internet of Things
 
“Publishing and Consuming Linked Data. (Lessons learnt when using LOD in an a...
“Publishing and Consuming Linked Data. (Lessons learnt when using LOD in an a...“Publishing and Consuming Linked Data. (Lessons learnt when using LOD in an a...
“Publishing and Consuming Linked Data. (Lessons learnt when using LOD in an a...
 
GATE, HLT and Machine Learning, Sheffield, July 2003
GATE, HLT and Machine Learning, Sheffield, July 2003GATE, HLT and Machine Learning, Sheffield, July 2003
GATE, HLT and Machine Learning, Sheffield, July 2003
 
Publishing Linked Data using Schema.org
Publishing Linked Data using Schema.orgPublishing Linked Data using Schema.org
Publishing Linked Data using Schema.org
 
Toward universal information access on the digital object cloud
Toward universal information access on the digital object cloudToward universal information access on the digital object cloud
Toward universal information access on the digital object cloud
 
Distributed Deep Learning + others for Spark Meetup
Distributed Deep Learning + others for Spark MeetupDistributed Deep Learning + others for Spark Meetup
Distributed Deep Learning + others for Spark Meetup
 
Reading Group 2013 (DERI NUIG)
Reading Group 2013 (DERI NUIG)Reading Group 2013 (DERI NUIG)
Reading Group 2013 (DERI NUIG)
 

Plus de Raphael Troncy

K CAP 2019 Opening Ceremony
K CAP 2019 Opening CeremonyK CAP 2019 Opening Ceremony
K CAP 2019 Opening CeremonyRaphael Troncy
 
Semantic Technologies for Connected Vehicles in a Web of Things Environment
Semantic Technologies for Connected Vehicles in a Web of Things EnvironmentSemantic Technologies for Connected Vehicles in a Web of Things Environment
Semantic Technologies for Connected Vehicles in a Web of Things EnvironmentRaphael Troncy
 
HyperTED: exploring video lectures at the fragment levels for enhancing learning
HyperTED: exploring video lectures at the fragment levels for enhancing learningHyperTED: exploring video lectures at the fragment levels for enhancing learning
HyperTED: exploring video lectures at the fragment levels for enhancing learningRaphael Troncy
 
Location Embeddings for Next Trip Recommendation
Location Embeddings for Next Trip RecommendationLocation Embeddings for Next Trip Recommendation
Location Embeddings for Next Trip RecommendationRaphael Troncy
 
A replication study of the top performing systems in SemEval twitter sentimen...
A replication study of the top performing systems in SemEval twitter sentimen...A replication study of the top performing systems in SemEval twitter sentimen...
A replication study of the top performing systems in SemEval twitter sentimen...Raphael Troncy
 
Contextualizing Events in TV News Shows - SNOW 2014
Contextualizing Events in TV News Shows - SNOW 2014Contextualizing Events in TV News Shows - SNOW 2014
Contextualizing Events in TV News Shows - SNOW 2014Raphael Troncy
 
Modeling Geometry and Reference Systems on the Web of Data - LGD 2014
Modeling Geometry and Reference Systems on the Web of Data - LGD 2014Modeling Geometry and Reference Systems on the Web of Data - LGD 2014
Modeling Geometry and Reference Systems on the Web of Data - LGD 2014Raphael Troncy
 
Deep-linking into Media Assets at the Fragment Level SMAM 2013
Deep-linking into Media Assets at the Fragment Level SMAM 2013Deep-linking into Media Assets at the Fragment Level SMAM 2013
Deep-linking into Media Assets at the Fragment Level SMAM 2013Raphael Troncy
 
Describing Media Assets: Media Fragment Specification and Description
Describing Media Assets: Media Fragment Specification and DescriptionDescribing Media Assets: Media Fragment Specification and Description
Describing Media Assets: Media Fragment Specification and DescriptionRaphael Troncy
 
Semantic structuring and linking of event-centric data in the social web
Semantic structuring and linking of event-centric data in the social webSemantic structuring and linking of event-centric data in the social web
Semantic structuring and linking of event-centric data in the social webRaphael Troncy
 
Live topic generation from event streams
Live topic generation from event streamsLive topic generation from event streams
Live topic generation from event streamsRaphael Troncy
 
MediaFinder: Collect, Enrich and Visualize Media Memes Shared by the Crowd
MediaFinder: Collect, Enrich and Visualize Media Memes Shared by the CrowdMediaFinder: Collect, Enrich and Visualize Media Memes Shared by the Crowd
MediaFinder: Collect, Enrich and Visualize Media Memes Shared by the CrowdRaphael Troncy
 
EventMedia Live: Exploring Events Connections in Real-Time to Enhance Content
EventMedia Live: Exploring Events Connections in Real-Time to Enhance ContentEventMedia Live: Exploring Events Connections in Real-Time to Enhance Content
EventMedia Live: Exploring Events Connections in Real-Time to Enhance ContentRaphael Troncy
 
Extracting Media Items from Multiple Social Networks
Extracting Media Items from Multiple Social NetworksExtracting Media Items from Multiple Social Networks
Extracting Media Items from Multiple Social NetworksRaphael Troncy
 
Semantics at the multimedia fragment level or how enabling the remixing of on...
Semantics at the multimedia fragment level or how enabling the remixing of on...Semantics at the multimedia fragment level or how enabling the remixing of on...
Semantics at the multimedia fragment level or how enabling the remixing of on...Raphael Troncy
 
MediaEval 2012 SED Opening
MediaEval 2012 SED OpeningMediaEval 2012 SED Opening
MediaEval 2012 SED OpeningRaphael Troncy
 
DeRiVE 2011 workshop opening
DeRiVE 2011 workshop openingDeRiVE 2011 workshop opening
DeRiVE 2011 workshop openingRaphael Troncy
 
MediaEval 2011 SED Opening
MediaEval 2011 SED OpeningMediaEval 2011 SED Opening
MediaEval 2011 SED OpeningRaphael Troncy
 
ShareIt: Mining SocialMedia Activities for Detecting Events
ShareIt: Mining SocialMedia Activities for Detecting EventsShareIt: Mining SocialMedia Activities for Detecting Events
ShareIt: Mining SocialMedia Activities for Detecting EventsRaphael Troncy
 
Finding media illustrating events
Finding media illustrating eventsFinding media illustrating events
Finding media illustrating eventsRaphael Troncy
 

Plus de Raphael Troncy (20)

K CAP 2019 Opening Ceremony
K CAP 2019 Opening CeremonyK CAP 2019 Opening Ceremony
K CAP 2019 Opening Ceremony
 
Semantic Technologies for Connected Vehicles in a Web of Things Environment
Semantic Technologies for Connected Vehicles in a Web of Things EnvironmentSemantic Technologies for Connected Vehicles in a Web of Things Environment
Semantic Technologies for Connected Vehicles in a Web of Things Environment
 
HyperTED: exploring video lectures at the fragment levels for enhancing learning
HyperTED: exploring video lectures at the fragment levels for enhancing learningHyperTED: exploring video lectures at the fragment levels for enhancing learning
HyperTED: exploring video lectures at the fragment levels for enhancing learning
 
Location Embeddings for Next Trip Recommendation
Location Embeddings for Next Trip RecommendationLocation Embeddings for Next Trip Recommendation
Location Embeddings for Next Trip Recommendation
 
A replication study of the top performing systems in SemEval twitter sentimen...
A replication study of the top performing systems in SemEval twitter sentimen...A replication study of the top performing systems in SemEval twitter sentimen...
A replication study of the top performing systems in SemEval twitter sentimen...
 
Contextualizing Events in TV News Shows - SNOW 2014
Contextualizing Events in TV News Shows - SNOW 2014Contextualizing Events in TV News Shows - SNOW 2014
Contextualizing Events in TV News Shows - SNOW 2014
 
Modeling Geometry and Reference Systems on the Web of Data - LGD 2014
Modeling Geometry and Reference Systems on the Web of Data - LGD 2014Modeling Geometry and Reference Systems on the Web of Data - LGD 2014
Modeling Geometry and Reference Systems on the Web of Data - LGD 2014
 
Deep-linking into Media Assets at the Fragment Level SMAM 2013
Deep-linking into Media Assets at the Fragment Level SMAM 2013Deep-linking into Media Assets at the Fragment Level SMAM 2013
Deep-linking into Media Assets at the Fragment Level SMAM 2013
 
Describing Media Assets: Media Fragment Specification and Description
Describing Media Assets: Media Fragment Specification and DescriptionDescribing Media Assets: Media Fragment Specification and Description
Describing Media Assets: Media Fragment Specification and Description
 
Semantic structuring and linking of event-centric data in the social web
Semantic structuring and linking of event-centric data in the social webSemantic structuring and linking of event-centric data in the social web
Semantic structuring and linking of event-centric data in the social web
 
Live topic generation from event streams
Live topic generation from event streamsLive topic generation from event streams
Live topic generation from event streams
 
MediaFinder: Collect, Enrich and Visualize Media Memes Shared by the Crowd
MediaFinder: Collect, Enrich and Visualize Media Memes Shared by the CrowdMediaFinder: Collect, Enrich and Visualize Media Memes Shared by the Crowd
MediaFinder: Collect, Enrich and Visualize Media Memes Shared by the Crowd
 
EventMedia Live: Exploring Events Connections in Real-Time to Enhance Content
EventMedia Live: Exploring Events Connections in Real-Time to Enhance ContentEventMedia Live: Exploring Events Connections in Real-Time to Enhance Content
EventMedia Live: Exploring Events Connections in Real-Time to Enhance Content
 
Extracting Media Items from Multiple Social Networks
Extracting Media Items from Multiple Social NetworksExtracting Media Items from Multiple Social Networks
Extracting Media Items from Multiple Social Networks
 
Semantics at the multimedia fragment level or how enabling the remixing of on...
Semantics at the multimedia fragment level or how enabling the remixing of on...Semantics at the multimedia fragment level or how enabling the remixing of on...
Semantics at the multimedia fragment level or how enabling the remixing of on...
 
MediaEval 2012 SED Opening
MediaEval 2012 SED OpeningMediaEval 2012 SED Opening
MediaEval 2012 SED Opening
 
DeRiVE 2011 workshop opening
DeRiVE 2011 workshop openingDeRiVE 2011 workshop opening
DeRiVE 2011 workshop opening
 
MediaEval 2011 SED Opening
MediaEval 2011 SED OpeningMediaEval 2011 SED Opening
MediaEval 2011 SED Opening
 
ShareIt: Mining SocialMedia Activities for Detecting Events
ShareIt: Mining SocialMedia Activities for Detecting EventsShareIt: Mining SocialMedia Activities for Detecting Events
ShareIt: Mining SocialMedia Activities for Detecting Events
 
Finding media illustrating events
Finding media illustrating eventsFinding media illustrating events
Finding media illustrating events
 

Dernier

From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 

Dernier (20)

From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 

NERD: an open source platform for extracting and disambiguating named entities in very diverse documents NLP-DBpedia 2013

  • 1. NERD: an open source platform for extracting and disambiguating named entities in very diverse documents Raphaël Troncy <raphael.troncy@eurecom.fr> Giuseppe Rizzo <giuseppe.rizzo@eurecom.fr>
  • 2. What is a Named Entity recognition task?  A task that aims to locate and classify the name of a person or an organization, a location, a brand, a product, a numeric expression including time, date, money and percent in a textual document 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 -2
  • 3. Example  “ I want to book a room in an hotel located in the heart of Paris, just a stone’s throw from the Eiffel Tower ” Eric Charton, “Named Entity Detection and Entity Linking in the Context of Semantic Web: Exploring the ambiguity question” 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 -3
  • 4. Part of Speech I want to book a room in … Paris PRP VBP TO VB DT NN IN … NNP NER: What is Paris? NEL: Which Paris are we talking about? Giuseppe Rizzo, “Learning with the Web: Structuring data to ease machine understanding” 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 -4
  • 5. What is Paris? Type Ambiguity dbpedia-owl:Asteroid schema:City schema:Movie dbpedia-owl:Film Giuseppe Rizzo, “Learning with the Web: Structuring data to ease machine understanding” 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 -5
  • 6. Named Entity Recognition (NER) I want to book a room in … Paris PRP VBP TO VB DT NN IN … NNP O O O O O O O … LOC Giuseppe Rizzo, “Learning with the Web: Structuring data to ease machine understanding” 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 -6
  • 7. What is Paris? Name Ambiguity Paris, Kentucky Paris, France Paris, Maine Paris, Idaho Paris, Tennessee Paris, Ontario Giuseppe Rizzo, “Learning with the Web: Structuring data to ease machine understanding” 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 -7
  • 8. Named Entity Linking (NEL) I want to book a room in … Paris PRP VBP TO VB DT NN IN … NNP O O O O O O O … LOC O O O O O O O … http://dbpedia.org/resource/Paris Giuseppe Rizzo, “Learning with the Web: Structuring data to ease machine understanding” 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 -8
  • 9. NER Tools and Web APIs  Standalone software  GATE  Stanford CoreNLP  Temis http://nerd.eurecom.fr/  Web APIs 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 -9
  • 10. NERD: Named Entity Recognition and Disambiguation  Compare performances of NER and NEL tools  Understand strengths and weaknesses of different Web APIs  Adapt NER processing to different context  (Learn how to) Combine NER (/ NEL) tools  Participate in various benchmarks 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 - 10
  • 11. What is NERD? ontology1 REST API2 UI3 1 http://nerd.eurecom.fr/ontology 2 http://nerd.eurecom.fr/api/application.wadl 3 http://nerd.eurecom.fr 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 - 11
  • 12. Factual comparison of 10 Web NER tools Alchemy API DBpedia Spotlight Evri Extractiv Lupedia Open Calais Saplo Wikimeta Yahoo! Zemanta Language EN,FR, GR,IT, PT,RU, SP,SW EN GR* PT* SP* EN,I T EN EN,FR, IT EN,FR SP EN, SW EN,FR SP EN EN Granularity OEN OEN OED OEN OEN OEN OED OEN OEN OED Entity position N/A char offset N/A word offset range of chars char offset N/A POS offset range of chars N/A Alchemy DBpedia FreeBase Scema.or g Evri DBpedia DBpedia LinkedM DB Open Calais N/A ESTER Yahoo FreeBase Number of classes 324 320 5 34 319 95 5 7 13 81 Response Format JSON MicroF XML RDF HTML JSON RDF XML HTM L JSO N RDF HTML JSON RDF XML HTML JSON RDFa XML JSON MicroF ormat JSON JSON XML JSON XML XML JSON RDF Quota (calls/day) 30000 unl 300 3000 unl 50000 NLP&DBpedia International Workshop, Sydney, October 2013 0 1333 unl 5000 10000 Classification schema 22/10/2013 - 12/15
  • 13. NERD Ontology Aligned the taxonomies used by the extractors 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 - 13
  • 14. NERD type Building the NERD Ontology Occurrence Person 10 Organization 10 Country Company 6 Continent 5 City 5 RadioStation 5 Album 5 Product 5 ... NLP&DBpedia International Workshop, Sydney, October 2013 6 Location 22/10/2013 - 6 ... - 14
  • 15. NERD REST API RDF /document /user /annotation/{extractor} /extraction /evaluation ... GET, POST, PUT, DELETE JSON “entities” : [{ “entity”: “Tim Berners-Lee” , “type”: “Person” , “uri”: "http://dbpedia.org/resource/Tim_berners_lee", “nerdType”: "http://nerd.eurecom.fr/ontology#Person", “startChar”: 30, “endChar”: 45, “confidence”: 1, “relevance”: 0.5 }] Rizzo G., Troncy R. (2012), NERD: A Framework for Unifying Named Entity Recognition and Disambiguation Web Extraction Tools. In: European chapter of the Association for Computational Linguistics (EACL'12), Avignon, France. 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 - 15
  • 16. NERD meets NIF Model documents through a set of strings deferencable on the Web : offset_23107_ 23110 a str:String ; str:referenceContext :offset_0_26546 . Map string to entity : offset_23107_ 23110 sso:oen dbpedia:W3C. Classification dbpedia:W3C rdf:type nerd:Organization . Rizzo G, Troncy R., Hellmann S. and Bruemmer M. (2012), NERD meets NIF: Lifting NLP Extraction Results to the Linked Data Cloud. In: (LDOW'12) Linked Data on the Web (WWW'12), Lyon, France. 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 - 16
  • 17. NERD User Dashboard 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 - 17
  • 18. NERD User Interface 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 - 18
  • 19. History of NER benchmarks  CoNLL 2003 and CoNLL 2005  schema (4 types): person, organization, location and miscellaneous  ACE 2004, ACE 2005 and ACE 2007  schema (7 types): person, organization, location, facility, weapon, vehicle and geo-political entity  entity recognition, co-ref, find relationships among entities extracted  TAC 2009 (Knowledge Base Track)  schema (3 types): person, organization and location  create a knowledge base from the named entities extracted  ETAPE 2012 (Named Entity Task)  schema: Quaero (7 main types, 32 sub-types)  MSM 2013: tweet corpus !  schema (4 types): person, organization, location, miscellaneous 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 - 19
  • 20. ETAPE 2012 challenge genre train dev test TV news 7h 40m 1h 40m 1h 40m BFM Story, Top QUestions (LCP) TV debates 10h 30m 5h 10m 5h 10m Pile et Face, Ca vous regarde, Entre les lignes (LCP) 1h 05m 1h 05m La place du village (TV8) TV amusements - sources Train Dev Eval Item length 26h 10h 55m 10h 55m Nb files 44 15 15 Nb words 290517 91656 115511 Nb Named Entities 46763 14398 13055 Nb unique categories 33 33 33 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 - 20
  • 21. NERD @ ETAPE (naïve combined strategy) extraction (eA1,tA1,URIA1,siA1,eiA1) ... (eA2,tA2,URIA2,siA2,eiA2) (eA3,tA3,URIA3,siA3,eiA3) ... ... cleaning fusion ` 22/10/2013 - (eN1,tN1,URIN1,siN1,eiN1) (eN2,tN2,URIN2,siN2,eiN2) When at least 2 extractors classify the same entity with a different type then we apply a preferred selection order (empirically defined): Wikimeta, AlchemyAPI, OpenCalais, Lupedia NLP&DBpedia International Workshop, Sydney, October 2013 - 21
  • 22. Participation at ETAPE (combined+ strategy) ETAPE Train & Dev ... Learned model POS tagger Created static rules Apply rules (eA1,tA1,URIA1,siA1,eA1 ) (eA2,tA2,URIA2,siA2,eiA2 ) (e1,t1,URI1,si1,ei1) fusion Conflicts handled by priority selection: own, Wikimeta,AlchemyAPI, OpenCalais,Lupedia (eN1,tN1,URIN1,sN1,eN1) `(e ,t ,URI ,s ,e ) N2 N2 N2 N2 N2 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 - 22
  • 23. NERD Global results SLR Precision Recall F-measure %correct combined 86.85% 35.31% 17.69% 23.44% 17.69% combined+ 188.81% 15.13% 28.40% 19.45% 28.40% Combined+ : Eval corpus differs substantially from the Train & Dev corpora. The static rules do not fit well the Eval corpora and they introduce classification noise. 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 - 23
  • 25. 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 - 25
  • 26. Learning How to Combine NER Extractors 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 - 26
  • 27. NERD on CoNLL 2003 (NER task) 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 - 27
  • 28. NERD on MSM 2013 (NER task) 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 - 28
  • 29. NERD on MSM 2013 (NEL task) 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 - 29
  • 30. Media Fragment Enricher: http://mfe.synote.org/mfe/ 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 - 30
  • 31. Linking pieces of knowledge 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 - 31
  • 32. Linking pieces of knowledge 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 - 32
  • 33. Named Entities for Video Classification 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 - 33
  • 34. Workflow 5:Timed Text 6: NEs with time alignment (json) 2: Metadata 7: RDFize (ttl) Media Fragment Enricher Services Metadata & timed-text 1: Video URL NERD Client 3: metadata RDFizator 9: SPARQL query 4:NERDify Video and metadata preview Categorization Triple Store 8: Generate Category Video replay with subtitles and aligned NEs Media Fragment Enricher UI 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 - 34
  • 35. Channel signature based on NE distribution 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 - 35
  • 36. 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 - 36
  • 37. LinkedTV: automatic annotations ... 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 - 37
  • 38. ... and enrichment for hypervideos CONCEPT IN PLAYER Cubism Expressionism Fauvism FACETS / PROPERTIES OF CONCEPT 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 CONTENT ENRICHMENT - 38
  • 39. Media Fragments and Annotations http://data.linkedtv.eu/medi a/e2899e7f#t=840,900 nerd:Location Casablanca nerd:Location Cafe Rick nerd:Person H. Bogart nerd:Person I. Bergman  Media Fragment URI 1.0     22/10/2013 - Chapters Scenes Shots etc… NLP&DBpedia International Workshop, Sydney, October 2013 - 39
  • 40. Enrichment and Hypervideos nerd:Location Casablanca nerd:Location Cafe Rick nerd:Person H. Bogart Nerd:Person E. Tierney 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 nerd:Person I. Bergman nerd:Location China - 40
  • 41. Media Fragment + Open Annotation + NERD Locator MediaResource OffsetBasedString Annotation MediaFragment Entity Type URL (hyperlink) 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 - 41
  • 42. Towards a Linked Media Layer  Enriching media with media from a closed collection (e.g. BBC archive)  The MediaEval scenario (~ 1697 hours of archived BBC video) http://www.multimediaeval.org/mediaeval2013/hyper2013/  Enriching media with content from the open web  LinkedTV scenarios: white listed web sites for each program  Media Collector for Social Media 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 - 42
  • 43. Seed video enriched with web content rbbaktuell_20120809 nerd:Location Brandenburg oa
  • 44. Enrichments are Annotations too 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 - 44
  • 45. Media Finder (named entities clustering) 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 - 45
  • 46. Media Finder (zooming in a cluster) 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 - 46
  • 47. Media Finder: http://mediafinder.eurecom.fr/  Live Topic Generation from Event Streams  WWW 2013 Demo Session  http://www.youtube.com/watch?v=8iRiwz7cDYY 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 - 47
  • 48. Credits  Giuseppe Rizzo, Vuk Milicic, José Luis Redondo Garcia (EURECOM)  Thomas Steiner (Google Inc.)  Marieke van Erp (Free University of Amsterdam)  Yunjia Li (University of Southampton)  … and many other students 22/10/2013 - NLP&DBpedia International Workshop, Sydney, October 2013 - 48