SlideShare une entreprise Scribd logo
1  sur  28
Télécharger pour lire hors ligne
Making Sense of Microposts
(#Microposts2015) @ WWW2015
Named Entity rEcognition
and Linking Challenge
http://www.scc.lancs.ac.uk/microposts2015/challenge/
NEEL challenge overview
➢ Challenging to make sense of Microposts
○ they are very short text messages
○ they contain abbreviations and typos
○ they are “grammar free”
➢ The NEEL challenge aims to explore new
approaches to foster research into novel,
more accurate entity recognition and linking
approaches tailored for Microposts
2013
2014
Information Extraction (IE)
named entity recognition (4 types)
2015
Named Entity Extraction and Linking
(NEEL)
named entity extraction and linking to
DBpedia 3.9 entries
Named Entity rEcognition and Linking
(NEEL)
named entity recognition (7 types) and linking
to DBpedia 2014 entries
➢ normalization
○ linguistic pre-processing and expansion of tweets
➢ entity recognition and linking
○ sequential and semi-joint tasks
○ large Knowledge Bases (such as DBpedia and
Yago) as lexical dictionaries and source of already
existing relations among entities
○ supervised learning approaches to both predict the
type of the entity given the linguistic and contextual
similarity, and the link given the semantic similarity
○ unsupervised learning approaches for grouping
similar lexical entities, affecting the entity resolution
Highlights of the submitted
approaches over the 3-year challenge
Sponsorship
➢ Successfully obtained sponsorship each year
○ highlights importance of this practical research
○ importance extends BEYOND academia
➢ Sponsor has early access to results as senior
PC member
○ opportunity to liaise with participants to extend work
➢ Workshop and participants obtain greater
exposure
➢ Italian company operating in the business of
knowledge extraction and representation
➢ successfully participated in 2014 NEEL
challenge, ranking 3rd overall
29 teams expressed intent to take part in the
challenge
21 teams finally got
involved and signed the
agreement to access to
the NEEL challenge
corpus
NEEL corpus
no. of tweets %
Training 3498 58.06
Development 500 8.3
Test 2027 33.64
NEEL Corpus details
➢ 6025 tweets
○ events from 2011 and 2013 such the London Riots,
the Oslo bombing (cf. event-annotated tweets
provided by the Redites project)
○ events in 2014 such as UCI Cyclo-cross World Cup
➢ Corpus available after having signed the
NEEL Agreement Form
(remains available by contacting msm.
orgcom@gmail.com)
Manual creation of the Gold
Standard
3-step annotation
1. unsupervised annotations, with intent to
extract candidate links which were used as
input to the second stage. NERD-ML was
used as off-the-shelf system
2. three human annotators analyzed and
complemented the annotations. GATE was
used as the workbench
3. one domain expert reviewed and resolved
problematic cases
Evaluation protocol
Participants were asked to wrap their
prototypes as a publicly accessible
web service following a REST-based
protocol
Widen the dissemination, ensure the
reproducibility, the reuse, and the
correctness of the results
Evaluation periods
D-Time to test the contending entries
(REST APIs) submitted by the
participants
T-Time for the final evaluation and
metric computations
Submissions and Runs
➢ Paper submission
○ describing approach taken
○ identifying and detailing any limitations or
dependencies of approach
➢ Up to 10 contending entries
○ best of 3 used for the final ranking
Evaluation scorer
TAC KBP official scorer
https://github.com/wikilinks/neleval
Evaluation metrics
tagging strong_typed_mention_match
(check entity name boundary and type)
linking strong_link_match
clustering mention_ceaf (NIL over the exact
match of the entities)
latency computation time
Ranking strategy
rs
= 0.4*clusteringF1
+
0.3*taggingF1
+
0.3*linkingF1
we resolved to the latency to sort draws
7 teams participated to
the T-Time
Drop of 14 participants
due to complexity
i) of the challenge protocol, which has
required broaden expertise in different
domains such as Information Extraction,
Data Semantics, and Web
ii) generally low results
And the winner is ...
Ikuya Yamada, Hideaki Takeda and
Yoshiyasu Takefuji
An End-to-End Entity Linking Approach
for Tweets
Team Ousia
rank runid
team name rs
1 9 ousia 0.8067
2 7 acubelab 0.4757
3 guru uva 0.4756
4 UNIBA-SUP uniba 0.4329
5 ualberta ualberta 0.3808
6 CEN_NEEL_1 cen_neel 0.0004
7 run2 tcs-iitkgp NCA*
NEEL Final Ranking
NCA = annotations not compliant with the NEEL specs
NEEL Final Ranking
breakdown per clusteringF1
rank runid
team name clusteringF1
1 9 ousia 0.84
2 guru uva 0.643
3 7 acubelab 0.506
4 UNIBA-SUP uniba 0.459
5 ualberta ualberta 0.394
6 CEN_NEEL_1 cen_neel 0.001
7 run2 tcs-iitkgp NCA
NEEL Final Ranking
breakdown per taggingF1
rank runid
team name taggingF1
1 9 ousia 0.807
2 guru uva 0.412
3 7 acubelab 0.388
4 UNIBA-SUP uniba 0.367
5 ualberta ualberta 0.329
6 CEN_NEEL_1 cen_neel 0
7 run2 tcs-iitkgp NCA
NEEL Final Ranking
breakdown per linkingF1
rank runid
team name linkingF1
1 9 ousia 0.762
2 7 acubelab 0.523
4 UNIBA-SUP uniba 0.464
5 ualberta ualberta 0.415
3 guru uva 0.316
6 CEN_NEEL_1 cen_neel 0
7 run2 tcs-iitkgp NCA
NEEL Final Ranking
breakdown per submission
rank team name runID
taggingF1
clusteringF1
linkingF1
latency[ms] score
1 ousia 9 0.807 0.84 0.762 8500.99 +/- 3619.12 0.8067
2 ousia 5 0.68 0.843 0.762 8477.88 +/- 3596.47 0.7698
3 ousia 10 0.679 0.842 0.762 8493.38 +/-3562.96 0.7691
4 acubelab 7 0.388 0.506 0.523 127.97 +/- 21.84 0.4757
5 uva guru 0.412 0.643 0.316 186.95 +/- 88.53 0.4756
6 acubelab 6 0.385 0.506 0.524 126.55 +/- 20.31 0.4751
7 acubelab 9 0.386 0.504 0.52 126.54 +/- 19.16 0.4734
8 uva wiz 0.404 0.642 0.285 187.83 +/- 99.78 0.4635
9 uva qtip 0.383 0.595 0.318 1731.16 +/- 857.98 0.4483
10 uniba UNIBA-SUP 0.367 0.459 0.464 2034.75 +/- 2346.23 0.4329
11 ualberta ualberta 0.329 0.394 0.415 3406.43 +/- 7625.28 0.3808
12 uniba
UNIBA-
UNSUP 0.283 0.37 0.348 761.88 +/- 631.59 0.3373
13 cen_neel
CEN_NEEL_
1 0 0.001 0 12366.61 +/- 27598.28 0.0004
14 tcs-iitkgp run2 NCA NCA NCA 12888.27 +/- 11654.02 NaN
15 tcs-iitkgp run4 NCA NCA NCA 12909.65 +/- 11593.13 NaN
16 tcs-iitkgp run10 NCA NCA NCA 12831.80 +/- 11538.43 NaN
Acknowledgements
The research leading to this
work was partially supported by
the European Union’s 7th
Framework Programme via the
projects LinkedTV

Contenu connexe

Tendances

Ariadne: Semantic Annotation and Linking
Ariadne: Semantic Annotation and LinkingAriadne: Semantic Annotation and Linking
Ariadne: Semantic Annotation and Linking
ariadnenetwork
 
Schema-agnositc queries over large-schema databases: a distributional semanti...
Schema-agnositc queries over large-schema databases: a distributional semanti...Schema-agnositc queries over large-schema databases: a distributional semanti...
Schema-agnositc queries over large-schema databases: a distributional semanti...
Andre Freitas
 
Knowledge extraction in Web media: at the frontier of NLP, Machine Learning a...
Knowledge extraction in Web media: at the frontier of NLP, Machine Learning a...Knowledge extraction in Web media: at the frontier of NLP, Machine Learning a...
Knowledge extraction in Web media: at the frontier of NLP, Machine Learning a...
Julien PLU
 
EKAW 2016 - Ontology Forecasting in Scientific Literature: Semantic Concepts ...
EKAW 2016 - Ontology Forecasting in Scientific Literature: Semantic Concepts ...EKAW 2016 - Ontology Forecasting in Scientific Literature: Semantic Concepts ...
EKAW 2016 - Ontology Forecasting in Scientific Literature: Semantic Concepts ...
Francesco Osborne
 

Tendances (19)

AINL 2016: Galinsky, Alekseev, Nikolenko
AINL 2016: Galinsky, Alekseev, NikolenkoAINL 2016: Galinsky, Alekseev, Nikolenko
AINL 2016: Galinsky, Alekseev, Nikolenko
 
AINL 2016: Alekseev, Nikolenko
AINL 2016: Alekseev, NikolenkoAINL 2016: Alekseev, Nikolenko
AINL 2016: Alekseev, Nikolenko
 
EKAW 2016 - TechMiner: Extracting Technologies from Academic Publications
EKAW 2016 - TechMiner: Extracting Technologies from Academic PublicationsEKAW 2016 - TechMiner: Extracting Technologies from Academic Publications
EKAW 2016 - TechMiner: Extracting Technologies from Academic Publications
 
Crash-course in Natural Language Processing
Crash-course in Natural Language ProcessingCrash-course in Natural Language Processing
Crash-course in Natural Language Processing
 
OO Metrics
OO MetricsOO Metrics
OO Metrics
 
Entity Linking in Queries: Tasks and Evaluation
Entity Linking in Queries: Tasks and EvaluationEntity Linking in Queries: Tasks and Evaluation
Entity Linking in Queries: Tasks and Evaluation
 
Aspects of NLP Practice
Aspects of NLP PracticeAspects of NLP Practice
Aspects of NLP Practice
 
Ariadne: Semantic Annotation and Linking
Ariadne: Semantic Annotation and LinkingAriadne: Semantic Annotation and Linking
Ariadne: Semantic Annotation and Linking
 
AINL 2016: Bastrakova, Ledesma, Millan, Zighed
AINL 2016: Bastrakova, Ledesma, Millan, ZighedAINL 2016: Bastrakova, Ledesma, Millan, Zighed
AINL 2016: Bastrakova, Ledesma, Millan, Zighed
 
Schema-agnositc queries over large-schema databases: a distributional semanti...
Schema-agnositc queries over large-schema databases: a distributional semanti...Schema-agnositc queries over large-schema databases: a distributional semanti...
Schema-agnositc queries over large-schema databases: a distributional semanti...
 
Knowledge extraction in Web media: at the frontier of NLP, Machine Learning a...
Knowledge extraction in Web media: at the frontier of NLP, Machine Learning a...Knowledge extraction in Web media: at the frontier of NLP, Machine Learning a...
Knowledge extraction in Web media: at the frontier of NLP, Machine Learning a...
 
EKAW 2016 - Ontology Forecasting in Scientific Literature: Semantic Concepts ...
EKAW 2016 - Ontology Forecasting in Scientific Literature: Semantic Concepts ...EKAW 2016 - Ontology Forecasting in Scientific Literature: Semantic Concepts ...
EKAW 2016 - Ontology Forecasting in Scientific Literature: Semantic Concepts ...
 
Supporting Springer Nature Editors by means of Semantic Technologies
Supporting Springer Nature Editors by means of Semantic TechnologiesSupporting Springer Nature Editors by means of Semantic Technologies
Supporting Springer Nature Editors by means of Semantic Technologies
 
Linked science presentation 25
Linked science presentation 25Linked science presentation 25
Linked science presentation 25
 
Crash Course in Natural Language Processing (2016)
Crash Course in Natural Language Processing (2016)Crash Course in Natural Language Processing (2016)
Crash Course in Natural Language Processing (2016)
 
The SentiME System at the SSA Challenge Task 1
The SentiME System at the SSA Challenge Task 1The SentiME System at the SSA Challenge Task 1
The SentiME System at the SSA Challenge Task 1
 
Can functional programming be liberated from static typing?
Can functional programming be liberated from static typing?Can functional programming be liberated from static typing?
Can functional programming be liberated from static typing?
 
SANAPHOR: Ontology-based Coreference Resolution
SANAPHOR: Ontology-based Coreference ResolutionSANAPHOR: Ontology-based Coreference Resolution
SANAPHOR: Ontology-based Coreference Resolution
 
Group 8 presentation_metrics_for_object_oriented_system
Group 8 presentation_metrics_for_object_oriented_systemGroup 8 presentation_metrics_for_object_oriented_system
Group 8 presentation_metrics_for_object_oriented_system
 

En vedette

Locals Slides - UpCity
Locals Slides - UpCityLocals Slides - UpCity
Locals Slides - UpCity
Adi Buzgar
 
Chapter 4 Popular Radio
Chapter 4   Popular RadioChapter 4   Popular Radio
Chapter 4 Popular Radio
Jill Falk
 
Gaudi Mjo
Gaudi MjoGaudi Mjo
Gaudi Mjo
enritro
 
V3 basic pm training overview thammasat
V3 basic pm training overview thammasatV3 basic pm training overview thammasat
V3 basic pm training overview thammasat
Robert Twiddy
 
MONYC Power Point Presentation 2009
MONYC Power Point Presentation 2009MONYC Power Point Presentation 2009
MONYC Power Point Presentation 2009
jgalosic
 
Waspada Aceh 10 8 2009
Waspada Aceh 10 8 2009Waspada Aceh 10 8 2009
Waspada Aceh 10 8 2009
epaper
 
Fresh forward compensation plan
Fresh forward compensation planFresh forward compensation plan
Fresh forward compensation plan
Victor Manalac
 
Binder19 September
Binder19 SeptemberBinder19 September
Binder19 September
epaper
 
Medan031009
Medan031009Medan031009
Medan031009
epaper
 

En vedette (20)

Locals Slides - UpCity
Locals Slides - UpCityLocals Slides - UpCity
Locals Slides - UpCity
 
Chapter 4 Popular Radio
Chapter 4   Popular RadioChapter 4   Popular Radio
Chapter 4 Popular Radio
 
Dubai. Religion
Dubai. ReligionDubai. Religion
Dubai. Religion
 
Gaudi Mjo
Gaudi MjoGaudi Mjo
Gaudi Mjo
 
Make The Most Of Your Marketing Budget!
Make The Most Of Your Marketing Budget!Make The Most Of Your Marketing Budget!
Make The Most Of Your Marketing Budget!
 
V3 basic pm training overview thammasat
V3 basic pm training overview thammasatV3 basic pm training overview thammasat
V3 basic pm training overview thammasat
 
Fastest Startups of the World (2014)
Fastest Startups of the World (2014)Fastest Startups of the World (2014)
Fastest Startups of the World (2014)
 
MONYC Power Point Presentation 2009
MONYC Power Point Presentation 2009MONYC Power Point Presentation 2009
MONYC Power Point Presentation 2009
 
Waspada Aceh 10 8 2009
Waspada Aceh 10 8 2009Waspada Aceh 10 8 2009
Waspada Aceh 10 8 2009
 
White Sands Look Book Ss10
White Sands  Look Book Ss10White Sands  Look Book Ss10
White Sands Look Book Ss10
 
55 Firstworld
55 Firstworld55 Firstworld
55 Firstworld
 
K2
K2K2
K2
 
Apex Application Form
Apex Application FormApex Application Form
Apex Application Form
 
Fresh forward compensation plan
Fresh forward compensation planFresh forward compensation plan
Fresh forward compensation plan
 
Brownian Motion Publication
Brownian Motion PublicationBrownian Motion Publication
Brownian Motion Publication
 
Journalism today1 - slideshare
Journalism today1  -  slideshareJournalism today1  -  slideshare
Journalism today1 - slideshare
 
Ecommerce 2k9
Ecommerce 2k9Ecommerce 2k9
Ecommerce 2k9
 
Binder19 September
Binder19 SeptemberBinder19 September
Binder19 September
 
Medan031009
Medan031009Medan031009
Medan031009
 
Project Exploration Experience
Project Exploration ExperienceProject Exploration Experience
Project Exploration Experience
 

Similaire à NEEL2015 challenge summary

Search and Hyperlinking Overview @MediaEval2014
Search and Hyperlinking Overview @MediaEval2014Search and Hyperlinking Overview @MediaEval2014
Search and Hyperlinking Overview @MediaEval2014
Maria Eskevich
 
Unit1 - Individual Project Due on 03292014A company wan.docx
Unit1 - Individual Project       Due on  03292014A company wan.docxUnit1 - Individual Project       Due on  03292014A company wan.docx
Unit1 - Individual Project Due on 03292014A company wan.docx
dickonsondorris
 
Profile-based Dataset Recommendation for RDF Data Linking
Profile-based Dataset Recommendation for RDF Data Linking  Profile-based Dataset Recommendation for RDF Data Linking
Profile-based Dataset Recommendation for RDF Data Linking
Mohamed BEN ELLEFI
 

Similaire à NEEL2015 challenge summary (20)

Lessons Learnt from the Named Entity rEcognition and Linking (NEEL) Challenge...
Lessons Learnt from the Named Entity rEcognition and Linking (NEEL) Challenge...Lessons Learnt from the Named Entity rEcognition and Linking (NEEL) Challenge...
Lessons Learnt from the Named Entity rEcognition and Linking (NEEL) Challenge...
 
ntcir14centre-overview
ntcir14centre-overviewntcir14centre-overview
ntcir14centre-overview
 
Search and Hyperlinking Overview @MediaEval2014
Search and Hyperlinking Overview @MediaEval2014Search and Hyperlinking Overview @MediaEval2014
Search and Hyperlinking Overview @MediaEval2014
 
Online Index Extraction from Linked Open Data Sources
Online Index Extraction from Linked Open Data SourcesOnline Index Extraction from Linked Open Data Sources
Online Index Extraction from Linked Open Data Sources
 
NLP Data Cleansing Based on Linguistic Ontology Constraints
NLP Data Cleansing Based on Linguistic Ontology ConstraintsNLP Data Cleansing Based on Linguistic Ontology Constraints
NLP Data Cleansing Based on Linguistic Ontology Constraints
 
AUTOMATIC QUESTION GENERATION USING NATURAL LANGUAGE PROCESSING
AUTOMATIC QUESTION GENERATION USING NATURAL LANGUAGE PROCESSINGAUTOMATIC QUESTION GENERATION USING NATURAL LANGUAGE PROCESSING
AUTOMATIC QUESTION GENERATION USING NATURAL LANGUAGE PROCESSING
 
Natural Language Interface to Knowledge Graph
Natural Language Interface to Knowledge GraphNatural Language Interface to Knowledge Graph
Natural Language Interface to Knowledge Graph
 
Unit1 - Individual Project Due on 03292014A company wan.docx
Unit1 - Individual Project       Due on  03292014A company wan.docxUnit1 - Individual Project       Due on  03292014A company wan.docx
Unit1 - Individual Project Due on 03292014A company wan.docx
 
Metadata for Research Objects
Metadata for Research ObjectsMetadata for Research Objects
Metadata for Research Objects
 
Triantafyllia Voulibasi
Triantafyllia VoulibasiTriantafyllia Voulibasi
Triantafyllia Voulibasi
 
ThesisPresentation
ThesisPresentationThesisPresentation
ThesisPresentation
 
Standard Datasets in Information Retrieval
Standard Datasets in Information Retrieval Standard Datasets in Information Retrieval
Standard Datasets in Information Retrieval
 
Profile-based Dataset Recommendation for RDF Data Linking
Profile-based Dataset Recommendation for RDF Data Linking  Profile-based Dataset Recommendation for RDF Data Linking
Profile-based Dataset Recommendation for RDF Data Linking
 
SCOPUS PAPER EJMCM.pdf
SCOPUS PAPER EJMCM.pdfSCOPUS PAPER EJMCM.pdf
SCOPUS PAPER EJMCM.pdf
 
Co-evolving changes in a data-intensive software system
Co-evolving changes in a data-intensive software systemCo-evolving changes in a data-intensive software system
Co-evolving changes in a data-intensive software system
 
Using DBpedia for Spotting and Disambiguating Entities
Using DBpedia for Spotting and Disambiguating EntitiesUsing DBpedia for Spotting and Disambiguating Entities
Using DBpedia for Spotting and Disambiguating Entities
 
Norman and McCraken, "OpenURL Implementation: Link Resolution That Users Will...
Norman and McCraken, "OpenURL Implementation: Link Resolution That Users Will...Norman and McCraken, "OpenURL Implementation: Link Resolution That Users Will...
Norman and McCraken, "OpenURL Implementation: Link Resolution That Users Will...
 
Neo4j workshop at GraphSummit London 14 Nov 2023.pdf
Neo4j workshop at GraphSummit London 14 Nov 2023.pdfNeo4j workshop at GraphSummit London 14 Nov 2023.pdf
Neo4j workshop at GraphSummit London 14 Nov 2023.pdf
 
Cse443 Project Report - LPU (Modern Big Data Analysis with SQL Specialization)
Cse443 Project Report - LPU (Modern Big Data Analysis with SQL Specialization)Cse443 Project Report - LPU (Modern Big Data Analysis with SQL Specialization)
Cse443 Project Report - LPU (Modern Big Data Analysis with SQL Specialization)
 
7th SDN Expert Group Seminar - Session1
7th SDN Expert Group Seminar - Session17th SDN Expert Group Seminar - Session1
7th SDN Expert Group Seminar - Session1
 

Plus de Giuseppe Rizzo

Zenaminer: driving the SCORM tandard towards the Web of Data
Zenaminer: driving the SCORM tandard towards the Web of DataZenaminer: driving the SCORM tandard towards the Web of Data
Zenaminer: driving the SCORM tandard towards the Web of Data
Giuseppe Rizzo
 

Plus de Giuseppe Rizzo (20)

Artificial intelligence for social good
Artificial intelligence for social goodArtificial intelligence for social good
Artificial intelligence for social good
 
AI in 60 minutes
AI in 60 minutesAI in 60 minutes
AI in 60 minutes
 
COMPRENDE, PERSONALIZZA, INTERAGISCE E IMPARA: L’AI COGNITIVA PER L’HR
COMPRENDE, PERSONALIZZA, INTERAGISCE E  IMPARA: L’AI COGNITIVA PER L’HRCOMPRENDE, PERSONALIZZA, INTERAGISCE E  IMPARA: L’AI COGNITIVA PER L’HR
COMPRENDE, PERSONALIZZA, INTERAGISCE E IMPARA: L’AI COGNITIVA PER L’HR
 
Understand, Answer and Argument: Conversational Agents
Understand, Answer and Argument: Conversational AgentsUnderstand, Answer and Argument: Conversational Agents
Understand, Answer and Argument: Conversational Agents
 
AI For Profiling Your Customers
AI For Profiling Your CustomersAI For Profiling Your Customers
AI For Profiling Your Customers
 
AI for Personalized Chatbot
AI for Personalized ChatbotAI for Personalized Chatbot
AI for Personalized Chatbot
 
Tourist Knowledge Graph Creation to Automating Travel Bookings
Tourist Knowledge Graph Creation to Automating Travel BookingsTourist Knowledge Graph Creation to Automating Travel Bookings
Tourist Knowledge Graph Creation to Automating Travel Bookings
 
Context-Enhanced Adaptive Entity Linking
Context-Enhanced Adaptive Entity LinkingContext-Enhanced Adaptive Entity Linking
Context-Enhanced Adaptive Entity Linking
 
From Data to Knowledge for Tourists
From Data to Knowledge for TouristsFrom Data to Knowledge for Tourists
From Data to Knowledge for Tourists
 
Enabling Visitors to Explore a Smart City
Enabling Visitors to Explore a Smart CityEnabling Visitors to Explore a Smart City
Enabling Visitors to Explore a Smart City
 
Inductive Entity Typing Alignment
Inductive Entity Typing AlignmentInductive Entity Typing Alignment
Inductive Entity Typing Alignment
 
Benchmarking the Extraction and Disambiguation of Named Entities on the Seman...
Benchmarking the Extraction and Disambiguation of Named Entities on the Seman...Benchmarking the Extraction and Disambiguation of Named Entities on the Seman...
Benchmarking the Extraction and Disambiguation of Named Entities on the Seman...
 
CrossLanguageSpotter: A Library for Detecting Relations in Polyglot Frameworks
CrossLanguageSpotter: A Library for Detecting Relations in Polyglot FrameworksCrossLanguageSpotter: A Library for Detecting Relations in Polyglot Frameworks
CrossLanguageSpotter: A Library for Detecting Relations in Polyglot Frameworks
 
Learning with the Web. Structuring data to ease machine understanding
Learning with the Web. Structuring data to ease  machine understandingLearning with the Web. Structuring data to ease  machine understanding
Learning with the Web. Structuring data to ease machine understanding
 
Learning with the Web: Spotting Named Entities on the intersection of NERD an...
Learning with the Web: Spotting Named Entities on the intersection of NERD an...Learning with the Web: Spotting Named Entities on the intersection of NERD an...
Learning with the Web: Spotting Named Entities on the intersection of NERD an...
 
NERD meets NIF: Lifting NLP Extraction Results to the Linked Data Cloud
NERD meets NIF:  Lifting NLP Extraction Results to the Linked Data CloudNERD meets NIF:  Lifting NLP Extraction Results to the Linked Data Cloud
NERD meets NIF: Lifting NLP Extraction Results to the Linked Data Cloud
 
The NERD project
The NERD projectThe NERD project
The NERD project
 
L'enorme archivio di dati: il Web
L'enorme archivio di dati: il WebL'enorme archivio di dati: il Web
L'enorme archivio di dati: il Web
 
NERD: Evaluating Named Entity Recognition Tools in the Web of Data
NERD: Evaluating Named Entity Recognition Tools in the Web of DataNERD: Evaluating Named Entity Recognition Tools in the Web of Data
NERD: Evaluating Named Entity Recognition Tools in the Web of Data
 
Zenaminer: driving the SCORM tandard towards the Web of Data
Zenaminer: driving the SCORM tandard towards the Web of DataZenaminer: driving the SCORM tandard towards the Web of Data
Zenaminer: driving the SCORM tandard towards the Web of Data
 

Dernier

Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
Areesha Ahmad
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Sérgio Sacani
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Sérgio Sacani
 

Dernier (20)

FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
 
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedConnaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
Grade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its FunctionsGrade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its Functions
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
Introduction to Viruses
Introduction to VirusesIntroduction to Viruses
Introduction to Viruses
 
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts ServiceJustdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
 

NEEL2015 challenge summary

  • 1. Making Sense of Microposts (#Microposts2015) @ WWW2015 Named Entity rEcognition and Linking Challenge http://www.scc.lancs.ac.uk/microposts2015/challenge/
  • 2. NEEL challenge overview ➢ Challenging to make sense of Microposts ○ they are very short text messages ○ they contain abbreviations and typos ○ they are “grammar free” ➢ The NEEL challenge aims to explore new approaches to foster research into novel, more accurate entity recognition and linking approaches tailored for Microposts
  • 3. 2013 2014 Information Extraction (IE) named entity recognition (4 types) 2015 Named Entity Extraction and Linking (NEEL) named entity extraction and linking to DBpedia 3.9 entries Named Entity rEcognition and Linking (NEEL) named entity recognition (7 types) and linking to DBpedia 2014 entries
  • 4. ➢ normalization ○ linguistic pre-processing and expansion of tweets ➢ entity recognition and linking ○ sequential and semi-joint tasks ○ large Knowledge Bases (such as DBpedia and Yago) as lexical dictionaries and source of already existing relations among entities ○ supervised learning approaches to both predict the type of the entity given the linguistic and contextual similarity, and the link given the semantic similarity ○ unsupervised learning approaches for grouping similar lexical entities, affecting the entity resolution Highlights of the submitted approaches over the 3-year challenge
  • 5. Sponsorship ➢ Successfully obtained sponsorship each year ○ highlights importance of this practical research ○ importance extends BEYOND academia ➢ Sponsor has early access to results as senior PC member ○ opportunity to liaise with participants to extend work ➢ Workshop and participants obtain greater exposure
  • 6. ➢ Italian company operating in the business of knowledge extraction and representation ➢ successfully participated in 2014 NEEL challenge, ranking 3rd overall
  • 7. 29 teams expressed intent to take part in the challenge
  • 8. 21 teams finally got involved and signed the agreement to access to the NEEL challenge corpus
  • 9. NEEL corpus no. of tweets % Training 3498 58.06 Development 500 8.3 Test 2027 33.64
  • 10. NEEL Corpus details ➢ 6025 tweets ○ events from 2011 and 2013 such the London Riots, the Oslo bombing (cf. event-annotated tweets provided by the Redites project) ○ events in 2014 such as UCI Cyclo-cross World Cup ➢ Corpus available after having signed the NEEL Agreement Form (remains available by contacting msm. orgcom@gmail.com)
  • 11. Manual creation of the Gold Standard 3-step annotation 1. unsupervised annotations, with intent to extract candidate links which were used as input to the second stage. NERD-ML was used as off-the-shelf system 2. three human annotators analyzed and complemented the annotations. GATE was used as the workbench 3. one domain expert reviewed and resolved problematic cases
  • 12. Evaluation protocol Participants were asked to wrap their prototypes as a publicly accessible web service following a REST-based protocol Widen the dissemination, ensure the reproducibility, the reuse, and the correctness of the results
  • 13. Evaluation periods D-Time to test the contending entries (REST APIs) submitted by the participants T-Time for the final evaluation and metric computations
  • 14. Submissions and Runs ➢ Paper submission ○ describing approach taken ○ identifying and detailing any limitations or dependencies of approach ➢ Up to 10 contending entries ○ best of 3 used for the final ranking
  • 15. Evaluation scorer TAC KBP official scorer https://github.com/wikilinks/neleval
  • 16. Evaluation metrics tagging strong_typed_mention_match (check entity name boundary and type) linking strong_link_match clustering mention_ceaf (NIL over the exact match of the entities) latency computation time
  • 18. 7 teams participated to the T-Time
  • 19. Drop of 14 participants due to complexity i) of the challenge protocol, which has required broaden expertise in different domains such as Information Extraction, Data Semantics, and Web ii) generally low results
  • 20. And the winner is ...
  • 21. Ikuya Yamada, Hideaki Takeda and Yoshiyasu Takefuji An End-to-End Entity Linking Approach for Tweets Team Ousia
  • 22. rank runid team name rs 1 9 ousia 0.8067 2 7 acubelab 0.4757 3 guru uva 0.4756 4 UNIBA-SUP uniba 0.4329 5 ualberta ualberta 0.3808 6 CEN_NEEL_1 cen_neel 0.0004 7 run2 tcs-iitkgp NCA* NEEL Final Ranking NCA = annotations not compliant with the NEEL specs
  • 23. NEEL Final Ranking breakdown per clusteringF1 rank runid team name clusteringF1 1 9 ousia 0.84 2 guru uva 0.643 3 7 acubelab 0.506 4 UNIBA-SUP uniba 0.459 5 ualberta ualberta 0.394 6 CEN_NEEL_1 cen_neel 0.001 7 run2 tcs-iitkgp NCA
  • 24. NEEL Final Ranking breakdown per taggingF1 rank runid team name taggingF1 1 9 ousia 0.807 2 guru uva 0.412 3 7 acubelab 0.388 4 UNIBA-SUP uniba 0.367 5 ualberta ualberta 0.329 6 CEN_NEEL_1 cen_neel 0 7 run2 tcs-iitkgp NCA
  • 25. NEEL Final Ranking breakdown per linkingF1 rank runid team name linkingF1 1 9 ousia 0.762 2 7 acubelab 0.523 4 UNIBA-SUP uniba 0.464 5 ualberta ualberta 0.415 3 guru uva 0.316 6 CEN_NEEL_1 cen_neel 0 7 run2 tcs-iitkgp NCA
  • 27. rank team name runID taggingF1 clusteringF1 linkingF1 latency[ms] score 1 ousia 9 0.807 0.84 0.762 8500.99 +/- 3619.12 0.8067 2 ousia 5 0.68 0.843 0.762 8477.88 +/- 3596.47 0.7698 3 ousia 10 0.679 0.842 0.762 8493.38 +/-3562.96 0.7691 4 acubelab 7 0.388 0.506 0.523 127.97 +/- 21.84 0.4757 5 uva guru 0.412 0.643 0.316 186.95 +/- 88.53 0.4756 6 acubelab 6 0.385 0.506 0.524 126.55 +/- 20.31 0.4751 7 acubelab 9 0.386 0.504 0.52 126.54 +/- 19.16 0.4734 8 uva wiz 0.404 0.642 0.285 187.83 +/- 99.78 0.4635 9 uva qtip 0.383 0.595 0.318 1731.16 +/- 857.98 0.4483 10 uniba UNIBA-SUP 0.367 0.459 0.464 2034.75 +/- 2346.23 0.4329 11 ualberta ualberta 0.329 0.394 0.415 3406.43 +/- 7625.28 0.3808 12 uniba UNIBA- UNSUP 0.283 0.37 0.348 761.88 +/- 631.59 0.3373 13 cen_neel CEN_NEEL_ 1 0 0.001 0 12366.61 +/- 27598.28 0.0004 14 tcs-iitkgp run2 NCA NCA NCA 12888.27 +/- 11654.02 NaN 15 tcs-iitkgp run4 NCA NCA NCA 12909.65 +/- 11593.13 NaN 16 tcs-iitkgp run10 NCA NCA NCA 12831.80 +/- 11538.43 NaN
  • 28. Acknowledgements The research leading to this work was partially supported by the European Union’s 7th Framework Programme via the projects LinkedTV