SlideShare une entreprise Scribd logo
1  sur  36
Télécharger pour lire hors ligne
A look inside
Babelfy:
Examining the
bubble
Filip IlievskiMinh Le
Reproducibility
“Reproducibility is a defining feature of science,
but the extent to which it characterizes current research
is unknown.”
(Estimating the reproducibility of psychological science, 2015)
“According to the replicators’ qualitative assessments,
only 39 of the 100 replication attempts were successful.”
(http://www.nature.com/news/over-half-of-psychology-studies-fa
il-reproducibility-test-1.18248)
Replication is h.a.r.d.
“According to the replicators’ qualitative assessments,
only 39 of the 100 replication attempts were successful.”
“BUT, whether a replication attempt is considered successful is not
straightforward.”
(http://www.nature.com/news/over-half-of-psychology-studies-fa
il-reproducibility-test-1.18248)
Everyone has a replication story
to tell
... And here is ours!
Babelfy
● Solution published in 2014
Babelfy
● Solution published in 2014
● State-of-the-art reported performance
Babelfy
● Solution published in 2014
● State-of-the-art reported performance
● Great algorithm
○ Multilinguality
○ Joint decision making
○ Combined knowledge from multiple sources
■ NLP (EL, WSD)
■ Semantic Web (BabelNet)
Babelfy
● Solution published in 2014
● State-of-the-art reported performance
● Great algorithm
○ Multilinguality
○ Joint decision making
○ Combined knowledge from multiple sources
■ NLP (EL, WSD)
■ Semantic Web (BabelNet)
But: No open source available
Idea: Reimplement
● No open source available
● Curiosity how the algorithm works in detail
● Accelerate research by opening the source code
Algorithm
1. Preprocessing (run once)
1.1. Input: BabelNet graph (undirected, labeled graph of concepts and
entities)
1.2. Re-weight the graph
1.3. Random walk with restart
1.4. Output: set of related concepts and entities
2. Disambiguation (for each document)
2.1. Generate candidates
2.2. Construct referent graph
2.3. Apply Densest subgraph algorithm
2.4. Disambiguate with a threshold
1) Graph -> Concept set
->
Algorithm
1. Preprocessing (run once)
1.1. Input: BabelNet graph (undirected, labeled graph of concepts and
entities)
1.2. Re-weight the graph
1.3. Random walk with restart
1.4. Output: set of related concepts and entities
2. Disambiguation (for each document)
2.1. Generate candidates
2.2. Construct referent graph
2.3. Apply Densest subgraph algorithm
2.4. Disambiguate with a threshold
Thomas and Mario are strikers playing in Munich.
2) Disambiguation
2) Disambiguation:
Graph construction
Thomas and Mario are strikers playing in Munich.
2) Disambiguation:
Densest subgraph
Thomas and Mario are strikers playing in Munich.
Experimental setup
We ran experiments on:
a University server
16/32GB RAM
1TB disk space
16 cores CPU @ 2.60 Ghz
Experiments
KORE-50 dataset: 50 ‘difficult’ sentences
Execution time: KORE-50
1. Preprocessing (run once)
1.1. Input: BabelNet graph (undirected, labeled graph of concepts and
entities)
1.2. Re-weight the graph
1.3. Random walk with restart
1.4. Output: set of related concepts and entities
2. Disambiguation (for each document)
2.1. Generate candidates
2.2. Construct referent graph
2.3. Apply Densest subgraph algorithm
2.4. Disambiguate with a threshold
10.3 hours
3.5 days
Results: KORE-50
Precision Recall F1
Recognition 0.82 0.77 0.79
Disambiguation 0.41 0.40 0.40
Disambiguation
(reported by
Babelfy)
/ / 0.715
Discussion
● significant accuracy mismatch
○ Reported F1-score: 71.5 %
○ Our F1-score: 40%
Discussion
● significant accuracy mismatch
● algorithm gaps that matter
○ partial matching for entities, exact matching for concepts: but how
do we know if something is an entity?
○ different parameters for EL and WSD
○ parameters tuned per dataset :-(
○ what happens if the linking conference is too low?
Discussion
● significant accuracy mismatch
● algorithm gaps that matter
● computationally quite heavy
For KORE-50:
○ 3.5 days to preprocess data
○ 17 mins to generate candidates
○ 10 hours to disambiguate
Discussion
● significant accuracy mismatch
● algorithm gaps that matter
● computationally quite heavy
● ok, implementation could be improved
○ currently we use Python + MongoDB + up to 32GB RAM
Conclusions
Combining knowledge is an attractive idea
but coping with that knowledge is additional challenge!
Babelfy is a great effort
which should have an open source version
which should be reproducible
Given:
the detail in the paper & the computational costs,
reproducing is very difficult
Do we know what we think we
know?
Reproducibility is a defining feature of science
Is reproducibility appreciated in the real-world?
Scientific claims should gain trust and credence by
replicability of their supporting evidence
Join us!
https://github.com/filipdbrsk/BabelfyReimplementation
Appendix
Discussion on computation
● a lot of requests for generating candidates
● a lot of candidates
● big graphs
● computationally super heavy (better implementation
needed):
For KORE-50:
○ 3.5 days to preprocess data
○ 17 mins to generate candidates
○ 10 hours to disambiguate
Discussion on algorithm
partial vs exact matching: when to use what
partial matching for entities, but how do we know if
something is an entity?
different hyperparameters for EL & WSD -> very confusing
what happens if the confidence is too low -> unreported
tuning of the accuracy per dataset: especially on partial vs full
matching
Discussion on implementation
● Python is a poor choice for speed
○ GIL (Global Interpreter Lock) → no multithreading
● MongoDB is scalable but not fast enough
● More RAM
The algorithm in a nutshell
Finding semantic signature:
● Synset graph: an edge between any pair of synsets having
a relation of any type
● Structural weighting: edge weight is proportional to the
number of directed triangles
● Semantic signature: random walk with restart
Finding candidates:
- tight integration with the background knowledge
- every phrase which contains a noun is checked for
existence in BabelNet
The algorithm in a nutshell
Disambiguating:
The algorithm in a nutshell
Implementation
Structural weighting: 1st attempt
for u in vertices
for v in adjacent vertices of u
for w in adjacent vertices of v
is u adjacent to w?
→ O(E^2)
Implementation
Structural weighting: 2nd attempt
reverse all edges
for u in vertices
for v in adjacent vertices of u
intersection(adjacent vertices of v,
reversed-adjacent vertices of u)
→ O(E)

Contenu connexe

Similaire à Inside the Babelfy bubble: Reexamining an influential NLP algorithm

Deep Learning Cases: Text and Image Processing
Deep Learning Cases: Text and Image ProcessingDeep Learning Cases: Text and Image Processing
Deep Learning Cases: Text and Image ProcessingGrigory Sapunov
 
Functional Programming with Immutable Data Structures
Functional Programming with Immutable Data StructuresFunctional Programming with Immutable Data Structures
Functional Programming with Immutable Data Structureselliando dias
 
Nibiru: Building your own NoSQL store
Nibiru: Building your own NoSQL storeNibiru: Building your own NoSQL store
Nibiru: Building your own NoSQL storeEdward Capriolo
 
Building your own NSQL store
Building your own NSQL storeBuilding your own NSQL store
Building your own NSQL storeEdward Capriolo
 
Nibiru: Building your own NoSQL store
Nibiru: Building your own NoSQL storeNibiru: Building your own NoSQL store
Nibiru: Building your own NoSQL storeEdward Capriolo
 
PR-146: CornerNet detecting objects as paired keypoints
PR-146: CornerNet detecting objects as paired keypointsPR-146: CornerNet detecting objects as paired keypoints
PR-146: CornerNet detecting objects as paired keypointsjaewon lee
 
Convolutional neural network in practice
Convolutional neural network in practiceConvolutional neural network in practice
Convolutional neural network in practice남주 김
 
DL4J at Workday Meetup
DL4J at Workday MeetupDL4J at Workday Meetup
DL4J at Workday MeetupDavid Kale
 
2014 nicta-reproducibility
2014 nicta-reproducibility2014 nicta-reproducibility
2014 nicta-reproducibilityc.titus.brown
 
[PR12] Inception and Xception - Jaejun Yoo
[PR12] Inception and Xception - Jaejun Yoo[PR12] Inception and Xception - Jaejun Yoo
[PR12] Inception and Xception - Jaejun YooJaeJun Yoo
 
Report: "MolGAN: An implicit generative model for small molecular graphs"
Report: "MolGAN: An implicit generative model for small molecular graphs"Report: "MolGAN: An implicit generative model for small molecular graphs"
Report: "MolGAN: An implicit generative model for small molecular graphs"Ryohei Suzuki
 
Towards Computational Research Objects
Towards Computational Research ObjectsTowards Computational Research Objects
Towards Computational Research ObjectsDavid De Roure
 
Deep Learning, an interactive introduction for NLP-ers
Deep Learning, an interactive introduction for NLP-ersDeep Learning, an interactive introduction for NLP-ers
Deep Learning, an interactive introduction for NLP-ersRoelof Pieters
 
Madaari : Ordering For The Monkeys
Madaari : Ordering For The MonkeysMadaari : Ordering For The Monkeys
Madaari : Ordering For The MonkeysJ On The Beach
 
Machine learning the next revolution or just another hype
Machine learning   the next revolution or just another hypeMachine learning   the next revolution or just another hype
Machine learning the next revolution or just another hypeJorge Ferrer
 
Using MongoDB for Materials Discovery
Using MongoDB for Materials DiscoveryUsing MongoDB for Materials Discovery
Using MongoDB for Materials DiscoveryDan Gunter
 
Deep Learning with Python (PyData Seattle 2015)
Deep Learning with Python (PyData Seattle 2015)Deep Learning with Python (PyData Seattle 2015)
Deep Learning with Python (PyData Seattle 2015)Alexander Korbonits
 

Similaire à Inside the Babelfy bubble: Reexamining an influential NLP algorithm (20)

Deep Learning Cases: Text and Image Processing
Deep Learning Cases: Text and Image ProcessingDeep Learning Cases: Text and Image Processing
Deep Learning Cases: Text and Image Processing
 
Functional Programming with Immutable Data Structures
Functional Programming with Immutable Data StructuresFunctional Programming with Immutable Data Structures
Functional Programming with Immutable Data Structures
 
2014 pycon-talk
2014 pycon-talk2014 pycon-talk
2014 pycon-talk
 
Nibiru: Building your own NoSQL store
Nibiru: Building your own NoSQL storeNibiru: Building your own NoSQL store
Nibiru: Building your own NoSQL store
 
Building your own NSQL store
Building your own NSQL storeBuilding your own NSQL store
Building your own NSQL store
 
Nibiru: Building your own NoSQL store
Nibiru: Building your own NoSQL storeNibiru: Building your own NoSQL store
Nibiru: Building your own NoSQL store
 
PR-146: CornerNet detecting objects as paired keypoints
PR-146: CornerNet detecting objects as paired keypointsPR-146: CornerNet detecting objects as paired keypoints
PR-146: CornerNet detecting objects as paired keypoints
 
EMNLP 2021 proScript
EMNLP 2021 proScriptEMNLP 2021 proScript
EMNLP 2021 proScript
 
Convolutional neural network in practice
Convolutional neural network in practiceConvolutional neural network in practice
Convolutional neural network in practice
 
Cshl minseqe 2013_ouellette
Cshl minseqe 2013_ouelletteCshl minseqe 2013_ouellette
Cshl minseqe 2013_ouellette
 
DL4J at Workday Meetup
DL4J at Workday MeetupDL4J at Workday Meetup
DL4J at Workday Meetup
 
2014 nicta-reproducibility
2014 nicta-reproducibility2014 nicta-reproducibility
2014 nicta-reproducibility
 
[PR12] Inception and Xception - Jaejun Yoo
[PR12] Inception and Xception - Jaejun Yoo[PR12] Inception and Xception - Jaejun Yoo
[PR12] Inception and Xception - Jaejun Yoo
 
Report: "MolGAN: An implicit generative model for small molecular graphs"
Report: "MolGAN: An implicit generative model for small molecular graphs"Report: "MolGAN: An implicit generative model for small molecular graphs"
Report: "MolGAN: An implicit generative model for small molecular graphs"
 
Towards Computational Research Objects
Towards Computational Research ObjectsTowards Computational Research Objects
Towards Computational Research Objects
 
Deep Learning, an interactive introduction for NLP-ers
Deep Learning, an interactive introduction for NLP-ersDeep Learning, an interactive introduction for NLP-ers
Deep Learning, an interactive introduction for NLP-ers
 
Madaari : Ordering For The Monkeys
Madaari : Ordering For The MonkeysMadaari : Ordering For The Monkeys
Madaari : Ordering For The Monkeys
 
Machine learning the next revolution or just another hype
Machine learning   the next revolution or just another hypeMachine learning   the next revolution or just another hype
Machine learning the next revolution or just another hype
 
Using MongoDB for Materials Discovery
Using MongoDB for Materials DiscoveryUsing MongoDB for Materials Discovery
Using MongoDB for Materials Discovery
 
Deep Learning with Python (PyData Seattle 2015)
Deep Learning with Python (PyData Seattle 2015)Deep Learning with Python (PyData Seattle 2015)
Deep Learning with Python (PyData Seattle 2015)
 

Plus de Filip Ilievski

The Commonsense Knowledge Graph
The Commonsense Knowledge GraphThe Commonsense Knowledge Graph
The Commonsense Knowledge GraphFilip Ilievski
 
Commonsense knowledge in Wikidata
Commonsense knowledge in WikidataCommonsense knowledge in Wikidata
Commonsense knowledge in WikidataFilip Ilievski
 
SemEval-2018 task 5: Counting events and participants in the long tail
SemEval-2018 task 5: Counting events and participants in the long tailSemEval-2018 task 5: Counting events and participants in the long tail
SemEval-2018 task 5: Counting events and participants in the long tailFilip Ilievski
 
2nd Spinoza workshop: Looking at the Long Tail - introductory slides
2nd Spinoza workshop: Looking at the Long Tail - introductory slides2nd Spinoza workshop: Looking at the Long Tail - introductory slides
2nd Spinoza workshop: Looking at the Long Tail - introductory slidesFilip Ilievski
 
Systematic Study of Long Tail Phenomena in Entity Linking
Systematic Study of Long Tail Phenomena in Entity LinkingSystematic Study of Long Tail Phenomena in Entity Linking
Systematic Study of Long Tail Phenomena in Entity LinkingFilip Ilievski
 
LOTUS: Adaptive Text Search for Big Linked Data
LOTUS: Adaptive Text Search for Big Linked DataLOTUS: Adaptive Text Search for Big Linked Data
LOTUS: Adaptive Text Search for Big Linked DataFilip Ilievski
 
Lotus: Linked Open Text UnleaShed - ISWC COLD '15
Lotus: Linked Open Text UnleaShed - ISWC COLD '15Lotus: Linked Open Text UnleaShed - ISWC COLD '15
Lotus: Linked Open Text UnleaShed - ISWC COLD '15Filip Ilievski
 
NAF2SEM and cross-document Event Coreference
NAF2SEM and cross-document Event CoreferenceNAF2SEM and cross-document Event Coreference
NAF2SEM and cross-document Event CoreferenceFilip Ilievski
 
Mini seminar presentation on context-based NED optimization
Mini seminar presentation on context-based NED optimizationMini seminar presentation on context-based NED optimization
Mini seminar presentation on context-based NED optimizationFilip Ilievski
 
CLiN 25: NED with two-stage coherence optimization
CLiN 25: NED with two-stage coherence optimizationCLiN 25: NED with two-stage coherence optimization
CLiN 25: NED with two-stage coherence optimizationFilip Ilievski
 

Plus de Filip Ilievski (11)

The Commonsense Knowledge Graph
The Commonsense Knowledge GraphThe Commonsense Knowledge Graph
The Commonsense Knowledge Graph
 
Commonsense knowledge in Wikidata
Commonsense knowledge in WikidataCommonsense knowledge in Wikidata
Commonsense knowledge in Wikidata
 
SemEval-2018 task 5: Counting events and participants in the long tail
SemEval-2018 task 5: Counting events and participants in the long tailSemEval-2018 task 5: Counting events and participants in the long tail
SemEval-2018 task 5: Counting events and participants in the long tail
 
2nd Spinoza workshop: Looking at the Long Tail - introductory slides
2nd Spinoza workshop: Looking at the Long Tail - introductory slides2nd Spinoza workshop: Looking at the Long Tail - introductory slides
2nd Spinoza workshop: Looking at the Long Tail - introductory slides
 
Systematic Study of Long Tail Phenomena in Entity Linking
Systematic Study of Long Tail Phenomena in Entity LinkingSystematic Study of Long Tail Phenomena in Entity Linking
Systematic Study of Long Tail Phenomena in Entity Linking
 
NoSQL databases
NoSQL databasesNoSQL databases
NoSQL databases
 
LOTUS: Adaptive Text Search for Big Linked Data
LOTUS: Adaptive Text Search for Big Linked DataLOTUS: Adaptive Text Search for Big Linked Data
LOTUS: Adaptive Text Search for Big Linked Data
 
Lotus: Linked Open Text UnleaShed - ISWC COLD '15
Lotus: Linked Open Text UnleaShed - ISWC COLD '15Lotus: Linked Open Text UnleaShed - ISWC COLD '15
Lotus: Linked Open Text UnleaShed - ISWC COLD '15
 
NAF2SEM and cross-document Event Coreference
NAF2SEM and cross-document Event CoreferenceNAF2SEM and cross-document Event Coreference
NAF2SEM and cross-document Event Coreference
 
Mini seminar presentation on context-based NED optimization
Mini seminar presentation on context-based NED optimizationMini seminar presentation on context-based NED optimization
Mini seminar presentation on context-based NED optimization
 
CLiN 25: NED with two-stage coherence optimization
CLiN 25: NED with two-stage coherence optimizationCLiN 25: NED with two-stage coherence optimization
CLiN 25: NED with two-stage coherence optimization
 

Dernier

Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfSumit Kumar yadav
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfSumit Kumar yadav
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxkessiyaTpeter
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |aasikanpl
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfnehabiju2046
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhousejana861314
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Nistarini College, Purulia (W.B) India
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...jana861314
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfSumit Kumar yadav
 

Dernier (20)

The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 
Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdf
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhouse
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 

Inside the Babelfy bubble: Reexamining an influential NLP algorithm

  • 1. A look inside Babelfy: Examining the bubble Filip IlievskiMinh Le
  • 2. Reproducibility “Reproducibility is a defining feature of science, but the extent to which it characterizes current research is unknown.” (Estimating the reproducibility of psychological science, 2015)
  • 3. “According to the replicators’ qualitative assessments, only 39 of the 100 replication attempts were successful.” (http://www.nature.com/news/over-half-of-psychology-studies-fa il-reproducibility-test-1.18248)
  • 4. Replication is h.a.r.d. “According to the replicators’ qualitative assessments, only 39 of the 100 replication attempts were successful.” “BUT, whether a replication attempt is considered successful is not straightforward.” (http://www.nature.com/news/over-half-of-psychology-studies-fa il-reproducibility-test-1.18248)
  • 5. Everyone has a replication story to tell ... And here is ours!
  • 7. Babelfy ● Solution published in 2014 ● State-of-the-art reported performance
  • 8. Babelfy ● Solution published in 2014 ● State-of-the-art reported performance ● Great algorithm ○ Multilinguality ○ Joint decision making ○ Combined knowledge from multiple sources ■ NLP (EL, WSD) ■ Semantic Web (BabelNet)
  • 9. Babelfy ● Solution published in 2014 ● State-of-the-art reported performance ● Great algorithm ○ Multilinguality ○ Joint decision making ○ Combined knowledge from multiple sources ■ NLP (EL, WSD) ■ Semantic Web (BabelNet) But: No open source available
  • 10. Idea: Reimplement ● No open source available ● Curiosity how the algorithm works in detail ● Accelerate research by opening the source code
  • 11. Algorithm 1. Preprocessing (run once) 1.1. Input: BabelNet graph (undirected, labeled graph of concepts and entities) 1.2. Re-weight the graph 1.3. Random walk with restart 1.4. Output: set of related concepts and entities 2. Disambiguation (for each document) 2.1. Generate candidates 2.2. Construct referent graph 2.3. Apply Densest subgraph algorithm 2.4. Disambiguate with a threshold
  • 12. 1) Graph -> Concept set ->
  • 13. Algorithm 1. Preprocessing (run once) 1.1. Input: BabelNet graph (undirected, labeled graph of concepts and entities) 1.2. Re-weight the graph 1.3. Random walk with restart 1.4. Output: set of related concepts and entities 2. Disambiguation (for each document) 2.1. Generate candidates 2.2. Construct referent graph 2.3. Apply Densest subgraph algorithm 2.4. Disambiguate with a threshold
  • 14. Thomas and Mario are strikers playing in Munich. 2) Disambiguation
  • 15. 2) Disambiguation: Graph construction Thomas and Mario are strikers playing in Munich.
  • 16. 2) Disambiguation: Densest subgraph Thomas and Mario are strikers playing in Munich.
  • 17. Experimental setup We ran experiments on: a University server 16/32GB RAM 1TB disk space 16 cores CPU @ 2.60 Ghz
  • 18. Experiments KORE-50 dataset: 50 ‘difficult’ sentences
  • 19. Execution time: KORE-50 1. Preprocessing (run once) 1.1. Input: BabelNet graph (undirected, labeled graph of concepts and entities) 1.2. Re-weight the graph 1.3. Random walk with restart 1.4. Output: set of related concepts and entities 2. Disambiguation (for each document) 2.1. Generate candidates 2.2. Construct referent graph 2.3. Apply Densest subgraph algorithm 2.4. Disambiguate with a threshold 10.3 hours 3.5 days
  • 20. Results: KORE-50 Precision Recall F1 Recognition 0.82 0.77 0.79 Disambiguation 0.41 0.40 0.40 Disambiguation (reported by Babelfy) / / 0.715
  • 21. Discussion ● significant accuracy mismatch ○ Reported F1-score: 71.5 % ○ Our F1-score: 40%
  • 22. Discussion ● significant accuracy mismatch ● algorithm gaps that matter ○ partial matching for entities, exact matching for concepts: but how do we know if something is an entity? ○ different parameters for EL and WSD ○ parameters tuned per dataset :-( ○ what happens if the linking conference is too low?
  • 23. Discussion ● significant accuracy mismatch ● algorithm gaps that matter ● computationally quite heavy For KORE-50: ○ 3.5 days to preprocess data ○ 17 mins to generate candidates ○ 10 hours to disambiguate
  • 24. Discussion ● significant accuracy mismatch ● algorithm gaps that matter ● computationally quite heavy ● ok, implementation could be improved ○ currently we use Python + MongoDB + up to 32GB RAM
  • 25. Conclusions Combining knowledge is an attractive idea but coping with that knowledge is additional challenge! Babelfy is a great effort which should have an open source version which should be reproducible Given: the detail in the paper & the computational costs, reproducing is very difficult
  • 26. Do we know what we think we know? Reproducibility is a defining feature of science Is reproducibility appreciated in the real-world? Scientific claims should gain trust and credence by replicability of their supporting evidence
  • 29. Discussion on computation ● a lot of requests for generating candidates ● a lot of candidates ● big graphs ● computationally super heavy (better implementation needed): For KORE-50: ○ 3.5 days to preprocess data ○ 17 mins to generate candidates ○ 10 hours to disambiguate
  • 30. Discussion on algorithm partial vs exact matching: when to use what partial matching for entities, but how do we know if something is an entity? different hyperparameters for EL & WSD -> very confusing what happens if the confidence is too low -> unreported tuning of the accuracy per dataset: especially on partial vs full matching
  • 31. Discussion on implementation ● Python is a poor choice for speed ○ GIL (Global Interpreter Lock) → no multithreading ● MongoDB is scalable but not fast enough ● More RAM
  • 32. The algorithm in a nutshell Finding semantic signature: ● Synset graph: an edge between any pair of synsets having a relation of any type ● Structural weighting: edge weight is proportional to the number of directed triangles ● Semantic signature: random walk with restart
  • 33. Finding candidates: - tight integration with the background knowledge - every phrase which contains a noun is checked for existence in BabelNet The algorithm in a nutshell
  • 35. Implementation Structural weighting: 1st attempt for u in vertices for v in adjacent vertices of u for w in adjacent vertices of v is u adjacent to w? → O(E^2)
  • 36. Implementation Structural weighting: 2nd attempt reverse all edges for u in vertices for v in adjacent vertices of u intersection(adjacent vertices of v, reversed-adjacent vertices of u) → O(E)