Entityclassifier.eu: Real-time Classification
of Entities in Text with Wikipedia
Milan Dojchinovski1,2, Tomáš Kliegr2
1 Faculty of Information Technology
Czech Technical University in Prague
2Faculty of Informatics and Statistics
University of Economics, Prague
European Conference on Machine Learning and Principles and Practice of
Knowledge Discovery in Databases (ECML PKDD 2013)
September 23-27, 2013, Prague, CZ
Milan Dojchinovski
milan.dojchinovski@vse.cz - @m1ci - http://dojchinovski.mk
Except where otherwise noted, the content of this presentation is licensed under
Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported
What is Entityclassifier.eu?
‣ Fully-automated Named Entity Recognition (NER) system
- entity spotting - rule-based lexico-syntactic patterns
- entity disambiguation - unique identification with Wikipedia/DBpedia URIs
- entity classification - using types from the DBpedia Ontology
- entity linking - entities linked with concepts from DBpedia and YAGO
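
The four stages above can be illustrated with a minimal Python sketch. This is a toy illustration under stated assumptions, not the system's actual implementation: the spotting pattern is reduced to "runs of capitalized words", disambiguation naively builds a DBpedia-style resource URI from the surface form, and the type table `TOY_TYPES` is hypothetical sample data standing in for a real DBpedia Ontology lookup. All function names are made up for this example.

```python
import re
from urllib.parse import quote

# Hypothetical sample data standing in for a DBpedia Ontology type lookup.
TOY_TYPES = {
    "Prague": "http://dbpedia.org/ontology/City",
    "Czech Technical University": "http://dbpedia.org/ontology/University",
}

# Simplified spotting rule (an assumption): one or more consecutive
# capitalized words form an entity candidate.
CANDIDATE = re.compile(r"\b[A-Z][a-z]+(?:\s+[A-Z][a-z]+)*")


def spot(text):
    """Entity spotting: return candidate surface forms found in the text."""
    return [m.group(0) for m in CANDIDATE.finditer(text)]


def disambiguate(surface):
    """Naive disambiguation: map a surface form to a DBpedia-style URI."""
    return "http://dbpedia.org/resource/" + quote(surface.replace(" ", "_"))


def classify(surface):
    """Classification: look up a type in the toy table (None if unknown)."""
    return TOY_TYPES.get(surface)


def annotate(text):
    """Run spotting, disambiguation and classification over the text."""
    return [
        {"entity": s, "uri": disambiguate(s), "type": classify(s)}
        for s in spot(text)
    ]


if __name__ == "__main__":
    for row in annotate("we visited the Czech Technical University in Prague"):
        print(row)
```

A real system would resolve candidates against Wikipedia/DBpedia rather than constructing URIs from the surface string, so the URIs produced here are only placeholders of the right shape.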
Advantages of using Entityclassifier.eu
‣ Real-time mining
- previously unknown entities can be disambiguated and classified in real time
‣ Right type granularity
- most frequent type, as selected by the Wikipedia editors, extracted from free text
‣ Multilinguality
- can process English, German and Dutch texts
Availability
‣ Web application
- http://entityclassifier.eu
‣ REST API
- API documentation http://entityclassifier.eu/thd/docs/
Live demo!
http://entityclassifier.eu
Feedback
Thank you!
Questions, comments, ideas?
Milan Dojchinovski @m1ci
milan.dojchinovski@fit.cvut.cz http://dojchinovski.mk
