SlideShare une entreprise Scribd logo
1  sur  20
Télécharger pour lire hors ligne
Creating Knowledge out of Interlinked Data
          MultilingualWeb – 2012/06/11 Dublin – Page 1                           http://lod2.eu




        Linked Data in Linguistics
      for NLP and Web Annotation



                                                            http://nlp2rdf.org
                                                              http://lod2.eu
                                                         Sebastian Hellmann
                                                            AKSW, Universität Leipzig
LOD2 Presentation . 02.09.2010 . Page                                      http://lod2.eu
MultilingualWeb – 2012/06/11 Dublin – Page 2   http://lod2.eu




 The Semantic Gap
MultilingualWeb – 2012/06/11 Dublin – Page 3                         http://lod2.eu

          Turning Walled Gardens into Park Networks of
          Semantic Linguistic Data
How can we leverage the Data Web for natural
language processing?
                                                            50 Billion facts covering
                                                            all kinds of domains are
                                                            readily available
                                    1. Use the Data         Leverage the wisdom of
                                        Web as              the crowds
                                      background
                                     knowledge for
                                          NLP

                                                    2. Use Data
                         3. Make the
                                                        Web
                        output of NLP
                                                   technologies
                       tools available
                                                  for integrating    RDF is all about
                         on the Data
On the Web, by                                      NLP tools &      semantic
                             Web
sharing and                                         approaches
                                                                     interoperability
copying the value
of information
increases
MultilingualWeb – 2012/06/11 Dublin – Page 4                            http://lod2.eu

 1. Use the Data Web as
 background knowledge for NLP




                                               Linguistic Data currently filed
                                                   under “cross-domain”
MultilingualWeb – 2012/06/11 Dublin – Page 5        http://lod2.eu

    1. Use the Data Web as
    background knowledge for NLP


Three communities with three resources:
 • Working Group for Open Linguistics Data (OWLG)
    – > http://linguistics.okfn.org
 • DBpedia Internationalization Committee
    – > http://wiki.dbpedia.org/Internationalization
 • Wiktionary2RDF Wrappers
    – > http://dbpedia.org/Wiktionary
All communities are open, please join!
MultilingualWeb – 2012/06/11 Dublin – Page 6   http://lod2.eu




 The Linguistic Linked Open Data Cloud
MultilingualWeb – 2012/06/11 Dublin – Page 7   http://lod2.eu




 Main question
MultilingualWeb – 2012/06/11 Dublin – Page 8                        http://lod2.eu




 Wiktionary2RDF – Mediator Wrapper
                                               http://dbpedia.org/Wiktionary
MultilingualWeb – 2012/06/11 Dublin – Page 9                        http://lod2.eu




 Wiktionary2RDF – Mediator Wrapper
                                               http://dbpedia.org/Wiktionary


                                                                 Mediator
                                                                  Lemon
MultilingualWeb – 2012/06/11 Dublin – Page 10     http://lod2.eu

          2. Use Data Web Technologies for
          Integrating NLP Tools and Approaches


Golden Hammer Anti-pattern



The question is not whether to
use RDF and Linked Data, but when
to use...




  Image from http://pbmo.wordpress.com/2011/09/29/maslows-hammer/
MultilingualWeb – 2012/06/11 Dublin – Page 11   http://lod2.eu
MultilingualWeb – 2012/06/11 Dublin – Page 12   http://lod2.eu

   2. Use Data Web Technologies for
   Integrating NLP Tools and Approaches




• Ontologies provide (formal) documentation (UML, ERD)
• Structure is easy to understand
• Wide range of RDF tools can be used, e.g. LOD2 Stack
• Indexing and querying as Big Picture possible
MultilingualWeb – 2012/06/11 Dublin – Page 13         http://lod2.eu

      2. Use Data Web Technologies for
      Integrating NLP Tools and Approaches

  The NLP Interchange Format (NIF) is an RDF/OWL-based
  format that aims to achieve interoperability between Natural
  Language Processing (NLP) tools, language resources and
  annotations.
• Road map
   • Bootstrapped by LOD2, but a community project
   • First release in September 2011
   • Great resonance
      – Over 50 people joined the mailing list:
         http://lists.okfn.org/mailman/listinfo/open-linguistics
      – First third party implementations and contributions
      – Several project discuss usage
   • Currently setting up advisory board, next draft in July
MultilingualWeb – 2012/06/11 Dublin – Page 14                                         http://lod2.eu




S. Auer and S. Hellmann: The Web of Data: Decentralized, collaborative, interlinked and interoperable
LREC 2012, http://www.lrec-conf.org/proceedings/lrec2012/keynotes/LREC%202012.Keynote%20Speech%201.Soeren%20Auer.pdf
MultilingualWeb – 2012/06/11 Dublin – Page 15   http://lod2.eu

        3. Make the Output of NLP Tools
         available on the Web




Currently there is no standard mechanism to transparently
combine the WWW, GGG and NLP




GGG = Giant Global Graph (basically the Web of Data)
see: http://dig.csail.mit.edu/breadcrumbs/node/215
MultilingualWeb – 2012/06/11 Dublin – Page 16   http://lod2.eu

 3. Make the Output of NLP Tools
  available on the Web
MultilingualWeb – 2012/06/11 Dublin – Page 17            http://lod2.eu

           3. Make the Output of NLP Tools
            available on the Web




http://dbpedia.org/spotlight P. Mendes et. al. DBpedia spotlight: Shedding
           light on the web of documents. In I-Semantics, 2011
MultilingualWeb – 2012/06/11 Dublin – Page 18    http://lod2.eu

 3. Make the Output of NLP Tools
  available on the Web




http://annotateit.org
http://sourceforge.net/projects/fragmentlinks/
MultilingualWeb – 2012/06/11 Dublin – Page 19                http://lod2.eu

         3. Make the Output of NLP Tools
          available on the Web

        NLP Interchange Format (NIF) join the mailing list at:
                         http://nlp2rdf.org




Hellmann et.al.: Towards an Ontology for Representing Strings In: EKAW 2012
    http://svn.aksw.org/papers/2012/WWW_NIF/public/string_ontology.pdf
LOD2 Title . 02.09.2010 . Page 20                               http://lod2.eu




            Contact

            Address

            University of Leipzig
            Faculty of Mathematics and Computer
            Science
            Institute of Computer Science
            Department of Business Information
            Systems

            Postfach 100920
            04009 Leipzig
            Germany



     Project: http://lod2.eu
     Organisation: http://uni-leipzig.de, http://aksw.org
     Presenter: http://bis.informatik.uni-leipzig.de/SebastianHellmann
     NLP2RDF page: http://nlp2rdf.org

                                                     Acknowledgement:
       CC-BY-SA                            some slides are taken from the keynote
  Thanks for your
unless otherwise stated                         of Sören Auer at LREC 2012

Contenu connexe

Similaire à Linked Data in Linguistics for NLP and Web Annotation

Soren Auer - LOD2 - creating knowledge out of Interlinked Data
Soren Auer - LOD2 - creating knowledge out of Interlinked DataSoren Auer - LOD2 - creating knowledge out of Interlinked Data
Soren Auer - LOD2 - creating knowledge out of Interlinked DataOpen City Foundation
 
Integrating NLP using Linked Data
Integrating NLP using Linked DataIntegrating NLP using Linked Data
Integrating NLP using Linked DataSebastian Hellmann
 
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataIntroduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataSören Auer
 
Linked data and semantic wikis
Linked data and semantic wikisLinked data and semantic wikis
Linked data and semantic wikisSören Auer
 
Improving the Performance of the DL-Learner SPARQL Component for Semantic We...
Improving the Performance of the  DL-Learner SPARQL Component for Semantic We...Improving the Performance of the  DL-Learner SPARQL Component for Semantic We...
Improving the Performance of the DL-Learner SPARQL Component for Semantic We...Sebastian Hellmann
 
NIF 2.0 Phd thesis intermediate report
NIF 2.0 Phd thesis intermediate reportNIF 2.0 Phd thesis intermediate report
NIF 2.0 Phd thesis intermediate reportSebastian Hellmann
 
Semantic personalisation in networked media: determining the background know...
Semantic personalisation in networked media: determining the  background know...Semantic personalisation in networked media: determining the  background know...
Semantic personalisation in networked media: determining the background know...LinkedTV
 
Datalift: A Catalyser for the Web of Data - Francois Scharffe
Datalift: A Catalyser for the Web of Data - Francois ScharffeDatalift: A Catalyser for the Web of Data - Francois Scharffe
Datalift: A Catalyser for the Web of Data - Francois Scharffewebscience-montpellier
 

Similaire à Linked Data in Linguistics for NLP and Web Annotation (20)

Soren Auer - LOD2 - creating knowledge out of Interlinked Data
Soren Auer - LOD2 - creating knowledge out of Interlinked DataSoren Auer - LOD2 - creating knowledge out of Interlinked Data
Soren Auer - LOD2 - creating knowledge out of Interlinked Data
 
NIF 2.0 draft for Pisa
NIF 2.0 draft for PisaNIF 2.0 draft for Pisa
NIF 2.0 draft for Pisa
 
Integrating NLP using Linked Data
Integrating NLP using Linked DataIntegrating NLP using Linked Data
Integrating NLP using Linked Data
 
NIF - NLP Interchange Format
NIF - NLP Interchange FormatNIF - NLP Interchange Format
NIF - NLP Interchange Format
 
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataIntroduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
 
LOD2 Webinar: UnifiedViews
LOD2 Webinar: UnifiedViewsLOD2 Webinar: UnifiedViews
LOD2 Webinar: UnifiedViews
 
Linked data and semantic wikis
Linked data and semantic wikisLinked data and semantic wikis
Linked data and semantic wikis
 
LOD2 - Creating Knowledge out of Interlinked Data - General Presentation
LOD2 - Creating Knowledge out of Interlinked Data - General PresentationLOD2 - Creating Knowledge out of Interlinked Data - General Presentation
LOD2 - Creating Knowledge out of Interlinked Data - General Presentation
 
LOD2 Webinar Series: Zemanta / Open refine
LOD2 Webinar Series: Zemanta / Open refine LOD2 Webinar Series: Zemanta / Open refine
LOD2 Webinar Series: Zemanta / Open refine
 
LOD2 Plenary Vienna 2012: WP4 - Reuse, Interlinking and Knowledge Fusion
LOD2 Plenary Vienna 2012: WP4 - Reuse, Interlinking and Knowledge FusionLOD2 Plenary Vienna 2012: WP4 - Reuse, Interlinking and Knowledge Fusion
LOD2 Plenary Vienna 2012: WP4 - Reuse, Interlinking and Knowledge Fusion
 
Improving the Performance of the DL-Learner SPARQL Component for Semantic We...
Improving the Performance of the  DL-Learner SPARQL Component for Semantic We...Improving the Performance of the  DL-Learner SPARQL Component for Semantic We...
Improving the Performance of the DL-Learner SPARQL Component for Semantic We...
 
LOD2 Webinar Series: LOD2 in information and publishing industry
LOD2 Webinar Series: LOD2 in information and publishing industryLOD2 Webinar Series: LOD2 in information and publishing industry
LOD2 Webinar Series: LOD2 in information and publishing industry
 
LOD2 Webinar Series FOX
LOD2 Webinar Series FOXLOD2 Webinar Series FOX
LOD2 Webinar Series FOX
 
Free Webinar: LOD2 Stack - 1st release
Free Webinar: LOD2 Stack - 1st releaseFree Webinar: LOD2 Stack - 1st release
Free Webinar: LOD2 Stack - 1st release
 
NIF 2.0 Phd thesis intermediate report
NIF 2.0 Phd thesis intermediate reportNIF 2.0 Phd thesis intermediate report
NIF 2.0 Phd thesis intermediate report
 
LOD2: State of Play WP3B - Knowledge Extraction, NLP2RDF + NIF
LOD2: State of Play WP3B - Knowledge Extraction, NLP2RDF + NIFLOD2: State of Play WP3B - Knowledge Extraction, NLP2RDF + NIF
LOD2: State of Play WP3B - Knowledge Extraction, NLP2RDF + NIF
 
Semantic personalisation in networked media: determining the background know...
Semantic personalisation in networked media: determining the  background know...Semantic personalisation in networked media: determining the  background know...
Semantic personalisation in networked media: determining the background know...
 
LOD2 Webinar Series: D2R and Sparqlify
LOD2 Webinar Series: D2R and SparqlifyLOD2 Webinar Series: D2R and Sparqlify
LOD2 Webinar Series: D2R and Sparqlify
 
LOD2 Webinar Series: SILK
LOD2 Webinar Series: SILKLOD2 Webinar Series: SILK
LOD2 Webinar Series: SILK
 
Datalift: A Catalyser for the Web of Data - Francois Scharffe
Datalift: A Catalyser for the Web of Data - Francois ScharffeDatalift: A Catalyser for the Web of Data - Francois Scharffe
Datalift: A Catalyser for the Web of Data - Francois Scharffe
 

Plus de Sebastian Hellmann

Linguistic Linked Open Data, Challenges, Approaches, Future Work
Linguistic Linked Open Data, Challenges, Approaches, Future WorkLinguistic Linked Open Data, Challenges, Approaches, Future Work
Linguistic Linked Open Data, Challenges, Approaches, Future WorkSebastian Hellmann
 
DBpedia/association Introduction The Hague 12.2.2016
DBpedia/association Introduction The Hague 12.2.2016DBpedia/association Introduction The Hague 12.2.2016
DBpedia/association Introduction The Hague 12.2.2016Sebastian Hellmann
 
Lider Reference Model ld4lt session March, 3rd, 2015
Lider Reference Model ld4lt session  March, 3rd, 2015Lider Reference Model ld4lt session  March, 3rd, 2015
Lider Reference Model ld4lt session March, 3rd, 2015Sebastian Hellmann
 
LD4LT Roadmap session 19_02_2015
LD4LT Roadmap session 19_02_2015LD4LT Roadmap session 19_02_2015
LD4LT Roadmap session 19_02_2015Sebastian Hellmann
 
DBpedia: A Public Data Infrastructure for the Web of Data
DBpedia: A Public Data Infrastructure for the Web of DataDBpedia: A Public Data Infrastructure for the Web of Data
DBpedia: A Public Data Infrastructure for the Web of DataSebastian Hellmann
 
NIF 2.0 Tutorial: Content Analysis and the Semantic Web
NIF 2.0 Tutorial: Content Analysis and the Semantic Web  NIF 2.0 Tutorial: Content Analysis and the Semantic Web
NIF 2.0 Tutorial: Content Analysis and the Semantic Web Sebastian Hellmann
 
Linked Data for Abbreviations and Segmentation
Linked Data for Abbreviations and SegmentationLinked Data for Abbreviations and Segmentation
Linked Data for Abbreviations and SegmentationSebastian Hellmann
 
Navigation-induced Knowledge Engineering by Example
 Navigation-induced Knowledge Engineering by Example Navigation-induced Knowledge Engineering by Example
Navigation-induced Knowledge Engineering by ExampleSebastian Hellmann
 
NIF - Version 1.0 - 2011/10/23
NIF - Version 1.0 - 2011/10/23NIF - Version 1.0 - 2011/10/23
NIF - Version 1.0 - 2011/10/23Sebastian Hellmann
 
NLP2RDF Wortschatz and Linguistic LOD draft
NLP2RDF Wortschatz and Linguistic LOD draftNLP2RDF Wortschatz and Linguistic LOD draft
NLP2RDF Wortschatz and Linguistic LOD draftSebastian Hellmann
 

Plus de Sebastian Hellmann (14)

KEDL DBpedia 2019
KEDL DBpedia  2019KEDL DBpedia  2019
KEDL DBpedia 2019
 
Linguistic Linked Open Data, Challenges, Approaches, Future Work
Linguistic Linked Open Data, Challenges, Approaches, Future WorkLinguistic Linked Open Data, Challenges, Approaches, Future Work
Linguistic Linked Open Data, Challenges, Approaches, Future Work
 
DBpedia/association Introduction The Hague 12.2.2016
DBpedia/association Introduction The Hague 12.2.2016DBpedia/association Introduction The Hague 12.2.2016
DBpedia/association Introduction The Hague 12.2.2016
 
Lider Reference Model ld4lt session March, 3rd, 2015
Lider Reference Model ld4lt session  March, 3rd, 2015Lider Reference Model ld4lt session  March, 3rd, 2015
Lider Reference Model ld4lt session March, 3rd, 2015
 
LD4LT Roadmap session 19_02_2015
LD4LT Roadmap session 19_02_2015LD4LT Roadmap session 19_02_2015
LD4LT Roadmap session 19_02_2015
 
DBpedia: A Public Data Infrastructure for the Web of Data
DBpedia: A Public Data Infrastructure for the Web of DataDBpedia: A Public Data Infrastructure for the Web of Data
DBpedia: A Public Data Infrastructure for the Web of Data
 
NIF 2.0 Tutorial: Content Analysis and the Semantic Web
NIF 2.0 Tutorial: Content Analysis and the Semantic Web  NIF 2.0 Tutorial: Content Analysis and the Semantic Web
NIF 2.0 Tutorial: Content Analysis and the Semantic Web
 
Linked Data for Abbreviations and Segmentation
Linked Data for Abbreviations and SegmentationLinked Data for Abbreviations and Segmentation
Linked Data for Abbreviations and Segmentation
 
Navigation-induced Knowledge Engineering by Example
 Navigation-induced Knowledge Engineering by Example Navigation-induced Knowledge Engineering by Example
Navigation-induced Knowledge Engineering by Example
 
Introduction to LDL 2012
Introduction to LDL 2012Introduction to LDL 2012
Introduction to LDL 2012
 
Thesis presentation
Thesis presentationThesis presentation
Thesis presentation
 
NIF - Version 1.0 - 2011/10/23
NIF - Version 1.0 - 2011/10/23NIF - Version 1.0 - 2011/10/23
NIF - Version 1.0 - 2011/10/23
 
Tool collection as linkeddata
Tool collection as linkeddataTool collection as linkeddata
Tool collection as linkeddata
 
NLP2RDF Wortschatz and Linguistic LOD draft
NLP2RDF Wortschatz and Linguistic LOD draftNLP2RDF Wortschatz and Linguistic LOD draft
NLP2RDF Wortschatz and Linguistic LOD draft
 

Dernier

Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 

Dernier (20)

Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 

Linked Data in Linguistics for NLP and Web Annotation

  • 1. Creating Knowledge out of Interlinked Data MultilingualWeb – 2012/06/11 Dublin – Page 1 http://lod2.eu Linked Data in Linguistics for NLP and Web Annotation http://nlp2rdf.org http://lod2.eu Sebastian Hellmann AKSW, Universität Leipzig LOD2 Presentation . 02.09.2010 . Page http://lod2.eu
  • 2. MultilingualWeb – 2012/06/11 Dublin – Page 2 http://lod2.eu The Semantic Gap
  • 3. MultilingualWeb – 2012/06/11 Dublin – Page 3 http://lod2.eu Turning Walled Gardens into Park Networks of Semantic Linguistic Data How can we leverage the Data Web for natural language processing? 50 Billion facts covering all kinds of domains are readily available 1. Use the Data Leverage the wisdom of Web as the crowds background knowledge for NLP 2. Use Data 3. Make the Web output of NLP technologies tools available for integrating RDF is all about on the Data On the Web, by NLP tools & semantic Web sharing and approaches interoperability copying the value of information increases
  • 4. MultilingualWeb – 2012/06/11 Dublin – Page 4 http://lod2.eu 1. Use the Data Web as background knowledge for NLP Linguistic Data currently filed under “cross-domain”
  • 5. MultilingualWeb – 2012/06/11 Dublin – Page 5 http://lod2.eu 1. Use the Data Web as background knowledge for NLP Three communities with three resources: • Working Group for Open Linguistics Data (OWLG) – > http://linguistics.okfn.org • DBpedia Internationalization Committee – > http://wiki.dbpedia.org/Internationalization • Wiktionary2RDF Wrappers – > http://dbpedia.org/Wiktionary All communities are open, please join!
  • 6. MultilingualWeb – 2012/06/11 Dublin – Page 6 http://lod2.eu The Linguistic Linked Open Data Cloud
  • 7. MultilingualWeb – 2012/06/11 Dublin – Page 7 http://lod2.eu Main question
  • 8. MultilingualWeb – 2012/06/11 Dublin – Page 8 http://lod2.eu Wiktionary2RDF – Mediator Wrapper http://dbpedia.org/Wiktionary
  • 9. MultilingualWeb – 2012/06/11 Dublin – Page 9 http://lod2.eu Wiktionary2RDF – Mediator Wrapper http://dbpedia.org/Wiktionary Mediator Lemon
  • 10. MultilingualWeb – 2012/06/11 Dublin – Page 10 http://lod2.eu 2. Use Data Web Technologies for Integrating NLP Tools and Approaches Golden Hammer Anti-pattern The question is not whether to use RDF and Linked Data, but when to use... Image from http://pbmo.wordpress.com/2011/09/29/maslows-hammer/
  • 11. MultilingualWeb – 2012/06/11 Dublin – Page 11 http://lod2.eu
  • 12. MultilingualWeb – 2012/06/11 Dublin – Page 12 http://lod2.eu 2. Use Data Web Technologies for Integrating NLP Tools and Approaches • Ontologies provide (formal) documentation (UML, ERD) • Structure is easy to understand • Wide range of RDF tools can be used, e.g. LOD2 Stack • Indexing and querying as Big Picture possible
  • 13. MultilingualWeb – 2012/06/11 Dublin – Page 13 http://lod2.eu 2. Use Data Web Technologies for Integrating NLP Tools and Approaches The NLP Interchange Format (NIF) is an RDF/OWL-based format that aims to achieve interoperability between Natural Language Processing (NLP) tools, language resources and annotations. • Road map • Bootstrapped by LOD2, but a community project • First release in September 2011 • Great resonance – Over 50 people joined the mailing list: http://lists.okfn.org/mailman/listinfo/open-linguistics – First third party implementations and contributions – Several project discuss usage • Currently setting up advisory board, next draft in July
  • 14. MultilingualWeb – 2012/06/11 Dublin – Page 14 http://lod2.eu S. Auer and S. Hellmann: The Web of Data: Decentralized, collaborative, interlinked and interoperable LREC 2012, http://www.lrec-conf.org/proceedings/lrec2012/keynotes/LREC%202012.Keynote%20Speech%201.Soeren%20Auer.pdf
  • 15. MultilingualWeb – 2012/06/11 Dublin – Page 15 http://lod2.eu 3. Make the Output of NLP Tools available on the Web Currently there is no standard mechanism to transparently combine the WWW, GGG and NLP GGG = Giant Global Graph (basically the Web of Data) see: http://dig.csail.mit.edu/breadcrumbs/node/215
  • 16. MultilingualWeb – 2012/06/11 Dublin – Page 16 http://lod2.eu 3. Make the Output of NLP Tools available on the Web
  • 17. MultilingualWeb – 2012/06/11 Dublin – Page 17 http://lod2.eu 3. Make the Output of NLP Tools available on the Web http://dbpedia.org/spotlight P. Mendes et. al. DBpedia spotlight: Shedding light on the web of documents. In I-Semantics, 2011
  • 18. MultilingualWeb – 2012/06/11 Dublin – Page 18 http://lod2.eu 3. Make the Output of NLP Tools available on the Web http://annotateit.org http://sourceforge.net/projects/fragmentlinks/
  • 19. MultilingualWeb – 2012/06/11 Dublin – Page 19 http://lod2.eu 3. Make the Output of NLP Tools available on the Web NLP Interchange Format (NIF) join the mailing list at: http://nlp2rdf.org Hellmann et.al.: Towards an Ontology for Representing Strings In: EKAW 2012 http://svn.aksw.org/papers/2012/WWW_NIF/public/string_ontology.pdf
  • 20. LOD2 Title . 02.09.2010 . Page 20 http://lod2.eu Contact Address University of Leipzig Faculty of Mathematics and Computer Science Institute of Computer Science Department of Business Information Systems Postfach 100920 04009 Leipzig Germany Project: http://lod2.eu Organisation: http://uni-leipzig.de, http://aksw.org Presenter: http://bis.informatik.uni-leipzig.de/SebastianHellmann NLP2RDF page: http://nlp2rdf.org Acknowledgement: CC-BY-SA some slides are taken from the keynote Thanks for your unless otherwise stated of Sören Auer at LREC 2012