SlideShare une entreprise Scribd logo
1  sur  30
Télécharger pour lire hors ligne
Integrating NLP with Linked Data and RDF: 
the NIF format (hands on) 
Ciro Baron Neto 
Ph.D student at University of Leipzig 
Building the Multilingual Web of Data – ISWC 
10/20/14 tutorial 
1
Building the Multilingual Web of Data – ISWC 
10/20/14 tutorial 
2 
Overview 
• Github NLP2RDF web page overview 
and NIF Online demos (Dashboard, 
Combinator...) 
• Examples 
–Example 1: How to annotate string 
• using Snowball Steamer and OpenNLP 
–Example 2: 
• Query generated NIF data and Querying Brown 
Corpus
NLP2RDF GitHub Website 
• https://github.com/NLP2RDF/ 
• /home/ciro/websites/github/github.com/NLP2RDF/index.html 
Building the Multilingual Web of Data – ISWC 
10/20/14 tutorial 
3
dashboard.nlp2rdf.aksw.org 
Building the Multilingual Web of Data – ISWC 
10/20/14 tutorial 
4
Building the Multilingual Web of Data – ISWC 
10/20/14 tutorial 
5 
nlp2rdf.aksw.org
Example 1: Snowball Stemmer 
Wrapper 
Building the Multilingual Web of Data – ISWC 
10/20/14 tutorial 
6
Snowball Stemmer Wrapper 
• Stemming algorithm is a process 
for removing suffixes from words. 
–CONNECT 
Building the Multilingual Web of Data – ISWC 
10/20/14 tutorial 
7 
• CONNECTED 
• CONNECTION 
• CONNECTING 
• CONNECTIONS
Snowball Stemmer Wrapper 
• 1. Open the USB stick folder 
• 2. Go to “NIF_tutorial_hands_on_jars” folder 
• 3. Open the “instructions.txt” file in a text 
editor 
• 4. Open a terminal 
• 5. Go to the “jar” folder 
Building the Multilingual Web of Data – ISWC 
10/20/14 tutorial 
8
Snowball Stemmer Wrapper 
• Copy the second command of the 
instructions.txt 
“java -jar snowball.jar -f text -i 'My 
favorite actress is Natalie Portman.'“ 
• -f is used to define the format 
• -i is used to define the input 
• Paste in the terminal 
Building the Multilingual Web of Data – ISWC 
10/20/14 tutorial 
9
Snowball Stemmer Wrapper 
Building the Multilingual Web of Data – ISWC 
10/20/14 tutorial 
10
Snowball Stemmer Wrapper 
Building the Multilingual Web of Data – ISWC 
10/20/14 tutorial 
11
Snowball Stemmer Wrapper 
Building the Multilingual Web of Data – ISWC 
10/20/14 tutorial 
12 
NIF Standard Annotations 
NIF Offset
Snowball Stemmer Wrapper 
Building the Multilingual Web of Data – ISWC 
10/20/14 tutorial 
13 
NIF Standard Annotations 
Snowball Stem 
NIF Offset
OpenNLP Wrapper 
• Back to the terminal and use the first command 
of the instructions.txt 
java -jar opennlp.jar -f text -i 'My favorite actress is 
Natalie Portman.' -modelFolder ../model/ 
• The -modelFolder parameter set the folder that 
contains the POS tagging OpenNLP trained 
models and tokenization. 
• You might add the parameter “--outfile 
myAnnotatedFile.ttl“ to store the triples in a file. 
Building the Multilingual Web of Data – ISWC 
10/20/14 tutorial 
14
Example 2: Query Brown Corpus 
Building the Multilingual Web of Data – ISWC 
10/20/14 tutorial 
15
Querying with Twinkle 
• Open the “/twinkle/example” folder 
• Open the NIF_query_example file 
in a text editor and copy the query 
• Open the “/twinle” folder and run 
the command: 
java -jar twinkle.jar 
Building the Multilingual Web of Data – ISWC 
10/20/14 tutorial 
16
Querying Brown Corpus 
Building the Multilingual Web of Data – ISWC 
10/20/14 tutorial 
17
Querying Brown Corpus 
Building the Multilingual Web of Data – ISWC 
10/20/14 tutorial 
18
Querying Brown Corpus 
Building the Multilingual Web of Data – ISWC 
10/20/14 tutorial 
19
Querying Brown Corpus 
Building the Multilingual Web of Data – ISWC 
10/20/14 tutorial 
20
Querying Brown Corpus 
Building the Multilingual Web of Data – ISWC 
10/20/14 tutorial 
21
Querying Brown Corpus 
Building the Multilingual Web of Data – ISWC 
10/20/14 tutorial 
22
Querying Brown Corpus 
Building the Multilingual Web of Data – ISWC 
10/20/14 tutorial 
23
Querying Brown Corpus 
Building the Multilingual Web of Data – ISWC 
10/20/14 tutorial 
24
Querying Brown Corpus 
Building the Multilingual Web of Data – ISWC 
10/20/14 tutorial 
25
Querying Brown Corpus 
Building the Multilingual Web of Data – ISWC 
10/20/14 tutorial 
26
Exercise 3: Querying your own NIF 
annotated string 
Building the Multilingual Web of Data – ISWC 
10/20/14 tutorial 
27
Querying your own NIF annotated 
string 
1. Annotate your string using one of the 
wrappers 
2. Save your annotated sentence to a file 
(using “--outfile”) 
3. Open Twinkle 
4. Query your string using Twinkle 
Building the Multilingual Web of Data – ISWC 
10/20/14 tutorial 
28
• Query your annotated string: 
– nif:Context 
– nif:Sentence 
– nif:anchorOf 
– nif:oliaCategory 
– nif:oliaLink 
… or practice with Brown Corpus! 
Building the Multilingual Web of Data – ISWC 
10/20/14 tutorial 
29
Thank you! 
http://site.nlp2rdf.org/ 
NLP2RDF Google+ Community 
Building the Multilingual Web of Data – ISWC 
10/20/14 tutorial 
30

Contenu connexe

Similaire à NIF 2.0 Hands on Turorial.

nf-core: A community-driven collection of omics portable pipelines
nf-core: A community-driven collection of omics portable pipelinesnf-core: A community-driven collection of omics portable pipelines
nf-core: A community-driven collection of omics portable pipelinesJose Espinosa-Carrasco
 
Data Plane and VNF Acceleration Mini Summit
Data Plane and VNF Acceleration Mini Summit Data Plane and VNF Acceleration Mini Summit
Data Plane and VNF Acceleration Mini Summit Open-NFP
 
Varnish more than a cache
Varnish more than a cacheVarnish more than a cache
Varnish more than a cachebloeffeld
 
Augmenting Flow Operations and Feedback on the Model Driven MD_SAL Approach i...
Augmenting Flow Operations and Feedback on the Model Driven MD_SAL Approach i...Augmenting Flow Operations and Feedback on the Model Driven MD_SAL Approach i...
Augmenting Flow Operations and Feedback on the Model Driven MD_SAL Approach i...Brent Salisbury
 
Common asp.net design patterns aspconf2012
Common asp.net design patterns aspconf2012Common asp.net design patterns aspconf2012
Common asp.net design patterns aspconf2012Steven Smith
 
Querying Linked Data with SPARQL (2010)
Querying Linked Data with SPARQL (2010)Querying Linked Data with SPARQL (2010)
Querying Linked Data with SPARQL (2010)Olaf Hartig
 
Mark Hughes Annual Seminar Presentation on Open Source
Mark Hughes Annual Seminar Presentation on Open Source Mark Hughes Annual Seminar Presentation on Open Source
Mark Hughes Annual Seminar Presentation on Open Source Tracy Kent
 
Querying Linked Data with SPARQL
Querying Linked Data with SPARQLQuerying Linked Data with SPARQL
Querying Linked Data with SPARQLOlaf Hartig
 
Keeping a codebase fresh for over a decade
Keeping a codebase fresh for over a decadeKeeping a codebase fresh for over a decade
Keeping a codebase fresh for over a decadeChristian Keuerleber
 
DockerDay2015: Docker Networking
DockerDay2015: Docker NetworkingDockerDay2015: Docker Networking
DockerDay2015: Docker NetworkingDocker-Hanoi
 
Building a Web-Scale Dependency-Parsed Corpus from Common Crawl
Building a Web-Scale Dependency-Parsed Corpus from Common CrawlBuilding a Web-Scale Dependency-Parsed Corpus from Common Crawl
Building a Web-Scale Dependency-Parsed Corpus from Common CrawlAlexander Panchenko
 
CS8073C# dot net programming syllabus.docx
CS8073C# dot net programming syllabus.docxCS8073C# dot net programming syllabus.docx
CS8073C# dot net programming syllabus.docxPriyadarshiniS28
 
CS8073C# dot net programming syllabus.docx
CS8073C# dot net programming syllabus.docxCS8073C# dot net programming syllabus.docx
CS8073C# dot net programming syllabus.docxPriyadarshiniS28
 
Virtuoso RDF Triple Store Analysis Benchmark & mapping tools RDF / OO
Virtuoso RDF Triple Store Analysis Benchmark & mapping tools RDF / OOVirtuoso RDF Triple Store Analysis Benchmark & mapping tools RDF / OO
Virtuoso RDF Triple Store Analysis Benchmark & mapping tools RDF / OOPaolo Cristofaro
 
Concept net150529
Concept net150529Concept net150529
Concept net150529KangSe Lee
 
IBM Impact session Ed addison nuts and bolts ws
IBM Impact session Ed addison nuts and bolts wsIBM Impact session Ed addison nuts and bolts ws
IBM Impact session Ed addison nuts and bolts wsnick_garrod
 
Fitman webinar 2015 09-21 Generation and Transformation of Virtualized Assets...
Fitman webinar 2015 09-21 Generation and Transformation of Virtualized Assets...Fitman webinar 2015 09-21 Generation and Transformation of Virtualized Assets...
Fitman webinar 2015 09-21 Generation and Transformation of Virtualized Assets...FITMAN FI
 

Similaire à NIF 2.0 Hands on Turorial. (20)

An API Your Parents Would Be Proud Of
An API Your Parents Would Be Proud OfAn API Your Parents Would Be Proud Of
An API Your Parents Would Be Proud Of
 
nf-core: A community-driven collection of omics portable pipelines
nf-core: A community-driven collection of omics portable pipelinesnf-core: A community-driven collection of omics portable pipelines
nf-core: A community-driven collection of omics portable pipelines
 
Data Plane and VNF Acceleration Mini Summit
Data Plane and VNF Acceleration Mini Summit Data Plane and VNF Acceleration Mini Summit
Data Plane and VNF Acceleration Mini Summit
 
Varnish more than a cache
Varnish more than a cacheVarnish more than a cache
Varnish more than a cache
 
Augmenting Flow Operations and Feedback on the Model Driven MD_SAL Approach i...
Augmenting Flow Operations and Feedback on the Model Driven MD_SAL Approach i...Augmenting Flow Operations and Feedback on the Model Driven MD_SAL Approach i...
Augmenting Flow Operations and Feedback on the Model Driven MD_SAL Approach i...
 
Common asp.net design patterns aspconf2012
Common asp.net design patterns aspconf2012Common asp.net design patterns aspconf2012
Common asp.net design patterns aspconf2012
 
Querying Linked Data with SPARQL (2010)
Querying Linked Data with SPARQL (2010)Querying Linked Data with SPARQL (2010)
Querying Linked Data with SPARQL (2010)
 
Mark Hughes Annual Seminar Presentation on Open Source
Mark Hughes Annual Seminar Presentation on Open Source Mark Hughes Annual Seminar Presentation on Open Source
Mark Hughes Annual Seminar Presentation on Open Source
 
Querying Linked Data with SPARQL
Querying Linked Data with SPARQLQuerying Linked Data with SPARQL
Querying Linked Data with SPARQL
 
Bio2RDF @ W3C HCLS2009
Bio2RDF @ W3C HCLS2009Bio2RDF @ W3C HCLS2009
Bio2RDF @ W3C HCLS2009
 
Keeping a codebase fresh for over a decade
Keeping a codebase fresh for over a decadeKeeping a codebase fresh for over a decade
Keeping a codebase fresh for over a decade
 
DockerDay2015: Docker Networking
DockerDay2015: Docker NetworkingDockerDay2015: Docker Networking
DockerDay2015: Docker Networking
 
Building a Web-Scale Dependency-Parsed Corpus from Common Crawl
Building a Web-Scale Dependency-Parsed Corpus from Common CrawlBuilding a Web-Scale Dependency-Parsed Corpus from Common Crawl
Building a Web-Scale Dependency-Parsed Corpus from Common Crawl
 
CS8073C# dot net programming syllabus.docx
CS8073C# dot net programming syllabus.docxCS8073C# dot net programming syllabus.docx
CS8073C# dot net programming syllabus.docx
 
CS8073C# dot net programming syllabus.docx
CS8073C# dot net programming syllabus.docxCS8073C# dot net programming syllabus.docx
CS8073C# dot net programming syllabus.docx
 
Virtuoso RDF Triple Store Analysis Benchmark & mapping tools RDF / OO
Virtuoso RDF Triple Store Analysis Benchmark & mapping tools RDF / OOVirtuoso RDF Triple Store Analysis Benchmark & mapping tools RDF / OO
Virtuoso RDF Triple Store Analysis Benchmark & mapping tools RDF / OO
 
Concept net150529
Concept net150529Concept net150529
Concept net150529
 
IBM Impact session Ed addison nuts and bolts ws
IBM Impact session Ed addison nuts and bolts wsIBM Impact session Ed addison nuts and bolts ws
IBM Impact session Ed addison nuts and bolts ws
 
Bio2RDF@BH2010
Bio2RDF@BH2010Bio2RDF@BH2010
Bio2RDF@BH2010
 
Fitman webinar 2015 09-21 Generation and Transformation of Virtualized Assets...
Fitman webinar 2015 09-21 Generation and Transformation of Virtualized Assets...Fitman webinar 2015 09-21 Generation and Transformation of Virtualized Assets...
Fitman webinar 2015 09-21 Generation and Transformation of Virtualized Assets...
 

Dernier

Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Bhuvaneswari Subramani
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 

Dernier (20)

Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 

NIF 2.0 Hands on Turorial.

  • 1. Integrating NLP with Linked Data and RDF: the NIF format (hands on) Ciro Baron Neto Ph.D student at University of Leipzig Building the Multilingual Web of Data – ISWC 10/20/14 tutorial 1
  • 2. Building the Multilingual Web of Data – ISWC 10/20/14 tutorial 2 Overview • Github NLP2RDF web page overview and NIF Online demos (Dashboard, Combinator...) • Examples –Example 1: How to annotate string • using Snowball Steamer and OpenNLP –Example 2: • Query generated NIF data and Querying Brown Corpus
  • 3. NLP2RDF GitHub Website • https://github.com/NLP2RDF/ • /home/ciro/websites/github/github.com/NLP2RDF/index.html Building the Multilingual Web of Data – ISWC 10/20/14 tutorial 3
  • 4. dashboard.nlp2rdf.aksw.org Building the Multilingual Web of Data – ISWC 10/20/14 tutorial 4
  • 5. Building the Multilingual Web of Data – ISWC 10/20/14 tutorial 5 nlp2rdf.aksw.org
  • 6. Example 1: Snowball Stemmer Wrapper Building the Multilingual Web of Data – ISWC 10/20/14 tutorial 6
  • 7. Snowball Stemmer Wrapper • Stemming algorithm is a process for removing suffixes from words. –CONNECT Building the Multilingual Web of Data – ISWC 10/20/14 tutorial 7 • CONNECTED • CONNECTION • CONNECTING • CONNECTIONS
  • 8. Snowball Stemmer Wrapper • 1. Open the USB stick folder • 2. Go to “NIF_tutorial_hands_on_jars” folder • 3. Open the “instructions.txt” file in a text editor • 4. Open a terminal • 5. Go to the “jar” folder Building the Multilingual Web of Data – ISWC 10/20/14 tutorial 8
  • 9. Snowball Stemmer Wrapper • Copy the second command of the instructions.txt “java -jar snowball.jar -f text -i 'My favorite actress is Natalie Portman.'“ • -f is used to define the format • -i is used to define the input • Paste in the terminal Building the Multilingual Web of Data – ISWC 10/20/14 tutorial 9
  • 10. Snowball Stemmer Wrapper Building the Multilingual Web of Data – ISWC 10/20/14 tutorial 10
  • 11. Snowball Stemmer Wrapper Building the Multilingual Web of Data – ISWC 10/20/14 tutorial 11
  • 12. Snowball Stemmer Wrapper Building the Multilingual Web of Data – ISWC 10/20/14 tutorial 12 NIF Standard Annotations NIF Offset
  • 13. Snowball Stemmer Wrapper Building the Multilingual Web of Data – ISWC 10/20/14 tutorial 13 NIF Standard Annotations Snowball Stem NIF Offset
  • 14. OpenNLP Wrapper • Back to the terminal and use the first command of the instructions.txt java -jar opennlp.jar -f text -i 'My favorite actress is Natalie Portman.' -modelFolder ../model/ • The -modelFolder parameter set the folder that contains the POS tagging OpenNLP trained models and tokenization. • You might add the parameter “--outfile myAnnotatedFile.ttl“ to store the triples in a file. Building the Multilingual Web of Data – ISWC 10/20/14 tutorial 14
  • 15. Example 2: Query Brown Corpus Building the Multilingual Web of Data – ISWC 10/20/14 tutorial 15
  • 16. Querying with Twinkle • Open the “/twinkle/example” folder • Open the NIF_query_example file in a text editor and copy the query • Open the “/twinle” folder and run the command: java -jar twinkle.jar Building the Multilingual Web of Data – ISWC 10/20/14 tutorial 16
  • 17. Querying Brown Corpus Building the Multilingual Web of Data – ISWC 10/20/14 tutorial 17
  • 18. Querying Brown Corpus Building the Multilingual Web of Data – ISWC 10/20/14 tutorial 18
  • 19. Querying Brown Corpus Building the Multilingual Web of Data – ISWC 10/20/14 tutorial 19
  • 20. Querying Brown Corpus Building the Multilingual Web of Data – ISWC 10/20/14 tutorial 20
  • 21. Querying Brown Corpus Building the Multilingual Web of Data – ISWC 10/20/14 tutorial 21
  • 22. Querying Brown Corpus Building the Multilingual Web of Data – ISWC 10/20/14 tutorial 22
  • 23. Querying Brown Corpus Building the Multilingual Web of Data – ISWC 10/20/14 tutorial 23
  • 24. Querying Brown Corpus Building the Multilingual Web of Data – ISWC 10/20/14 tutorial 24
  • 25. Querying Brown Corpus Building the Multilingual Web of Data – ISWC 10/20/14 tutorial 25
  • 26. Querying Brown Corpus Building the Multilingual Web of Data – ISWC 10/20/14 tutorial 26
  • 27. Exercise 3: Querying your own NIF annotated string Building the Multilingual Web of Data – ISWC 10/20/14 tutorial 27
  • 28. Querying your own NIF annotated string 1. Annotate your string using one of the wrappers 2. Save your annotated sentence to a file (using “--outfile”) 3. Open Twinkle 4. Query your string using Twinkle Building the Multilingual Web of Data – ISWC 10/20/14 tutorial 28
  • 29. • Query your annotated string: – nif:Context – nif:Sentence – nif:anchorOf – nif:oliaCategory – nif:oliaLink … or practice with Brown Corpus! Building the Multilingual Web of Data – ISWC 10/20/14 tutorial 29
  • 30. Thank you! http://site.nlp2rdf.org/ NLP2RDF Google+ Community Building the Multilingual Web of Data – ISWC 10/20/14 tutorial 30