SlideShare une entreprise Scribd logo
1  sur  30
Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 1
Co-funded by the Horizon 2020
Framework Programme of the European Union
Grant Agreement Number 644771
MLKREP, 10 JULY 2015
Felix Sasaki
DFKI / W3C Fellow
APPROACHES AND APPLICATION
SCENARIOS FOR INTEGRATING
MULTILINGUAL KNOWLEDGE
RESOURCES AND WEB CONTENT
www.freme-project.eu
Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 2
BACKGROUND: THE FREME PROJECT
Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 3
THE FREME PROJECT
• Two year H2020 Innovation action; start February 2015
• Industry partners leading four business cases around
digital content and (linked) data
• Technology development bridging language and data
• Outreach and business modelling demonstrating monetization of the multilingual
data value chain
Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 4
CHALLENGE AND OPPORTUNITY: BIG DATA IS GROWING ACROSS
LANGUAGES, SECTORS AND DOMAINS
• BC: Digital publishing
• BC: Translation and localisation
• BC: Agriculture and food domain data
• BC: Web site personalisation
Agriculture
metadata, user
content, news
content, …
WHAT LIES AHEAD FOR SEVERAL INDUSTRIES? SEE THE FREME BUSINESS CASES
EN
ES
JA, ZH, ...
AR
Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 5
CURRENT STATE OF SOLUTIONS
Machine
translation,
terminology
annotation, ...
Linked data
creation &
processing
GAPS THAT HINDER BUSINESS:
• Plethora of formats
• Adaptability and platform dependency
• Language coverage
• Usability “The right tool for the right person
in given and new enterprises”:
technology influences job profiles
Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 6
FREME TO THE RESCUE: ENRICHING DIGITAL CONTENT
Machine
translation,
terminology
annotation, ...
Linked data
creation &
processing
LT and LD as first class
citizens on the Web
A SET OF INTERFACES* - DESIGN DRIVEN
BY BUSINESS CASES
LT and LD for various
user types: (application)
developer, content
architect, content
author, …
* Graphical interfaces
* Software Interfaces
Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 7
Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 8
EACH SERVICE IN ONE SENTENCE
• e-Translation: “Translate from Dutch to English”
• e-Terminology: “Add terminology annotations”
• e-Entity: “Identify unique entities”
• e-Link: “Add information from (linked open) data sources”
• e-Publishing: “Publish as digital book content”
• e-Internationalisation: “Use standardised metadata for multilingual content
production”
A KEY ASPECT FREME: FREME will allow to combine data and language technologies via
adequate software interfaces (APIs) and graphical user interfaces (GUIs)
Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 9
CHALLENGES FOR MULTILINGUAL
KNOWLEDGE RESOURCES AND SOLUTIONS
PROVIDED BY FREME
Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 10
CHALLENGE:
INTEGRATION OF KNOWLEDGE RESOURCES INTO CONTENT
• Content comes in a plethora of formats
• There is no standardised way to representation knowledge related information in
widely used content formats
• Keynote from Michael Wetzel: too many competing formats!
◦ SKOS, OWL, TBX, …
• Solution by FREME:
◦ Using NIF to represent natural natural language processing workflows
◦ Enrich with interlinked information
◦ Linking => benefit from the network effect on the Web
Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 11
WHAT IS NIF?
• Natural Language Processing Interchange Format
• See http://nlp2rdf.org/
• Linked Data format to store annotations & to organize NLP pipelines
• API specification to create NIF workflows
• Following slides: main roles for NIF
Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 12
EXAMPLE (PARTIAL; JSON-LD SYNTAX)
{ "@graph" : [ {
"@id" : "p:char=0,18",
"@type" : [ "nif:Context", "nif:Sentence", "nif:RFC5147String" ],
"anchorOf" : "Welcome to Prague.",
"beginIndex" : "0",
"endIndex" : "18",
"isString" : "Welcome to Prague.",
"referenceContext" : "p:char=0,18”
}, {
"@id" : "p:char=11,17",
"@type" : [ "nif:RFC5147String", "nif:Word" ], …
"referenceContext" : "p:char=0,18",
"taIdentRef" : "http://dbpedia.org/resource/Prague" }, …] }
Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 13
EXAMPLE (PARTIAL; JSON-LD SYNTAX)
{ "@graph" : [ {
"@id" : "p:char=0,18",
"@type" : [ "nif:Context", "nif:Sentence", "nif:RFC5147String" ],
"anchorOf" : "Welcome to Prague.",
"beginIndex" : "0",
"endIndex" : "18",
"isString" : "Welcome to Prague.",
"referenceContext" : "p:char=0,18”
}, {
"@id" : "p:char=11,17",
"@type" : [ "nif:RFC5147String", "nif:Word" ], …
"referenceContext" : "p:char=0,18",
"taIdentRef" : "http://dbpedia.org/resource/Prague" }, …] }
• Identifying and typing
annotations
• Identifying annotation
offsets
• Adding additional
knowledge, e.g. named
entity identifier
• Interrelating
annotations
Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 14
EXAMPLE (PARTIAL; JSON-LD SYNTAX)
{ "@graph" : [ {
"@id" : "p:char=0,18",
"@type" : [ "nif:Context", "nif:Sentence", "nif:RFC5147String" ],
"anchorOf" : "Welcome to Prague.",
"beginIndex" : "0",
"endIndex" : "18",
"isString" : "Welcome to Prague.",
"referenceContext" : "p:char=0,18”
}, {
"@id" : "p:char=11,17",
"@type" : [ "nif:RFC5147String", "nif:Word" ], …
"referenceContext" : "p:char=0,18",
"taIdentRef" : "http://dbpedia.org/resource/Prague" }, …] }
• Identifying and typing
annotations
• Identifying annotation
offsets
• Adding additional
knowledge, e.g. named
entity identifier
• Interrelating
annotations
Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 15
EXAMPLE (PARTIAL; JSON-LD SYNTAX)
{ "@graph" : [ {
"@id" : "p:char=0,18",
"@type" : [ "nif:Context", "nif:Sentence", "nif:RFC5147String" ],
"anchorOf" : "Welcome to Prague.",
"beginIndex" : "0",
"endIndex" : "18",
"isString" : "Welcome to Prague.",
"referenceContext" : "p:char=0,18”
}, {
"@id" : "p:char=11,17",
"@type" : [ "nif:RFC5147String", "nif:Word" ], …
"referenceContext" : "p:char=0,18",
"taIdentRef" : "http://dbpedia.org/resource/Prague" }, …] }
• Identifying and typing
annotations
• Identifying annotation
offsets
• Adding additional
knowledge, e.g.
named entity identifier
• Interrelating
annotations
Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 16
EXAMPLE (PARTIAL; JSON-LD SYNTAX)
{ "@graph" : [ {
"@id" : "p:char=0,18",
"@type" : [ "nif:Context", "nif:Sentence", "nif:RFC5147String" ],
"anchorOf" : "Welcome to Prague.",
"beginIndex" : "0",
"endIndex" : "18",
"isString" : "Welcome to Prague.",
"referenceContext" : "p:char=0,18”
}, {
"@id" : "p:char=11,17",
"@type" : [ "nif:RFC5147String", "nif:Word" ], …
"referenceContext" : "p:char=0,18",
"taIdentRef" : "http://dbpedia.org/resource/Prague" }, …] }
• Identifying and typing
annotations
• Identifying annotation
offsets
• Adding additional
knowledge, e.g.
named entity identifier
• Interrelating
annotations
Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 17
A POTENTIAL NIF WORKFLOW
Existing
content
Content analytics, e.g.
named entity
recognition
Conversion to
NIF
Deploying knowledge from the
Linguistic Linked Data (LLD) cloud
Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 18
A POTENTIAL NIF WORKFLOW
Existing
content
Content analytics, e.g.
named entity
recognition
Conversion to
NIF
Deploying knowledge from the
Linguistic Linked Data (LLD) cloud
Integrating world knowledge and
terminological knowledge
Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 19
INTEGRATING WORLD KNOWLEDGE AND
TERMINOLOGICAL KNOWLEDGE
{ "@graph" : [ {
"@id" : "p:char=0,21", …
"isString" : "I have a screwdriver.",
"referenceContext" : "p:char=0,21"
}, …] }
• Step 1: creating NIF
from existing content
Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 20
INTEGRATING WORLD KNOWLEDGE AND
TERMINOLOGICAL KNOWLEDGE
{ "@graph" : [ {
"@id" : "p:char=0,21", …
"isString" : "I have a screwdriver.",
"referenceContext" : "p:char=0,21"
}, {
"@id" : "p:char=9,20", …
"taIdentRef" : "http://dbpedia.org/resource/screwdriver" }, …] }
• Step 1: creating NIF
from existing content
• Step 2: adding world
knowledge based on
Dbpedia
Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 21
INTEGRATING WORLD KNOWLEDGE AND
TERMINOLOGICAL KNOWLEDGE
{ "@graph" : [ {
"@id" : "p:char=0,21", …
"isString" : "I have a screwdriver.",
"referenceContext" : "p:char=0,21"
}, {
"@id" : "p:char=9,20", …
"taIdentRef" : "http://dbpedia.org/resource/screwdriver" },
"termInfoRef" : "http://tbx2rdf.lider-project.eu/…/query=schraubendreher" },
…] }
• Step 1: creating NIF
from existing content
• Step 2: adding world
knowledge based on
Dbpedia
• Step 3: adding
terminological
knowledge from IATE
Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 22
INTEGRATING WORLD KNOWLEDGE AND
TERMINOLOGICAL KNOWLEDGE
{ "@graph" : [ {
"@id" : "p:char=0,21", …
"isString" : "I have a screwdriver.",
"referenceContext" : "p:char=0,21"
}, {
"@id" : "p:char=9,20", …
"taIdentRef" : "http://dbpedia.org/resource/screwdriver" },
"termInfoRef" : "http://tbx2rdf.lider-project.eu/…/query=schraubendreher" },
…] }
• Step 1: creating NIF
from existing content
• Step 2: adding world
knowledge based on
Dbpedia
• Step 3: adding
terminological
knowledge from IATE
• IATE is used as a linked data version, via
http://tbx2rdf.lider-project.eu
• The query to IATE uses the translation suggested from DBpedia
• The network effect: interlinking adds value 
Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 23
SAMPLE APPLICATION SCENARIOS
Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 24
AUTHORING AND PUBLISHING MULTILINGUALLY AND SEMANTICALLY
ENRICHED EBOOKS
• Example: Integration into ePub editing mode of oXygen XML Editor
e-Entity: annotate named entities
Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 25
INTEGRATING SEMANTIC ENRICHMENT INTO MULTILINGUAL
CONTENT IN TRANSLATION AND LOCALISATION
• Example: Integration into XLIFF 2.0 editing mode of oXygen XML Editor
• Combination of services
◦ e-Entity: annotate named entities; e-Terminology: fetch terminological information
◦ e-Link: fetch additional information from a linked data source like DBpedia, specific to the
type of entities (places, persons, …)
Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 26
INTEGRATING SEMANTIC ENRICHMENT INTO MULTILINGUAL
CONTENT IN TRANSLATION AND LOCALISATION
• Enriching content with machine readable information – represented as JSON-LD
◦ Input: “Welcome to Berlin … Marlene Dietrich!”
◦ Output:
[
{
"@id": "dbpedia:Marlene_Dietrich",
"@type": "person",
"born": "1901-12-27"
}
]
May be basis e.g. for further processing, e.g.
multilingual generation:
• “… born 1901”
• “… geboren 1901”
• “…1901年生まれ”
• …
Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 27
DEMO
Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 28
DEMO
• Generating translation suggestions
• Knowledge being used
◦ World knowledge: DBedia
◦ Terminological knowledge: IATE
• Storage in ePub based on Internationalization Tag Set (ITS) 2.0
◦ Standardised markup for multilingual content production
◦ Storage of translation suggestions here are ITS “Localization Note”
Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 29
WANT TO TRY THINGS OUT?
• Go to http://api.freme-project.eu/doc/0.1/
• Check out API demo calls
• Time line for next prototypes
◦ 0.2: mid July
◦ 0.3: end of August
◦ Feedback to GitHub: https://github.com/freme-project
- Will be made public repro mid July
Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 30
CONTACTS
Felix Sasaki, on behalf of the FREME consortium
E-mail: felix.sasaki@dfki.de
CONSORTIUM

Contenu connexe

En vedette

Gov.uk content schemas tech monthly may 2015
Gov.uk content schemas   tech monthly may 2015Gov.uk content schemas   tech monthly may 2015
Gov.uk content schemas tech monthly may 2015David Heath
 
Automobile industry1 3
Automobile industry1 3Automobile industry1 3
Automobile industry1 3Samuel Gibbs
 
claims manager certification
claims manager certificationclaims manager certification
claims manager certificationVskills
 
The Last Vacation
The Last VacationThe Last Vacation
The Last VacationAlexa18S
 
Social media branding la miglior occasione di marketing sociale marco apadula
Social media branding la miglior occasione di marketing sociale marco apadulaSocial media branding la miglior occasione di marketing sociale marco apadula
Social media branding la miglior occasione di marketing sociale marco apadulaSocialMediaDayMI
 
Fundação Vanzolini - Bruno Casa Grande
Fundação Vanzolini - Bruno Casa GrandeFundação Vanzolini - Bruno Casa Grande
Fundação Vanzolini - Bruno Casa Grandeforumsustentar
 
School Managment Software for School & College
School Managment Software for School & CollegeSchool Managment Software for School & College
School Managment Software for School & CollegeMaan21
 
Os primeiros programas de atendime
Os primeiros programas de atendimeOs primeiros programas de atendime
Os primeiros programas de atendimeAlexandre Araujo
 

En vedette (16)

Resume-arunangshu 2
Resume-arunangshu 2Resume-arunangshu 2
Resume-arunangshu 2
 
Gov.uk content schemas tech monthly may 2015
Gov.uk content schemas   tech monthly may 2015Gov.uk content schemas   tech monthly may 2015
Gov.uk content schemas tech monthly may 2015
 
Automobile industry1 3
Automobile industry1 3Automobile industry1 3
Automobile industry1 3
 
claims manager certification
claims manager certificationclaims manager certification
claims manager certification
 
Tarjeta tanik
Tarjeta tanikTarjeta tanik
Tarjeta tanik
 
The Last Vacation
The Last VacationThe Last Vacation
The Last Vacation
 
Agricultural revolution whist 2
Agricultural revolution whist 2Agricultural revolution whist 2
Agricultural revolution whist 2
 
B Week 3 Assessing Resources 2015 Generic
B Week 3 Assessing Resources 2015 GenericB Week 3 Assessing Resources 2015 Generic
B Week 3 Assessing Resources 2015 Generic
 
Nyakoi001
Nyakoi001Nyakoi001
Nyakoi001
 
Executional Suggestions for Carhartt
Executional Suggestions for CarharttExecutional Suggestions for Carhartt
Executional Suggestions for Carhartt
 
Social media branding la miglior occasione di marketing sociale marco apadula
Social media branding la miglior occasione di marketing sociale marco apadulaSocial media branding la miglior occasione di marketing sociale marco apadula
Social media branding la miglior occasione di marketing sociale marco apadula
 
Fundação Vanzolini - Bruno Casa Grande
Fundação Vanzolini - Bruno Casa GrandeFundação Vanzolini - Bruno Casa Grande
Fundação Vanzolini - Bruno Casa Grande
 
House of Commons Presentation on legislative process July 2015
House of Commons Presentation on legislative process July 2015House of Commons Presentation on legislative process July 2015
House of Commons Presentation on legislative process July 2015
 
School Managment Software for School & College
School Managment Software for School & CollegeSchool Managment Software for School & College
School Managment Software for School & College
 
A' PERUGIA
A' PERUGIAA' PERUGIA
A' PERUGIA
 
Os primeiros programas de atendime
Os primeiros programas de atendimeOs primeiros programas de atendime
Os primeiros programas de atendime
 

Similaire à Sasaki mlkrep-20150710

Felix Sasaki - Value beyond content creation - Introducing ITS 2.0; soapconf ...
Felix Sasaki - Value beyond content creation - Introducing ITS 2.0; soapconf ...Felix Sasaki - Value beyond content creation - Introducing ITS 2.0; soapconf ...
Felix Sasaki - Value beyond content creation - Introducing ITS 2.0; soapconf ...soapconf
 
Sasaki practical-linked-data
Sasaki practical-linked-dataSasaki practical-linked-data
Sasaki practical-linked-dataFelix Sasaki
 
Semantics on services: the story so far (SALAD2015 keynote at ESWC2015)
Semantics on services: the story so far (SALAD2015 keynote at ESWC2015)Semantics on services: the story so far (SALAD2015 keynote at ESWC2015)
Semantics on services: the story so far (SALAD2015 keynote at ESWC2015)Sergio Fernández
 
Big Data Europe SC6 WS #3: PILOT SC6: CITIZEN BUDGET ON MUNICIPAL LEVEL, Mart...
Big Data Europe SC6 WS #3: PILOT SC6: CITIZEN BUDGET ON MUNICIPAL LEVEL, Mart...Big Data Europe SC6 WS #3: PILOT SC6: CITIZEN BUDGET ON MUNICIPAL LEVEL, Mart...
Big Data Europe SC6 WS #3: PILOT SC6: CITIZEN BUDGET ON MUNICIPAL LEVEL, Mart...BigData_Europe
 
Linked data-tooling-xml
Linked data-tooling-xmlLinked data-tooling-xml
Linked data-tooling-xmlFelix Sasaki
 
Introduction to RDF and related Vocabularies/Languages. Introduction to SPARQL
Introduction to RDF and related Vocabularies/Languages. Introduction to SPARQLIntroduction to RDF and related Vocabularies/Languages. Introduction to SPARQL
Introduction to RDF and related Vocabularies/Languages. Introduction to SPARQLPretaLLOD
 
Methodological Guidelines for Publishing Linked Data
Methodological Guidelines for Publishing Linked DataMethodological Guidelines for Publishing Linked Data
Methodological Guidelines for Publishing Linked DataBoris Villazón-Terrazas
 
Lodlam presentation v1.0 final al20151104
Lodlam presentation v1.0 final al20151104Lodlam presentation v1.0 final al20151104
Lodlam presentation v1.0 final al20151104Asa Letourneau
 
Big data Europe: concept, platform and pilots
Big data Europe: concept, platform and pilotsBig data Europe: concept, platform and pilots
Big data Europe: concept, platform and pilotsBigData_Europe
 
IFLA LIDASIG Open Session 2017: Introduction to Linked Data
IFLA LIDASIG Open Session 2017: Introduction to Linked DataIFLA LIDASIG Open Session 2017: Introduction to Linked Data
IFLA LIDASIG Open Session 2017: Introduction to Linked DataLars G. Svensson
 
Sharing irish place names as linked open data - Rebecca Grant
Sharing irish place names as linked open data - Rebecca GrantSharing irish place names as linked open data - Rebecca Grant
Sharing irish place names as linked open data - Rebecca Grantdri_ireland
 
The Nature.com ontologies portal - Linked Science 2015
The Nature.com ontologies portal - Linked Science 2015The Nature.com ontologies portal - Linked Science 2015
The Nature.com ontologies portal - Linked Science 2015Michele Pasin
 
July Clojure Users Group Meeting: "Using Cascalog with Palo Alto Open Data"
July Clojure Users Group Meeting: "Using Cascalog with Palo Alto Open Data"July Clojure Users Group Meeting: "Using Cascalog with Palo Alto Open Data"
July Clojure Users Group Meeting: "Using Cascalog with Palo Alto Open Data"Paco Nathan
 
Open data hackathon jelgava - report
Open data hackathon   jelgava - reportOpen data hackathon   jelgava - report
Open data hackathon jelgava - reportWirelessInfo
 
Europeana Creative. EDM Endpoint. Custom Views
Europeana Creative. EDM Endpoint. Custom ViewsEuropeana Creative. EDM Endpoint. Custom Views
Europeana Creative. EDM Endpoint. Custom ViewsVladimir Alexiev, PhD, PMP
 
The Europeana Strategy and Linked Open Data
The Europeana Strategy and Linked Open DataThe Europeana Strategy and Linked Open Data
The Europeana Strategy and Linked Open DataDavid Haskiya
 

Similaire à Sasaki mlkrep-20150710 (20)

Felix Sasaki - Value beyond content creation - Introducing ITS 2.0; soapconf ...
Felix Sasaki - Value beyond content creation - Introducing ITS 2.0; soapconf ...Felix Sasaki - Value beyond content creation - Introducing ITS 2.0; soapconf ...
Felix Sasaki - Value beyond content creation - Introducing ITS 2.0; soapconf ...
 
Sasaki practical-linked-data
Sasaki practical-linked-dataSasaki practical-linked-data
Sasaki practical-linked-data
 
Semantics on services: the story so far (SALAD2015 keynote at ESWC2015)
Semantics on services: the story so far (SALAD2015 keynote at ESWC2015)Semantics on services: the story so far (SALAD2015 keynote at ESWC2015)
Semantics on services: the story so far (SALAD2015 keynote at ESWC2015)
 
Linked Open Data and Ontotext Projects
Linked Open Data and Ontotext ProjectsLinked Open Data and Ontotext Projects
Linked Open Data and Ontotext Projects
 
Linked data tooling XML
Linked data tooling XMLLinked data tooling XML
Linked data tooling XML
 
Big Data Europe SC6 WS #3: PILOT SC6: CITIZEN BUDGET ON MUNICIPAL LEVEL, Mart...
Big Data Europe SC6 WS #3: PILOT SC6: CITIZEN BUDGET ON MUNICIPAL LEVEL, Mart...Big Data Europe SC6 WS #3: PILOT SC6: CITIZEN BUDGET ON MUNICIPAL LEVEL, Mart...
Big Data Europe SC6 WS #3: PILOT SC6: CITIZEN BUDGET ON MUNICIPAL LEVEL, Mart...
 
LOD2 Webinar Series FOX
LOD2 Webinar Series FOXLOD2 Webinar Series FOX
LOD2 Webinar Series FOX
 
Linked data-tooling-xml
Linked data-tooling-xmlLinked data-tooling-xml
Linked data-tooling-xml
 
Introduction to RDF and related Vocabularies/Languages. Introduction to SPARQL
Introduction to RDF and related Vocabularies/Languages. Introduction to SPARQLIntroduction to RDF and related Vocabularies/Languages. Introduction to SPARQL
Introduction to RDF and related Vocabularies/Languages. Introduction to SPARQL
 
Methodological Guidelines for Publishing Linked Data
Methodological Guidelines for Publishing Linked DataMethodological Guidelines for Publishing Linked Data
Methodological Guidelines for Publishing Linked Data
 
Lodlam presentation v1.0 final al20151104
Lodlam presentation v1.0 final al20151104Lodlam presentation v1.0 final al20151104
Lodlam presentation v1.0 final al20151104
 
Big data Europe: concept, platform and pilots
Big data Europe: concept, platform and pilotsBig data Europe: concept, platform and pilots
Big data Europe: concept, platform and pilots
 
IFLA LIDASIG Open Session 2017: Introduction to Linked Data
IFLA LIDASIG Open Session 2017: Introduction to Linked DataIFLA LIDASIG Open Session 2017: Introduction to Linked Data
IFLA LIDASIG Open Session 2017: Introduction to Linked Data
 
Sharing irish place names as linked open data - Rebecca Grant
Sharing irish place names as linked open data - Rebecca GrantSharing irish place names as linked open data - Rebecca Grant
Sharing irish place names as linked open data - Rebecca Grant
 
Towards a Linked Data Publishing Methodology
Towards a Linked Data Publishing MethodologyTowards a Linked Data Publishing Methodology
Towards a Linked Data Publishing Methodology
 
The Nature.com ontologies portal - Linked Science 2015
The Nature.com ontologies portal - Linked Science 2015The Nature.com ontologies portal - Linked Science 2015
The Nature.com ontologies portal - Linked Science 2015
 
July Clojure Users Group Meeting: "Using Cascalog with Palo Alto Open Data"
July Clojure Users Group Meeting: "Using Cascalog with Palo Alto Open Data"July Clojure Users Group Meeting: "Using Cascalog with Palo Alto Open Data"
July Clojure Users Group Meeting: "Using Cascalog with Palo Alto Open Data"
 
Open data hackathon jelgava - report
Open data hackathon   jelgava - reportOpen data hackathon   jelgava - report
Open data hackathon jelgava - report
 
Europeana Creative. EDM Endpoint. Custom Views
Europeana Creative. EDM Endpoint. Custom ViewsEuropeana Creative. EDM Endpoint. Custom Views
Europeana Creative. EDM Endpoint. Custom Views
 
The Europeana Strategy and Linked Open Data
The Europeana Strategy and Linked Open DataThe Europeana Strategy and Linked Open Data
The Europeana Strategy and Linked Open Data
 

Dernier

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 

Dernier (20)

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 

Sasaki mlkrep-20150710

  • 1. Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 1 Co-funded by the Horizon 2020 Framework Programme of the European Union Grant Agreement Number 644771 MLKREP, 10 JULY 2015 Felix Sasaki DFKI / W3C Fellow APPROACHES AND APPLICATION SCENARIOS FOR INTEGRATING MULTILINGUAL KNOWLEDGE RESOURCES AND WEB CONTENT www.freme-project.eu
  • 2. Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 2 BACKGROUND: THE FREME PROJECT
  • 3. Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 3 THE FREME PROJECT • Two year H2020 Innovation action; start February 2015 • Industry partners leading four business cases around digital content and (linked) data • Technology development bridging language and data • Outreach and business modelling demonstrating monetization of the multilingual data value chain
  • 4. Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 4 CHALLENGE AND OPPORTUNITY: BIG DATA IS GROWING ACROSS LANGUAGES, SECTORS AND DOMAINS • BC: Digital publishing • BC: Translation and localisation • BC: Agriculture and food domain data • BC: Web site personalisation Agriculture metadata, user content, news content, … WHAT LIES AHEAD FOR SEVERAL INDUSTRIES? SEE THE FREME BUSINESS CASES EN ES JA, ZH, ... AR
  • 5. Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 5 CURRENT STATE OF SOLUTIONS Machine translation, terminology annotation, ... Linked data creation & processing GAPS THAT HINDER BUSINESS: • Plethora of formats • Adaptability and platform dependency • Language coverage • Usability “The right tool for the right person in given and new enterprises”: technology influences job profiles
  • 6. Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 6 FREME TO THE RESCUE: ENRICHING DIGITAL CONTENT Machine translation, terminology annotation, ... Linked data creation & processing LT and LD as first class citizens on the Web A SET OF INTERFACES* - DESIGN DRIVEN BY BUSINESS CASES LT and LD for various user types: (application) developer, content architect, content author, … * Graphical interfaces * Software Interfaces
  • 7. Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 7
  • 8. Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 8 EACH SERVICE IN ONE SENTENCE • e-Translation: “Translate from Dutch to English” • e-Terminology: “Add terminology annotations” • e-Entity: “Identify unique entities” • e-Link: “Add information from (linked open) data sources” • e-Publishing: “Publish as digital book content” • e-Internationalisation: “Use standardised metadata for multilingual content production” A KEY ASPECT FREME: FREME will allow to combine data and language technologies via adequate software interfaces (APIs) and graphical user interfaces (GUIs)
  • 9. Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 9 CHALLENGES FOR MULTILINGUAL KNOWLEDGE RESOURCES AND SOLUTIONS PROVIDED BY FREME
  • 10. Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 10 CHALLENGE: INTEGRATION OF KNOWLEDGE RESOURCES INTO CONTENT • Content comes in a plethora of formats • There is no standardised way to representation knowledge related information in widely used content formats • Keynote from Michael Wetzel: too many competing formats! ◦ SKOS, OWL, TBX, … • Solution by FREME: ◦ Using NIF to represent natural natural language processing workflows ◦ Enrich with interlinked information ◦ Linking => benefit from the network effect on the Web
  • 11. Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 11 WHAT IS NIF? • Natural Language Processing Interchange Format • See http://nlp2rdf.org/ • Linked Data format to store annotations & to organize NLP pipelines • API specification to create NIF workflows • Following slides: main roles for NIF
  • 12. Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 12 EXAMPLE (PARTIAL; JSON-LD SYNTAX) { "@graph" : [ { "@id" : "p:char=0,18", "@type" : [ "nif:Context", "nif:Sentence", "nif:RFC5147String" ], "anchorOf" : "Welcome to Prague.", "beginIndex" : "0", "endIndex" : "18", "isString" : "Welcome to Prague.", "referenceContext" : "p:char=0,18” }, { "@id" : "p:char=11,17", "@type" : [ "nif:RFC5147String", "nif:Word" ], … "referenceContext" : "p:char=0,18", "taIdentRef" : "http://dbpedia.org/resource/Prague" }, …] }
  • 13. Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 13 EXAMPLE (PARTIAL; JSON-LD SYNTAX) { "@graph" : [ { "@id" : "p:char=0,18", "@type" : [ "nif:Context", "nif:Sentence", "nif:RFC5147String" ], "anchorOf" : "Welcome to Prague.", "beginIndex" : "0", "endIndex" : "18", "isString" : "Welcome to Prague.", "referenceContext" : "p:char=0,18” }, { "@id" : "p:char=11,17", "@type" : [ "nif:RFC5147String", "nif:Word" ], … "referenceContext" : "p:char=0,18", "taIdentRef" : "http://dbpedia.org/resource/Prague" }, …] } • Identifying and typing annotations • Identifying annotation offsets • Adding additional knowledge, e.g. named entity identifier • Interrelating annotations
  • 14. Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 14 EXAMPLE (PARTIAL; JSON-LD SYNTAX) { "@graph" : [ { "@id" : "p:char=0,18", "@type" : [ "nif:Context", "nif:Sentence", "nif:RFC5147String" ], "anchorOf" : "Welcome to Prague.", "beginIndex" : "0", "endIndex" : "18", "isString" : "Welcome to Prague.", "referenceContext" : "p:char=0,18” }, { "@id" : "p:char=11,17", "@type" : [ "nif:RFC5147String", "nif:Word" ], … "referenceContext" : "p:char=0,18", "taIdentRef" : "http://dbpedia.org/resource/Prague" }, …] } • Identifying and typing annotations • Identifying annotation offsets • Adding additional knowledge, e.g. named entity identifier • Interrelating annotations
  • 15. Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 15 EXAMPLE (PARTIAL; JSON-LD SYNTAX) { "@graph" : [ { "@id" : "p:char=0,18", "@type" : [ "nif:Context", "nif:Sentence", "nif:RFC5147String" ], "anchorOf" : "Welcome to Prague.", "beginIndex" : "0", "endIndex" : "18", "isString" : "Welcome to Prague.", "referenceContext" : "p:char=0,18” }, { "@id" : "p:char=11,17", "@type" : [ "nif:RFC5147String", "nif:Word" ], … "referenceContext" : "p:char=0,18", "taIdentRef" : "http://dbpedia.org/resource/Prague" }, …] } • Identifying and typing annotations • Identifying annotation offsets • Adding additional knowledge, e.g. named entity identifier • Interrelating annotations
  • 16. Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 16 EXAMPLE (PARTIAL; JSON-LD SYNTAX) { "@graph" : [ { "@id" : "p:char=0,18", "@type" : [ "nif:Context", "nif:Sentence", "nif:RFC5147String" ], "anchorOf" : "Welcome to Prague.", "beginIndex" : "0", "endIndex" : "18", "isString" : "Welcome to Prague.", "referenceContext" : "p:char=0,18” }, { "@id" : "p:char=11,17", "@type" : [ "nif:RFC5147String", "nif:Word" ], … "referenceContext" : "p:char=0,18", "taIdentRef" : "http://dbpedia.org/resource/Prague" }, …] } • Identifying and typing annotations • Identifying annotation offsets • Adding additional knowledge, e.g. named entity identifier • Interrelating annotations
  • 17. Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 17 A POTENTIAL NIF WORKFLOW Existing content Content analytics, e.g. named entity recognition Conversion to NIF Deploying knowledge from the Linguistic Linked Data (LLD) cloud
  • 18. Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 18 A POTENTIAL NIF WORKFLOW Existing content Content analytics, e.g. named entity recognition Conversion to NIF Deploying knowledge from the Linguistic Linked Data (LLD) cloud Integrating world knowledge and terminological knowledge
  • 19. Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 19 INTEGRATING WORLD KNOWLEDGE AND TERMINOLOGICAL KNOWLEDGE { "@graph" : [ { "@id" : "p:char=0,21", … "isString" : "I have a screwdriver.", "referenceContext" : "p:char=0,21" }, …] } • Step 1: creating NIF from existing content
  • 20. Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 20 INTEGRATING WORLD KNOWLEDGE AND TERMINOLOGICAL KNOWLEDGE { "@graph" : [ { "@id" : "p:char=0,21", … "isString" : "I have a screwdriver.", "referenceContext" : "p:char=0,21" }, { "@id" : "p:char=9,20", … "taIdentRef" : "http://dbpedia.org/resource/screwdriver" }, …] } • Step 1: creating NIF from existing content • Step 2: adding world knowledge based on Dbpedia
  • 21. Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 21 INTEGRATING WORLD KNOWLEDGE AND TERMINOLOGICAL KNOWLEDGE { "@graph" : [ { "@id" : "p:char=0,21", … "isString" : "I have a screwdriver.", "referenceContext" : "p:char=0,21" }, { "@id" : "p:char=9,20", … "taIdentRef" : "http://dbpedia.org/resource/screwdriver" }, "termInfoRef" : "http://tbx2rdf.lider-project.eu/…/query=schraubendreher" }, …] } • Step 1: creating NIF from existing content • Step 2: adding world knowledge based on Dbpedia • Step 3: adding terminological knowledge from IATE
  • 22. Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 22 INTEGRATING WORLD KNOWLEDGE AND TERMINOLOGICAL KNOWLEDGE { "@graph" : [ { "@id" : "p:char=0,21", … "isString" : "I have a screwdriver.", "referenceContext" : "p:char=0,21" }, { "@id" : "p:char=9,20", … "taIdentRef" : "http://dbpedia.org/resource/screwdriver" }, "termInfoRef" : "http://tbx2rdf.lider-project.eu/…/query=schraubendreher" }, …] } • Step 1: creating NIF from existing content • Step 2: adding world knowledge based on Dbpedia • Step 3: adding terminological knowledge from IATE • IATE is used as a linked data version, via http://tbx2rdf.lider-project.eu • The query to IATE uses the translation suggested from DBpedia • The network effect: interlinking adds value 
  • 23. Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 23 SAMPLE APPLICATION SCENARIOS
  • 24. Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 24 AUTHORING AND PUBLISHING MULTILINGUALLY AND SEMANTICALLY ENRICHED EBOOKS • Example: Integration into ePub editing mode of oXygen XML Editor e-Entity: annotate named entities
  • 25. Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 25 INTEGRATING SEMANTIC ENRICHMENT INTO MULTILINGUAL CONTENT IN TRANSLATION AND LOCALISATION • Example: Integration into XLIFF 2.0 editing mode of oXygen XML Editor • Combination of services ◦ e-Entity: annotate named entities; e-Terminology: fetch terminological information ◦ e-Link: fetch additional information from a linked data source like DBpedia, specific to the type of entities (places, persons, …)
  • 26. Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 26 INTEGRATING SEMANTIC ENRICHMENT INTO MULTILINGUAL CONTENT IN TRANSLATION AND LOCALISATION • Enriching content with machine readable information – represented as JSON-LD ◦ Input: “Welcome to Berlin … Marlene Dietrich!” ◦ Output: [ { "@id": "dbpedia:Marlene_Dietrich", "@type": "person", "born": "1901-12-27" } ] May be basis e.g. for further processing, e.g. multilingual generation: • “… born 1901” • “… geboren 1901” • “…1901年生まれ” • …
  • 27. Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 27 DEMO
  • 28. Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 28 DEMO • Generating translation suggestions • Knowledge being used ◦ World knowledge: DBedia ◦ Terminological knowledge: IATE • Storage in ePub based on Internationalization Tag Set (ITS) 2.0 ◦ Standardised markup for multilingual content production ◦ Storage of translation suggestions here are ITS “Localization Note”
  • 29. Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 29 WANT TO TRY THINGS OUT? • Go to http://api.freme-project.eu/doc/0.1/ • Check out API demo calls • Time line for next prototypes ◦ 0.2: mid July ◦ 0.3: end of August ◦ Feedback to GitHub: https://github.com/freme-project - Will be made public repro mid July
  • 30. Sasaki – MLKRep – 10 July 2015 WWW.FREME-PROJECT.EU 30 CONTACTS Felix Sasaki, on behalf of the FREME consortium E-mail: felix.sasaki@dfki.de CONSORTIUM

Notes de l'éditeur

  1. This slide probably needs no visualization.
  2. BC 1 “Digital publishing”: Digital content itself is exploding and is loosing value BC 2 “Translation and Localisation”: Demand for speed and quality is increasing, prices are going down BC 3 “Agriculture and food data”: Discovery of data is difficult due to missing multilingual metadata BC 4 “Web site personalisation”: solutions are focusing on English speaking market
  3. Robust language technologies Machine translation, terminology extraction & annotation Robust linked data (LD) technologies Entity annotation, linking to data sources More and more platforms as silos that allow to deploy certain parts of these technology stacks GAPS that hinder businesses: No easy to use interfaces to LT and LD tooling Plethora of formats to process Adaptability and scalability of solutions Usability: “Give the adequate tool to the right person”
  4. e-Translation: “Translate from Dutch to English” e-Terminology: “Add terminology annotations” e-Entity: “Identify unique entities” e-Link: “Add information from (linked open) data sources” e-Publishing: “Publish as digital book content” e-Internationalisation: “Use standardised metadata for multilingual content production” A KEY ASPECT FREME: FREME will allow to combine data and language technologies via adequate software interfaces (APIs) and graphical user interfaces (GUIs)
  5. Back Page #1 Social network icons refer to speaker (he/she has to link his/her accounts)