Slides of the first paper on the ULiS project, available at http://maxime-lefrancois.info/Publications
We are interested in bridging the world of natural language and the world of the semantic web, in particular to support natural multilingual access to the web of data. In this paper, we introduce a new type of lexical ontology called an interlingual lexical ontology (ILexicOn), which uses semantic web formalisms to make each interlingual lexical unit class (ILUc) support the projection of its semantic decomposition on itself. After a short overview of existing lexical ontologies, we briefly introduce the semantic web formalisms we use. We then present the three-layered architecture of our approach: i) the interlingual lexical metaontology (ILexiMOn); ii) the ILexicOn, where ILUcs are formally defined; iii) the data layer. We illustrate our approach with a standalone ILexicOn, and introduce and explain a concise human-readable notation to represent ILexicOns. Finally, we show how semantic web formalisms enable the projection of a semantic decomposition on the decomposed ILUc.
2.
open and link data
Governments, organisations, …
Lefrançois & Gandon, ILexicOn. MTT 2011 - 2
3.
a Web of Linked Open Data
One standard beats them all: RDF
[Figure: Linking Open Data cloud diagram at five dates: May 2007, April 2008, September 2008, March 2009, September 2010]
Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/
4.
>25 billion RDF triples
203 data sets
Interlinked by >395 million RDF links
September 2010
5.
Web of data
?
multilingual access to the web of data
The ULiS project
6. The core of the ULiS project:
ILexicOn:
An ECD-compliant
interlingual lexical ontology
described with semantic web formalisms
Maxime Lefrançois & Fabien Gandon
Edelweiss – INRIA Sophia-Antipolis – France
{Maxime.Lefrancois|Fabien.Gandon}@inria.fr
MTT 2011
7.
OUTLINE
The Semantic Web Formalisms
The ULiS project: multilingual access to the web of data
Proposal 1: An architecture that unlocks motivating scenarios
The ILexicOn: defining interlingual lexical units
Proposal 2: A formal way to represent lexicographic definitions
8. 1. The Semantic Web Formalisms
9.
One standard beats them all: RDF
September 2010
10.
RDF stands for
Resource: pages, images, ILUc, ... everything that can have a URI
Description: attributes, features, and relations of the resources
Framework: model, languages and syntaxes for these descriptions
12.
RDF is a triple model, i.e. every piece of knowledge is broken down into
( subject , predicate , object )
13.
doc.html has for author Fabien and has for theme Music
14.
( doc.html , author , Fabien )
( doc.html , theme , Music )
the RDF atom: a triple
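The two statements about doc.html can be held as plain (subject, predicate, object) tuples. The sketch below is only an illustration of the triple model: the `Triple` type and the `describe` helper are names introduced here, not part of RDF or of the ULiS project.

```python
from typing import NamedTuple


class Triple(NamedTuple):
    """One RDF-style statement (illustrative type, not a standard API)."""
    subject: str
    predicate: str
    obj: str


# The two statements about doc.html, one triple each.
triples = [
    Triple("doc.html", "author", "Fabien"),
    Triple("doc.html", "theme", "Music"),
]


def describe(subject: str, store: list[Triple]) -> dict[str, str]:
    """Collect every predicate/object pair known about one subject."""
    return {t.predicate: t.obj for t in store if t.subject == subject}


print(describe("doc.html", triples))
```

Each sentence fragment ("has for author Fabien") becomes exactly one tuple, which is why the triple is called the RDF atom.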
15.
RDF: a graph
[Figure: doc.html --author--> Fabien; doc.html --theme--> Music]
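A set of triples reads as a labelled directed graph: subjects and objects are nodes, predicates are edge labels. A minimal sketch, assuming an adjacency-list layout chosen here for illustration (`to_graph` is a hypothetical helper):

```python
from collections import defaultdict

# Same two statements as on the previous slides.
triples = [
    ("doc.html", "author", "Fabien"),
    ("doc.html", "theme", "Music"),
]


def to_graph(statements):
    """Group outgoing edges (label, target) by their source node."""
    graph = defaultdict(list)
    for subject, predicate, obj in statements:
        graph[subject].append((predicate, obj))
    return dict(graph)


graph = to_graph(triples)
for label, target in graph["doc.html"]:
    print(f"doc.html --{label}--> {target}")
```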
16.
resources and properties are identified by URIs
[Figure: http://inria.fr/rr/doc.html --http://inria.fr/schema#author--> http://inria.fr/~fabien#me; http://inria.fr/rr/doc.html --http://inria.fr/schema#theme--> Music]
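Full URIs are verbose, so they are commonly abbreviated as prefix:local names. The sketch below shows the expansion step; the `inria` prefix binding matches the schema URI above, while the `expand` function itself is a hypothetical helper written for this example.

```python
# Prefix table: short name -> namespace URI.
PREFIXES = {
    "inria": "http://inria.fr/schema#",
}


def expand(name: str, prefixes: dict[str, str] = PREFIXES) -> str:
    """Expand 'prefix:local' into a full URI; leave anything else untouched."""
    prefix, sep, local = name.partition(":")
    if sep and prefix in prefixes:
        return prefixes[prefix] + local
    return name


triple = (
    "http://inria.fr/rr/doc.html",
    expand("inria:author"),
    "http://inria.fr/~fabien#me",
)
print(triple)
```

Strings whose prefix is not in the table, such as a full http URI or the literal Music, pass through unchanged.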
17.
RDF has an XML syntax
<rdf:RDF
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:inria="http://inria.fr/schema#">
  <rdf:Description rdf:about="http://inria.fr/rr/doc.html">
    <inria:author rdf:resource="http://inria.fr/~fabien#me" />
    <inria:theme>Music</inria:theme>
  </rdf:Description>
</rdf:RDF>
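To see that this XML document encodes the same two triples, it can be read back with Python's standard XML parser. This is a sketch for this one simple shape of RDF/XML only (rdf:Description elements with rdf:about, and properties that are either rdf:resource links or plain text); it is not a general RDF/XML parser, and `extract_triples` is a name introduced here.

```python
import xml.etree.ElementTree as ET

RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"

RDF_XML = """<rdf:RDF
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:inria="http://inria.fr/schema#">
  <rdf:Description rdf:about="http://inria.fr/rr/doc.html">
    <inria:author rdf:resource="http://inria.fr/~fabien#me" />
    <inria:theme>Music</inria:theme>
  </rdf:Description>
</rdf:RDF>"""


def extract_triples(xml_text):
    """Return (subject, predicate, object) tuples from simple RDF/XML."""
    root = ET.fromstring(xml_text)
    triples = []
    for description in root.findall(f"{{{RDF_NS}}}Description"):
        subject = description.get(f"{{{RDF_NS}}}about")
        for prop in description:
            # ElementTree tags look like '{namespace}local'; the property
            # URI is the namespace followed directly by the local name.
            predicate = prop.tag[1:].replace("}", "", 1)
            obj = prop.get(f"{{{RDF_NS}}}resource")
            if obj is None:
                obj = (prop.text or "").strip()  # literal value
            triples.append((subject, predicate, obj))
    return triples


for triple in extract_triples(RDF_XML):
    print(triple)
```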
18.
SPARQL: query / update
SPARQL Protocol and RDF Query Language
20.
RDF Schema
to declare classes of resources, properties, and organize their hierarchy
[Figure: class and property hierarchy with Document, Report, Person, creator, author]
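Declaring hierarchies lets a system derive statements that were never written down. The sketch below assumes, for illustration only, that Report is a subclass of Document and that author is a sub-property of creator; the slide's figure suggests this reading but the extracted text does not confirm it. The resource `rr42` and the helper names are hypothetical.

```python
# Assumed hierarchies (child -> parent); an interpretation, not a quotation.
SUBCLASS_OF = {"Report": "Document"}
SUBPROPERTY_OF = {"author": "creator"}


def ancestors(name, hierarchy):
    """Return name followed by all its ancestors in a single-parent hierarchy."""
    chain = [name]
    while chain[-1] in hierarchy:
        chain.append(hierarchy[chain[-1]])
    return chain


def infer(triples):
    """Add the statements implied by the class and property hierarchies."""
    inferred = set(triples)
    for subject, predicate, obj in triples:
        if predicate == "type":
            for cls in ancestors(obj, SUBCLASS_OF):
                inferred.add((subject, "type", cls))
        else:
            for prop in ancestors(predicate, SUBPROPERTY_OF):
                inferred.add((subject, prop, obj))
    return inferred


# "rr42" is a made-up report used only for this example.
facts = [("rr42", "type", "Report"), ("rr42", "author", "Fabien")]
for statement in sorted(infer(facts)):
    print(statement)
```

Under these assumptions, a query for every Document with a creator would also return the report and its author.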
22. 2. The ULiS project:
Multilingual access to the web of data
Web of data
?
23.
A pivot-based NLP technique
Universal Networking Language
+RDF?
+MTT?
[Figure: Web of data; Interlanguage; Convert; Deconvert]
24.
The ULiS project
a Universal Linguistic System
[Figure: Web of data; Interlanguage; Convert; Deconvert]
25.
The ULiS project
a Universal Linguistic System
- to redesign Pivot-based NLP technique
- 100% using Semantic Web formalisms
- compliant with Meaning-Text Theory
27.
The ULiS project - scenario
- Machine Translation
[Figure: the RDF-World, with RDF interlingual representations (IR RDF) above RDF situational representations (SR RDF)]
Input TEXT (EN): John killed Mary.
SR RDF: John01 kill@past Mary01.
SR RDF: John01 tuer@past Mary01.
Output1 TEXT (FR): John a tué Mary.
28.
The ULiS project - scenario
- Machine Translation
[Figure: the RDF-World, with RDF interlingual representations above RDF situational representations]
John killed Mary. / John01 kill@past Mary01.
John a tué Mary. / John01 tuer@past Mary01.
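The machine translation scenario goes through a pivot: an English situational representation is converted to an interlingual one, which is then deconverted to French. The sketch below covers only that middle step, under strong assumptions: the identifier `ilu:KILL` and the lexicon tables are invented for this example, and text analysis and generation are left out.

```python
# Hypothetical lexicons: language-specific lexical unit -> interlingual unit.
TO_PIVOT = {
    "en": {"kill": "ilu:KILL"},
    "fr": {"tuer": "ilu:KILL"},
}


def convert(sr, lang):
    """Situational representation (subject, 'lemma@tense', object) -> pivot."""
    subject, verb, obj = sr
    lemma, _, tense = verb.partition("@")
    return (subject, f"{TO_PIVOT[lang][lemma]}@{tense}", obj)


def deconvert(ir, lang):
    """Pivot representation -> situational representation in one language."""
    subject, verb, obj = ir
    unit, _, tense = verb.partition("@")
    from_pivot = {ilu: lemma for lemma, ilu in TO_PIVOT[lang].items()}
    return (subject, f"{from_pivot[unit]}@{tense}", obj)


english_sr = ("John01", "kill@past", "Mary01")
pivot = convert(english_sr, "en")
french_sr = deconvert(pivot, "fr")
print(french_sr)
```

With a pivot, adding a language means adding one lexicon rather than one translation table per language pair.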
29.
The ULiS project - scenario
- Machine Translation
- Management of Interlingual Knowledge Bases
The RDF-World
SPARQL Request:
SELECT ?person
WHERE {
  ?person ikb:kill ikb:Mary01.
}
RDF Output: ikb:John01
[Figure: the request runs against the IKB and IDBpedia; RDF interlingual representations (IR RDF) sit above RDF situational representations (SR RDF)]
Input TEXT (EN): Who killed Mary?
SR RDF: Who kill@past@? Mary01
SR RDF: John01 kill@past Mary01.
Output2 TEXT (EN): John killed Mary.
30.
The ULiS project - scenario
- Machine Translation
- Management of Interlingual Knowledge Bases
The RDF-World
SPARQL Request:
SELECT ?person
WHERE {
  ?person ikb:kill ikb:Mary01.
}
Output: ikb:John01
[Figure: IDBpedia; RDF interlingual representations above RDF situational representations]
Who killed Mary? / Who kill@past@? Mary01
John killed Mary. / John01 kill@past Mary01.
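The question "Who killed Mary?" becomes a triple pattern with one variable, and answering it means finding the triples that fit the pattern. The sketch below imitates that matching step in plain Python; it is not a SPARQL engine, the `match` function is a name introduced here, and the knowledge base content is assumed from the slide's output (the second fact is invented to show a non-matching triple).

```python
def match(pattern, store):
    """Yield variable bindings for one triple pattern; '?x' terms are variables."""
    for triple in store:
        bindings = {}
        for term, value in zip(pattern, triple):
            if term.startswith("?"):
                # A variable must keep the same value everywhere in the pattern.
                if bindings.setdefault(term, value) != value:
                    break
            elif term != value:
                break
        else:
            yield bindings


# Assumed content of the interlingual knowledge base (IKB).
ikb = [
    ("ikb:John01", "ikb:kill", "ikb:Mary01"),
    ("ikb:Mary01", "rdf:type", "ikb:Person"),  # hypothetical extra fact
]

killers = [b["?person"] for b in match(("?person", "ikb:kill", "ikb:Mary01"), ikb)]
print(killers)
```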
31.
The ULiS project - scenario
- Machine Translation
- Management of Interlingual Knowledge Bases
- Management of the Universal Linguistic Knowledge base
[Figure: the RDF-World with the ULK and IKB knowledge bases, SPARQL requests and RDF output, RDF interlingual representations (IR RDF), RDF situational representations (SR RDF), Input TEXT, Output1 TEXT and Output2 TEXT]
41. With the Semantic Web formalisms,
we designed a simple ILexiMOn…
Interlingual Lexical Unit Classes may be formally defined in the ILexicOn…
…by supporting the projection of their semantic decomposition on themselves
42.
Three layers
[Figure: three-layer architecture diagram]
Layers: OWL; core-ILexiMOn layer; ILexicOn layer; Data-layer
Nodes: owl:Class, owl:ObjectProperty, xsd:boolean, owl:intersectionOf, owl:unionOf, owl:propertyChainAxiom, owl:hasSelf, :ILexicalUnit, :ILexicalPrimitive, :ISemanticRelation, :onISemanticRelation, :allValuesFrom, :isObligatory, :Entity, :hasEntity, :State, :Person, :Alive, true, :Mary01, :Alive01
Edge labels: is-a, subClassOf, intersectionOf, range, hasEntity
Legend: A is an instance of B; A is a subClass of B; A is the intersection of B and C; A is linked to B through property p
61. Conceptual participants
Named ConP slots
(an infinite number of) specialized semantic relations
ConP inheritance
ConP partial inheritance
ConP composition
ConP merging
Optional / obligatory ConP
Relation between two ConPs
Formal definitions of ILUcs
62.
The three layers
[Figure: the three-layer architecture diagram of slide 42, shown again]
63. ULiS: A Universal Linguistic System
- to redesign Pivot-based NLP techniques
- 100% using the Semantic Web Formalisms
- compliant with the Meaning-Text Theory
ILexicOn: The Interlingual Lexical Ontology
- formal lexicographic definitions
- Lexical functions (Fin + named ConP slots => interesting)
- Populate
- SLexicOn
+ Perspectives…
Thank You
Maxime Lefrançois & Fabien Gandon
Edelweiss – INRIA Sophia-Antipolis – France
{Maxime.Lefrancois|Fabien.Gandon}@inria.fr
This presentation on
http://maxime-lefrancois.info