SlideShare une entreprise Scribd logo
1  sur  66
Télécharger pour lire hors ligne
Fueling the future with 
Semantic Web Patterns 
Valentina Presutti! 
STLab Institute of Cognitive Sciences and Technologies, CNR, Rome (IT)! 
! 
WOP 2014, October 19th, Riva del Garda (IT)!
Outline 
• Can we implement the original Semantic Web scenario? 
• Knowledge sources heterogeneity problem 
• Semantic alignment at pattern level 
• Knowledge Patterns as key elements 
• Some STLab results on KP-based knowledge extraction 
• A possible research direction to pattern alignment 
2 
• Conclusion
What’s the message? 
Knowledge Patterns are a wormhole in 
the Web to knowledge interpretation and 
understanding 
3
We all want a Personal Assistant Robot! 
Answering our 
Giving opinion questions 
on facts and 
things Providing 
guidelines for 
procedures 
Solving our 
problems Planning and 
reminding our 
schedule 
WOODY 4
WOODY 
“Pete and Lucy could use their agents to carry 
out all these tasks thanks not to the World Wide 
Web of today but rather the Semantic Web that 
it will evolve into tomorrow.” 
–Tim Berners-Lee, James Hendler and Ora Lassila, 2001 
5
Today is 13 years later 
How would we implement it? 6
Background knowledge 
7
Background knowledge 
We want WOODY to read and understand 
background knowledge and use it in a smart way 
Heterogeneity 
! 
Structured and Unstructured data 
Syntactic and Semantic introperability 
8
Heterogeneity 
Syntactic interoperability 
• To unify the format of 
knowledge sources 
enabling e.g. distributed 
query 
Tom Heath, Christian Bizer: Linked Data: Evolving the Web into a Global Data Space. Synthesis Lectures on the 
Semantic Web, Morgan & Claypool Publishers 2011
Semantic interoperability 
• Making sense of distributed 
data 
• Enabling their automatic 
interpretation 
• Different semantic 
perspectives must be 
addressed 
10 
Heterogeneity
Semantic interoperability 
An ontology is a formal 
specification of a shared 
conceptualisation 
11 
Heterogeneity 
This definition is valid for any Semantic Web 
knowledge resource
Semantic interoperability: 
formal specification 
• Shared knowledge 
representation language 
• Semantic interoperability to 
the extent of its formal 
semantics 
12 
rdfs:subClassOf 
owl:sameAs 
rdfs:subPropertyOf 
owl:equivalentProperty 
owl:equivalentClass
Semantic interoperability: 
conceptualisation 
• We have to cope with 
knowledge sources 
conceptualisations 
• Aligning knowledge sources 
at a conceptual level 
formal specification 
13 
knowledge representation 
cognition 
conceptualisation
Semantic alignment
Semantic alignment 1+2+3 
• One-by-one alignment 
of classes, properties 
and individuals 
Xianpei Han, Le Sun, Jun Zhao: Collective entity linking in web text: a graph-based method, Proceedings of SIGIR 2011, ACM. 
Euzenat, Jérôme, Shvaiko, Pavel: Ontology Matching 2nd ed. 2013, Springer.
Semantic alignment 1+2+3 
• Alignment to foundational 
theories, e.g. DOLCE 
• They provide a universal 
reference framework from 
which to derive all 
possible consequences, 
inferences, errors. 
• Assumption: foundational 
theory axioms always hold 
dul:Agent! 
dul:NaturalPerson 
Daniel Oberle et al., DOLCE ergo SUMO: On foundational and domain models in the SmartWeb Integrated Ontology (SWIntO). J. Web Sem. 5(3): 156-174 (2007) 
Aldo Gangemi, Nicola Guarino, Claudio Masolo, Alessandro Oltramari, Luc Schneider: Sweetening Ontologies with DOLCE. EKAW 2002: 166-181 
Prateek Jain et al.: Contextual Ontology Alignment of LOD with an Upper Ontology: A Case Study with Proton 
Smith B, Rosse C.: The role of foundational relations in the alignment of biomedical ontologies. Stud Health Technol Inform. 2004;107(Pt 1):444-8
Semantic alignment 1+2+3 
• They provide a decontextualized view on data 
• It is not enough for contextualized interoperability: 
making sense of data for a certain interactive/ 
cognitive task 
17 
Alignment one-by-one 
Alignment to 
foundational theories
Imagine we are interested in comparing the governors of California based 
on the laws they created. 
18
Imagine we are interested in comparing the governors of California based 
on the laws they created. 
18 
one-by-one 
one-by-one 
one-by-one 
one-by-one 
one-by-one 
one-by-one
Imagine we are interested in comparing the governors of California based 
on the laws they created. 
In order to select the information that are relevant for performing our task we need to 
extract only those facts that are framed by certain political concepts and relations. 
18 
one-by-one 
one-by-one 
one-by-one 
one-by-one 
one-by-one 
one-by-one
The boundary problem 
ex:law_dp_CA_2010 rdf:type ex:Law 
ex:law_dp_CA_2010 ex:creator dbpedia:Arnold_Schwarzenegger 
ex:law_dp_CA_2010 ex:jurisdiction dbpedia:California 
ex:law_dp_CA_2010 ex:name ex:drug_policy_CA_2010 
ex:law_dp_CA_2010 ex:creationTime ^^xsd:date:2010 
ex:law_dp_CA_2010 ex:forbidden “marijuana possession of up to one ounce” 
lmdb:Terminator rdf:type lmdb:film 
lmdb:Terminator lmdb:actor dbpedia:Arnold_Schwarzenegger 
lmdb:Terminator lmdb:date ^^xsd:date:1984 
lmdb:Terminator lmdb:directordbpedia:James_Cameron 
lmdb:Terminator lmdb:sequel dbpedia:Terminator_2 
dbpedia:Arnold_Schwarzenegger rdf:type dbpedia-owl:Office_Holder 
dbpedia:Arnold_Schwarzenegger dbpprop:predecessor dbpedia:Lee_Haney 
dbpedia:California_foie_gras_law dbpprop:governor dbpedia:Arnold_Schwarzenegger 
Aldo Gangemi, Valentina Presutti: Towards a pattern science for the Semantic Web. Semantic Web 1(1-2): 61-68 (2010)
The boundary problem 
ex:law_dp_CA_2010 rdf:type ex:Law 
ex:law_dp_CA_2010 ex:creator dbpedia:Arnold_Schwarzenegger 
ex:law_dp_CA_2010 ex:jurisdiction dbpedia:California 
ex:law_dp_CA_2010 ex:name ex:drug_policy_CA_2010 
ex:law_dp_CA_2010 ex:creationTime ^^xsd:date:2010 
ex:law_dp_CA_2010 ex:forbidden “marijuana possession of up to one ounce” 
similar 
lmdb:Terminator rdf:type lmdb:film 
lmdb:Terminator lmdb:actor dbpedia:Arnold_Schwarzenegger 
lmdb:Terminator lmdb:date ^^xsd:date:1984 
lmdb:Terminator lmdb:directordbpedia:James_Cameron 
lmdb:Terminator lmdb:sequel dbpedia:Terminator_2 
dbpedia:Arnold_Schwarzenegger rdf:type dbpedia-owl:Office_Holder 
dbpedia:Arnold_Schwarzenegger dbpprop:predecessor dbpedia:Lee_Haney 
dbpedia:California_foie_gras_law dbpprop:governor dbpedia:Arnold_Schwarzenegger 
Aldo Gangemi, Valentina Presutti: Towards a pattern science for the Semantic Web. Semantic Web 1(1-2): 61-68 (2010)
Semantic alignment 1+2+3 
• We need interoperability at the level of groups of 
relations that together identify specific 
interpretational contexts! 
• We need local reference theories defining 
conceptual boundaries -> Knowledge Patterns* 
20 *(cf. Gangemi&Presutti, 2010)
Patterns are present in 
the (Semantic) Web 
domain
Administrative 
frames 
Geographic 
frames 
Communication 
22 
frames 
DBpedia
Top-down resources 
• Linguistic resources: FrameNet, 
VerbNet, Corpus Pattern Analysis 
• Ontology Design Patterns 
(Content Patterns) 
• EarthCube content patterns 
• Component Library 
• Cyc micro theories 
• Data model patterns (David C. 
Hay) 
• Infobox templates, microformats 
23 
All of them define patterns that 
provide conceptual context for 
representing data
Knowledge extraction 
methods 
• Entity Linking based on 
key discovery (almost-key 
discovery*) 
• Data/graph mining: 
frequent itemset/ 
subgraphs, anomalies 
• NLP: frame detection, 
event extraction 
* Danai Symeonidou: Automatic key discovery for Data Linking, PhD Thesis, 2014. 
24 
They all mine data looking 
for patterns that allow to 
make sense of it.
KP hypothesis 
Independently of the specific data structure or 
knowledge representation format, certain patterns 
share a same intensional meaning 
25
Three heterogeneous knowledge sources (different data structures, different format), 
but sharing the same intensional meaning i.e. describing a cooking situation 
26
Three heterogeneous knowledge sources (different data structures, different format), 
but sharing the same intensional meaning i.e. describing a cooking situation 
26 
Knowledge 
Pattern
Three heterogeneous knowledge sources (different data structures, different format), but 
sharing the same intensional meaning i.e. modelling of a cooking situation 
27
Three heterogeneous knowledge sources (different data structures, different format), but 
sharing the same intensional meaning i.e. modelling of a cooking situation 
27 
Knowledge 
Pattern
Cognitive foundations of KPs 
• People tend to remember items that fit into a 
schema (cf. Bartlett and a lot of CS from then) 
• In particular, schemas that are associated with 
some functional similarity (cf. Gibson’s 
affordances) 
• Schema similar to (conceptual) frame, script, 
knowledge pattern 
28
How to represent KPs 
• Class or property punning (with KP description) 
• Property domain/range axiom punning (with KP roles) 
• Typed named graphs 
• OWL ontology modules (cf. ODP) 
• SPARQL query patterns, SPIN patterns 
• hasKey patterns 
29
Pattern alignment 
30 
Peter Clark’s KP morphisms 
Content Pattern specialisation 
Dedre Gentner’s analogical 
structure mapping
Pattern alignment 
31 
Investigating the 
application of similarity 
measures to complex 
structures 
vector spaces, graph 
matching, structure 
matching, etc.
Pattern alignment 
• Network alignment (cf. 
Roded Sharan*) 
! 
• Modular structure of 
conserved clusters among 
yeast, worm, and fly 
! 
• Multiple network alignment 
revealed 183 conserved 
clusters. 
*Roded Sharan et al.: Conserved patterns of protein interaction in multiple species, Pnas, 2005. 
32
Some results at STLab 
on KP-based KE
Content Ontology Patterns 
http://www.ontologydesignpatterns.org 
34
Pattern-based Ontology Design 
35 
eXtreme Design 
Including patterns in ontologies 
by design
Schema induction of linked datasets based on patterns. 
Patterns are built around central concepts and used for automatic design of SPARQL queries 
Centrality discovery in datasets 
mo:Track 
mo:track 
mo:MusicArtist 
mo:Playlist 
mo:Torrent 
tags:taggedWithTag 
tags:Tag 
mo:Record 
foaf:maker 
mo:image dc:date 
rdfs:Literal 
dc:title 
dc:description 
mo:available_as 
mo:available_as 
mo:available_as 
Valentina Presutti, Lora Aroyo, Alessandro Adamou, Balthasar 
Schopman, Aldo Gangemi, Guus Schreiber: Extracting Core 
Knowledge from Linked Data. COLD2011, CEUR-WS.org Vol-782. 
36
Encyclopedic Knowledge 
Patterns: example 
• An Encyclopedic Knowledge Pattern (EKP) is discovered from the 
paths emerging from Wikipedia page link structure 
• They are represented as OWL2 ontologies 
Andrea Giovanni Nuzzolese, Aldo Gangemi, Valentina Presutti, Paolo Ciancarini: Encyclopedic Knowledge 
Patterns from Wikipedia Links. International Semantic Web Conference (1) 2011: 520-536 
37
Using Encyclopedic Knolwedge Patterns for browsing Wikipedia 
Serendipity in exploratory browsing 
http://www.aemoo.org 
Andrea Giovanni Nuzzolese, Valentina Presutti, Aldo Gangemi, Alberto Musetti, Paolo 
Ciancarini: Aemoo: exploring knowledge on the web. WebSci 2013: 272-275 
Aemoo: exploratory search based on EKP - Semantic Web 
Challenge @ISWC 2011 – Short listed, 4th place 
38
KP-based machine reading with FRED 
39 
http://wit.istc.cnr.it/stlab-tools/fred/ 
Valentina Presutti, Francesco Draicchio, Aldo Gangemi: Knowledge Extraction Based on 
Discourse Representation Theory and Linguistic Frames. EKAW 2012: 114-129
KP-based machine reading with FRED 
http://wit.istc.cnr.it/stlab-tools/fred/ 
The New York Times reported that John McCarthy 
died. He invented the programming language LISP. 
From natural language to linked data graphs, which are 
designed including event- and frame-based patterns 
40
Relation discovery and property generation 
http://wit.istc.cnr.it/kore-dev/legalo 
41 
f-measure=.83 
Exploiting event- and frame-based 
patterns for relation discovery 
Valentina Presutti et al. Uncovering the semantics of 
Wikipedia pagelinks. EKAW 2014.
Overimposing sentic frames on event- and frame-based linked 
data graphs representing opinions, for sentiment analysis 
Sentic frames from text 
http://wit.istc.cnr.it/stlab-tools/sentilo 
42
Overimposing sentic frames on event- and frame-based linked 
data graphs representing opinions, for sentiment analysis 
Sentic frames from text 
http://wit.istc.cnr.it/stlab-tools/sentilo 
42
Overimposing sentic frames on event- and frame-based linked 
data graphs representing opinions, for sentiment analysis 
Sentic frames from text 
http://wit.istc.cnr.it/stlab-tools/sentilo 
42
• Hybridisation is the common factor of these 
methods 
• Still far from solving the pattern alignment problem 
• KP-based design of knowledge sources can 
support easier procedure for pattern alignment 
43
Back to pattern 
alignment
KP hypothesis 
45 
Independently of the 
specific data structure or 
knowledge representation 
format, certain patterns 
share a same intensional 
meaning
Building a KP distributed system 
Event extraction Events 
46 
Ontology Matching 
Social Network 
Analysis 
Frame detection 
Leveraging different techniques 
for knowledge extraction 
Data Mining 
Graph Mining 
Rules 
Correspondence 
patterns 
Unusual records 
Frames 
Association rules 
Frequent 
subgraphs 
Anomalies 
Frequent itemset 
Unifying their results by 
representing them as KPs 
KP distributed system 
The KP system starts with potentially approximate and incomplete 
patterns and evolves to become more and more robust and 
accurate thanks to continuous feedback
Knowledge pattern system 
• Inspired by Minsky’s 
frame-systems 
• Statistical methods 
can help to identify 
relations between 
KPs: 
• co-occurrence, 
causality, 
triggering, etc. 
47 
KPs 
KPs 
KPs 
KPs 
KPs 
KPs 
KPs
Knowledge pattern system 
• Inspired by Minsky’s 
frame-systems 
• Statistical methods 
can help to identify 
relations between 
KPs: 
• co-occurrence, 
causality, 
triggering, etc. 
47 
KPs 
KPs 
KPs 
KPs 
KPs 
KPs 
KPs
A reviewing complaint case 
• Imagine someone gets a paper rejection … 
• … and comments on Facebook …
If we want to enable smart reasoning on 
heterogeneous sources we need a way to relate data 
like this paper’s review with this FB status
KP entailment 
E.g. Patrick Pantel’s “Verb Ocean” 
reject [can-result-in] argue :: 11.634112 
fn:Respond_to_proposal vo:can-result-in fn:Quarreling
reject ⊑ Respond_to_proposal argue ⊑ Quarreling 
x ∈ Interlocutor.respond_to_proposal 
y ∈ Speaker.respond_to_proposal 
z ∈ Proposal.respond_to_proposal 
k ∈ Arguer1.quarreling 
m ∈ Arguer2.quarreling 
n ∈ Issue.quarreling 
= 
= 
≈ 
⊢ 
reject(r,x,y,z,…) entails argue(s,k,m,n,…)
However… 
• Automatic methods 
are never 100% 
accurate 
• Regularities can 
emerge for statistical 
significance even if 
they are not relevant 
• We need procedure 
and metrics for 
validating KPs 
http://tylervigen.com/ 
52
Patterns vs KP 
• A pattern is a motivated structure that is proposed 
by experts or emerges from inductive methods 
• A KP formalises the intensional description of a 
class of situations, events, cases, etc. 
• When a proposed or emerging pattern is a KP? 
• Real data are dirty: spurious correlations 
• How to single out spurious ones?
“Human is the measure of all things.” 
–Protagoras, ~450 B.C. 
54
We need humans in the cycle 
55 
K KP 
K 
K 
K 
K 
K 
Correspondence 
patterns 
Unusual records 
Frames 
Association rules 
Frequent 
subgraphs 
Anomalies 
Frequent itemset 
Events 
Ontology Matching 
Social Network 
Analysis 
Frame detection 
Data Mining 
Graph Mining 
Rules 
Event extraction 
Crowdsourcing 
methods
We need humans in the cycle 
55 
K KP 
K 
K 
K 
K 
K 
Correspondence 
patterns 
Unusual records 
Frames 
Association rules 
Frequent 
subgraphs 
Anomalies 
Frequent itemset 
Events 
Ontology Matching 
Social Network 
Analysis 
Frame detection 
Data Mining 
Graph Mining 
Rules 
Event extraction 
Crowdsourcing 
methods 
Marco Fossati, Claudio Giuliano, Sara Tonelli: Outsourcing 
FrameNet to the Crowd. ACL (2) 2013: 742-747 
VideoGames with a purpose applied to semantic tasks 
http://knowledgeforge.org/, Roberto Navigli
Conclusion 
• We are less than half-way for implementing the original Semantic Web scenario 
• A significant step ahead is introducing semantic interoperability at pattern level 
• This requires the hybridisation of knowledge extraction methods as well as the 
reconciliation of patterns having different provenance (data mining, graph 
mining, ontology patterns, etc.) 
• Knowledge Patterns are key element for enabling such hybridisation 
• Knowledge Patterns should be organised as a distributed linked system where 
links are relations enabling smart reasoning 
• A distributed KP system is a resource evolving by a feeding cycle, which 
includes human computation 
56
Special thanks to: 
Aldo Gangemi, Malvina Nissim, Misael Mongiovì, Claudia d’Amato for their help 
and inspiring discussions.

Contenu connexe

Tendances

M1. sem web & ontology introd
M1. sem web & ontology introdM1. sem web & ontology introd
M1. sem web & ontology introd
Michele Missikoff
 
Riding The Semantic Wave
Riding The Semantic WaveRiding The Semantic Wave
Riding The Semantic Wave
Kaniska Mandal
 
Project Proposal Topics Modeling (Ir)
Project Proposal    Topics Modeling (Ir)Project Proposal    Topics Modeling (Ir)
Project Proposal Topics Modeling (Ir)
Svitlana volkova
 
Semantics based Summarization of Entities in Knowledge Graphs
Semantics based Summarization of Entities in Knowledge GraphsSemantics based Summarization of Entities in Knowledge Graphs
Semantics based Summarization of Entities in Knowledge Graphs
Artificial Intelligence Institute at UofSC
 
Mdst3705 2013-02-19-text-into-data
Mdst3705 2013-02-19-text-into-dataMdst3705 2013-02-19-text-into-data
Mdst3705 2013-02-19-text-into-data
Rafael Alvarado
 

Tendances (19)

M1. sem web & ontology introd
M1. sem web & ontology introdM1. sem web & ontology introd
M1. sem web & ontology introd
 
Ontologies and the humanities: some issues affecting the design of digital in...
Ontologies and the humanities: some issues affecting the design of digital in...Ontologies and the humanities: some issues affecting the design of digital in...
Ontologies and the humanities: some issues affecting the design of digital in...
 
Riding The Semantic Wave
Riding The Semantic WaveRiding The Semantic Wave
Riding The Semantic Wave
 
Different Semantic Perspectives for Question Answering Systems
Different Semantic Perspectives for Question Answering SystemsDifferent Semantic Perspectives for Question Answering Systems
Different Semantic Perspectives for Question Answering Systems
 
Intelligent expert systems for location planning
Intelligent expert systems for location planningIntelligent expert systems for location planning
Intelligent expert systems for location planning
 
Knowledge Extraction and Linked Data: Playing with Frames
Knowledge Extraction and Linked Data: Playing with FramesKnowledge Extraction and Linked Data: Playing with Frames
Knowledge Extraction and Linked Data: Playing with Frames
 
Project Proposal Topics Modeling (Ir)
Project Proposal    Topics Modeling (Ir)Project Proposal    Topics Modeling (Ir)
Project Proposal Topics Modeling (Ir)
 
Semantics based Summarization of Entities in Knowledge Graphs
Semantics based Summarization of Entities in Knowledge GraphsSemantics based Summarization of Entities in Knowledge Graphs
Semantics based Summarization of Entities in Knowledge Graphs
 
Introduction to Ontology Concepts and Terminology
Introduction to Ontology Concepts and TerminologyIntroduction to Ontology Concepts and Terminology
Introduction to Ontology Concepts and Terminology
 
Context, Perspective, and Generalities in a Knowledge Ontology
Context, Perspective, and Generalities in a Knowledge OntologyContext, Perspective, and Generalities in a Knowledge Ontology
Context, Perspective, and Generalities in a Knowledge Ontology
 
The technical case for a semantic web
The technical case for a semantic webThe technical case for a semantic web
The technical case for a semantic web
 
PhD thesis defense of Christopher Thomas
PhD thesis defense of Christopher ThomasPhD thesis defense of Christopher Thomas
PhD thesis defense of Christopher Thomas
 
SemTecBiz 2012: Corporate Semantic Web
SemTecBiz 2012: Corporate Semantic WebSemTecBiz 2012: Corporate Semantic Web
SemTecBiz 2012: Corporate Semantic Web
 
"Mass Surveillance" through Distant Reading
"Mass Surveillance" through Distant Reading"Mass Surveillance" through Distant Reading
"Mass Surveillance" through Distant Reading
 
Question Answering over Linked Data (Reasoning Web Summer School)
Question Answering over Linked Data (Reasoning Web Summer School)Question Answering over Linked Data (Reasoning Web Summer School)
Question Answering over Linked Data (Reasoning Web Summer School)
 
Semantic engagement handouts
Semantic engagement handoutsSemantic engagement handouts
Semantic engagement handouts
 
Mdst3705 2013-02-19-text-into-data
Mdst3705 2013-02-19-text-into-dataMdst3705 2013-02-19-text-into-data
Mdst3705 2013-02-19-text-into-data
 
Semantic engagement
Semantic engagementSemantic engagement
Semantic engagement
 
Lecture: Semantic Word Clouds
Lecture: Semantic Word CloudsLecture: Semantic Word Clouds
Lecture: Semantic Word Clouds
 

Similaire à Fueling the future with Semantic Web patterns - Keynote at WOP2014@ISWC

Schema-agnositc queries over large-schema databases: a distributional semanti...
Schema-agnositc queries over large-schema databases: a distributional semanti...Schema-agnositc queries over large-schema databases: a distributional semanti...
Schema-agnositc queries over large-schema databases: a distributional semanti...
Andre Freitas
 
How To Make Linked Data More than Data
How To Make Linked Data More than DataHow To Make Linked Data More than Data
How To Make Linked Data More than Data
Amit Sheth
 
An Introduction to Onological Modeling
An Introduction to Onological ModelingAn Introduction to Onological Modeling
An Introduction to Onological Modeling
Amanda L. Goodman
 
Searching for patterns in crowdsourced information
Searching for patterns in crowdsourced informationSearching for patterns in crowdsourced information
Searching for patterns in crowdsourced information
Silvia Puglisi
 
A semantic framework and software design to enable the transparent integratio...
A semantic framework and software design to enable the transparent integratio...A semantic framework and software design to enable the transparent integratio...
A semantic framework and software design to enable the transparent integratio...
Patricia Tavares Boralli
 

Similaire à Fueling the future with Semantic Web patterns - Keynote at WOP2014@ISWC (20)

Looking for Commonsense in the Semantic Web
Looking for Commonsense in the Semantic WebLooking for Commonsense in the Semantic Web
Looking for Commonsense in the Semantic Web
 
Objective Fiction, i-semantics keynote
Objective Fiction, i-semantics keynoteObjective Fiction, i-semantics keynote
Objective Fiction, i-semantics keynote
 
Information Quality in the Web Era
Information Quality in the Web EraInformation Quality in the Web Era
Information Quality in the Web Era
 
Schema-agnositc queries over large-schema databases: a distributional semanti...
Schema-agnositc queries over large-schema databases: a distributional semanti...Schema-agnositc queries over large-schema databases: a distributional semanti...
Schema-agnositc queries over large-schema databases: a distributional semanti...
 
ESWC 2014 Tutorial part 3
ESWC 2014 Tutorial part 3ESWC 2014 Tutorial part 3
ESWC 2014 Tutorial part 3
 
How To Make Linked Data More than Data
How To Make Linked Data More than DataHow To Make Linked Data More than Data
How To Make Linked Data More than Data
 
How To Make Linked Data More than Data
How To Make Linked Data More than DataHow To Make Linked Data More than Data
How To Make Linked Data More than Data
 
Semantic technology in nutshell 2013. Semantic! are you a linguist?
Semantic technology in nutshell 2013. Semantic! are you a linguist?Semantic technology in nutshell 2013. Semantic! are you a linguist?
Semantic technology in nutshell 2013. Semantic! are you a linguist?
 
The Unreasonable Effectiveness of Metadata
The Unreasonable Effectiveness of MetadataThe Unreasonable Effectiveness of Metadata
The Unreasonable Effectiveness of Metadata
 
What is What, When?
What is What, When?What is What, When?
What is What, When?
 
Presentation at MTSR 2012
Presentation at MTSR 2012Presentation at MTSR 2012
Presentation at MTSR 2012
 
An Introduction to Onological Modeling
An Introduction to Onological ModelingAn Introduction to Onological Modeling
An Introduction to Onological Modeling
 
Searching for patterns in crowdsourced information
Searching for patterns in crowdsourced informationSearching for patterns in crowdsourced information
Searching for patterns in crowdsourced information
 
How to model digital objects within the semantic web
How to model digital objects within the semantic webHow to model digital objects within the semantic web
How to model digital objects within the semantic web
 
Knowledge Representation on the Web
Knowledge Representation on the WebKnowledge Representation on the Web
Knowledge Representation on the Web
 
ESWC 2011 BLOOMS+
ESWC 2011 BLOOMS+ ESWC 2011 BLOOMS+
ESWC 2011 BLOOMS+
 
A semantic framework and software design to enable the transparent integratio...
A semantic framework and software design to enable the transparent integratio...A semantic framework and software design to enable the transparent integratio...
A semantic framework and software design to enable the transparent integratio...
 
Ontology
OntologyOntology
Ontology
 
Automatically converting tabular data to
Automatically converting tabular data toAutomatically converting tabular data to
Automatically converting tabular data to
 
Explanations in Dialogue Systems through Uncertain RDF Knowledge Bases
Explanations in Dialogue Systems through Uncertain RDF Knowledge BasesExplanations in Dialogue Systems through Uncertain RDF Knowledge Bases
Explanations in Dialogue Systems through Uncertain RDF Knowledge Bases
 

Plus de Valentina Presutti

Plus de Valentina Presutti (6)

Building the ArCo knowledge graph: process, experience and struggle with exis...
Building the ArCo knowledge graph: process, experience and struggle with exis...Building the ArCo knowledge graph: process, experience and struggle with exis...
Building the ArCo knowledge graph: process, experience and struggle with exis...
 
ArCo: the Knowledge Graph of Italian Cultural Heritage
ArCo: the Knowledge Graph of Italian Cultural HeritageArCo: the Knowledge Graph of Italian Cultural Heritage
ArCo: the Knowledge Graph of Italian Cultural Heritage
 
Frame-based Sentiment Analysis with Sentilo
Frame-based Sentiment Analysis with SentiloFrame-based Sentiment Analysis with Sentilo
Frame-based Sentiment Analysis with Sentilo
 
Fred sw jpaper2017
Fred sw jpaper2017Fred sw jpaper2017
Fred sw jpaper2017
 
Using cognitive tools in robots dealing with people with dementia
Using cognitive tools in robots dealing with people with dementiaUsing cognitive tools in robots dealing with people with dementia
Using cognitive tools in robots dealing with people with dementia
 
Methods for Ontology Design Patterns reuse
Methods for Ontology Design Patterns reuseMethods for Ontology Design Patterns reuse
Methods for Ontology Design Patterns reuse
 

Dernier

biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
1301aanya
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Sérgio Sacani
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
MohamedFarag457087
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
NazaninKarimi6
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
Silpa
 
LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.
Silpa
 
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Silpa
 

Dernier (20)

Atp synthase , Atp synthase complex 1 to 4.
Atp synthase , Atp synthase complex 1 to 4.Atp synthase , Atp synthase complex 1 to 4.
Atp synthase , Atp synthase complex 1 to 4.
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
 
Use of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxUse of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptx
 
Genetics and epigenetics of ADHD and comorbid conditions
Genetics and epigenetics of ADHD and comorbid conditionsGenetics and epigenetics of ADHD and comorbid conditions
Genetics and epigenetics of ADHD and comorbid conditions
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdf
 
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
 
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRLGwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
 
Role of AI in seed science Predictive modelling and Beyond.pptx
Role of AI in seed science  Predictive modelling and  Beyond.pptxRole of AI in seed science  Predictive modelling and  Beyond.pptx
Role of AI in seed science Predictive modelling and Beyond.pptx
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
 
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptxClimate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
 
LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.
 
Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.
 
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
 

Fueling the future with Semantic Web patterns - Keynote at WOP2014@ISWC

  • 1. Fueling the future with Semantic Web Patterns Valentina Presutti! STLab Institute of Cognitive Sciences and Technologies, CNR, Rome (IT)! ! WOP 2014, October 19th, Riva del Garda (IT)!
  • 2. Outline • Can we implement the original Semantic Web scenario? • Knowledge sources heterogeneity problem • Semantic alignment at pattern level • Knowledge Patterns as key elements • Some STLab results on KP-based knowledge extraction • A possible research direction to pattern alignment 2 • Conclusion
  • 3. What’s the message? Knowledge Patterns are a wormhole in the Web to knowledge interpretation and understanding 3
  • 4. We all want a Personal Assistant Robot! Answering our Giving opinion questions on facts and things Providing guidelines for procedures Solving our problems Planning and reminding our schedule WOODY 4
  • 5. WOODY “Pete and Lucy could use their agents to carry out all these tasks thanks not to the World Wide Web of today but rather the Semantic Web that it will evolve into tomorrow.” –Tim Berners-Lee, James Hendler and Ora Lassila, 2001 5
  • 6. Today is 13 years later How would we implement it? 6
  • 8. Background knowledge We want WOODY to read and understand background knowledge and use it in a smart way Heterogeneity ! Structured and Unstructured data Syntactic and Semantic introperability 8
  • 9. Heterogeneity Syntactic interoperability • To unify the format of knowledge sources enabling e.g. distributed query Tom Heath, Christian Bizer: Linked Data: Evolving the Web into a Global Data Space. Synthesis Lectures on the Semantic Web, Morgan & Claypool Publishers 2011
  • 10. Semantic interoperability • Making sense of distributed data • Enabling their automatic interpretation • Different semantic perspectives must be addressed 10 Heterogeneity
  • 11. Semantic interoperability An ontology is a formal specification of a shared conceptualisation 11 Heterogeneity This definition is valid for any Semantic Web knowledge resource
  • 12. Semantic interoperability: formal specification • Shared knowledge representation language • Semantic interoperability to the extent of its formal semantics 12 rdfs:subClassOf owl:sameAs rdfs:subPropertyOf owl:equivalentProperty owl:equivalentClass
  • 13. Semantic interoperability: conceptualisation • We have to cope with knowledge sources conceptualisations • Aligning knowledge sources at a conceptual level formal specification 13 knowledge representation cognition conceptualisation
  • 15. Semantic alignment 1+2+3 • One-by-one alignment of classes, properties and individuals Xianpei Han, Le Sun, Jun Zhao: Collective entity linking in web text: a graph-based method, Proceedings of SIGIR 2011, ACM. Euzenat, Jérôme, Shvaiko, Pavel: Ontology Matching 2nd ed. 2013, Springer.
  • 16. Semantic alignment 1+2+3 • Alignment to foundational theories, e.g. DOLCE • They provide a universal reference framework from which to derive all possible consequences, inferences, errors. • Assumption: foundational theory axioms always hold dul:Agent! dul:NaturalPerson Daniel Oberle et al., DOLCE ergo SUMO: On foundational and domain models in the SmartWeb Integrated Ontology (SWIntO). J. Web Sem. 5(3): 156-174 (2007) Aldo Gangemi, Nicola Guarino, Claudio Masolo, Alessandro Oltramari, Luc Schneider: Sweetening Ontologies with DOLCE. EKAW 2002: 166-181 Prateek Jain et al.: Contextual Ontology Alignment of LOD with an Upper Ontology: A Case Study with Proton Smith B, Rosse C.: The role of foundational relations in the alignment of biomedical ontologies. Stud Health Technol Inform. 2004;107(Pt 1):444-8
  • 17. Semantic alignment 1+2+3 • They provide a decontextualized view on data • It is not enough for contextualized interoperability: making sense of data for a certain interactive/ cognitive task 17 Alignment one-by-one Alignment to foundational theories
  • 18. Imagine we are interested in comparing the governors of California based on the laws they created. 18
  • 19. Imagine we are interested in comparing the governors of California based on the laws they created. 18 one-by-one one-by-one one-by-one one-by-one one-by-one one-by-one
  • 20. Imagine we are interested in comparing the governors of California based on the laws they created. In order to select the information that are relevant for performing our task we need to extract only those facts that are framed by certain political concepts and relations. 18 one-by-one one-by-one one-by-one one-by-one one-by-one one-by-one
  • 21. The boundary problem ex:law_dp_CA_2010 rdf:type ex:Law ex:law_dp_CA_2010 ex:creator dbpedia:Arnold_Schwarzenegger ex:law_dp_CA_2010 ex:jurisdiction dbpedia:California ex:law_dp_CA_2010 ex:name ex:drug_policy_CA_2010 ex:law_dp_CA_2010 ex:creationTime ^^xsd:date:2010 ex:law_dp_CA_2010 ex:forbidden “marijuana possession of up to one ounce” lmdb:Terminator rdf:type lmdb:film lmdb:Terminator lmdb:actor dbpedia:Arnold_Schwarzenegger lmdb:Terminator lmdb:date ^^xsd:date:1984 lmdb:Terminator lmdb:directordbpedia:James_Cameron lmdb:Terminator lmdb:sequel dbpedia:Terminator_2 dbpedia:Arnold_Schwarzenegger rdf:type dbpedia-owl:Office_Holder dbpedia:Arnold_Schwarzenegger dbpprop:predecessor dbpedia:Lee_Haney dbpedia:California_foie_gras_law dbpprop:governor dbpedia:Arnold_Schwarzenegger Aldo Gangemi, Valentina Presutti: Towards a pattern science for the Semantic Web. Semantic Web 1(1-2): 61-68 (2010)
  • 22. The boundary problem ex:law_dp_CA_2010 rdf:type ex:Law ex:law_dp_CA_2010 ex:creator dbpedia:Arnold_Schwarzenegger ex:law_dp_CA_2010 ex:jurisdiction dbpedia:California ex:law_dp_CA_2010 ex:name ex:drug_policy_CA_2010 ex:law_dp_CA_2010 ex:creationTime ^^xsd:date:2010 ex:law_dp_CA_2010 ex:forbidden “marijuana possession of up to one ounce” similar lmdb:Terminator rdf:type lmdb:film lmdb:Terminator lmdb:actor dbpedia:Arnold_Schwarzenegger lmdb:Terminator lmdb:date ^^xsd:date:1984 lmdb:Terminator lmdb:directordbpedia:James_Cameron lmdb:Terminator lmdb:sequel dbpedia:Terminator_2 dbpedia:Arnold_Schwarzenegger rdf:type dbpedia-owl:Office_Holder dbpedia:Arnold_Schwarzenegger dbpprop:predecessor dbpedia:Lee_Haney dbpedia:California_foie_gras_law dbpprop:governor dbpedia:Arnold_Schwarzenegger Aldo Gangemi, Valentina Presutti: Towards a pattern science for the Semantic Web. Semantic Web 1(1-2): 61-68 (2010)
  • 23. Semantic alignment 1+2+3 • We need interoperability at the level of groups of relations that together identify specific interpretational contexts! • We need local reference theories defining conceptual boundaries -> Knowledge Patterns* 20 *(cf. Gangemi&Presutti, 2010)
  • 24. Patterns are present in the (Semantic) Web domain
  • 25. Administrative frames Geographic frames Communication 22 frames DBpedia
  • 26. Top-down resources • Linguistic resources: FrameNet, VerbNet, Corpus Pattern Analysis • Ontology Design Patterns (Content Patterns) • EarthCube content patterns • Component Library • Cyc micro theories • Data model patterns (David C. Hay) • Infobox templates, microformats 23 All of them define patterns that provide conceptual context for representing data
  • 27. Knowledge extraction methods • Entity Linking based on key discovery (almost-key discovery*) • Data/graph mining: frequent itemset/ subgraphs, anomalies • NLP: frame detection, event extraction * Danai Symeonidou: Automatic key discovery for Data Linking, PhD Thesis, 2014. 24 They all mine data looking for patterns that allow to make sense of it.
  • 28. KP hypothesis Independently of the specific data structure or knowledge representation format, certain patterns share a same intensional meaning 25
  • 29. Three heterogeneous knowledge sources (different data structures, different format), but sharing the same intensional meaning i.e. describing a cooking situation 26
  • 30. Three heterogeneous knowledge sources (different data structures, different format), but sharing the same intensional meaning i.e. describing a cooking situation 26 Knowledge Pattern
  • 31. Three heterogeneous knowledge sources (different data structures, different format), but sharing the same intensional meaning i.e. modelling of a cooking situation 27
  • 32. Three heterogeneous knowledge sources (different data structures, different format), but sharing the same intensional meaning i.e. modelling of a cooking situation 27 Knowledge Pattern
  • 33. Cognitive foundations of KPs • People tend to remember items that fit into a schema (cf. Bartlett and a lot of CS from then) • In particular, schemas that are associated with some functional similarity (cf. Gibson’s affordances) • Schema similar to (conceptual) frame, script, knowledge pattern 28
  • 34. How to represent KPs • Class or property punning (with KP description) • Property domain/range axiom punning (with KP roles) • Typed named graphs • OWL ontology modules (cf. ODP) • SPARQL query patterns, SPIN patterns • hasKey patterns 29
  • 35. Pattern alignment 30 Peter Clark’s KP morphisms Content Pattern specialisation Dedre Gentner’s analogical structure mapping
  • 36. Pattern alignment 31 Investigating the application of similarity measures to complex structures vector spaces, graph matching, structure matching, etc.
  • 37. Pattern alignment • Network alignment (cf. Roded Sharan*) ! • Modular structure of conserved clusters among yeast, worm, and fly ! • Multiple network alignment revealed 183 conserved clusters. *Roded Sharan et al.: Conserved patterns of protein interaction in multiple species, Pnas, 2005. 32
  • 38. Some results at STLab on KP-based KE
  • 39. Content Ontology Patterns http://www.ontologydesignpatterns.org 34
  • 40. Pattern-based Ontology Design 35 eXtreme Design Including patterns in ontologies by design
  • 41. Schema induction of linked datasets based on patterns. Patterns are built around central concepts and used for automatic design of SPARQL queries Centrality discovery in datasets mo:Track mo:track mo:MusicArtist mo:Playlist mo:Torrent tags:taggedWithTag tags:Tag mo:Record foaf:maker mo:image dc:date rdfs:Literal dc:title dc:description mo:available_as mo:available_as mo:available_as Valentina Presutti, Lora Aroyo, Alessandro Adamou, Balthasar Schopman, Aldo Gangemi, Guus Schreiber: Extracting Core Knowledge from Linked Data. COLD2011, CEUR-WS.org Vol-782. 36
  • 42. Encyclopedic Knowledge Patterns: example • An Encyclopedic Knowledge Pattern (EKP) is discovered from the paths emerging from Wikipedia page link structure • They are represented as OWL2 ontologies Andrea Giovanni Nuzzolese, Aldo Gangemi, Valentina Presutti, Paolo Ciancarini: Encyclopedic Knowledge Patterns from Wikipedia Links. International Semantic Web Conference (1) 2011: 520-536 37
  • 43. Using Encyclopedic Knolwedge Patterns for browsing Wikipedia Serendipity in exploratory browsing http://www.aemoo.org Andrea Giovanni Nuzzolese, Valentina Presutti, Aldo Gangemi, Alberto Musetti, Paolo Ciancarini: Aemoo: exploring knowledge on the web. WebSci 2013: 272-275 Aemoo: exploratory search based on EKP - Semantic Web Challenge @ISWC 2011 – Short listed, 4th place 38
  • 44. KP-based machine reading with FRED 39 http://wit.istc.cnr.it/stlab-tools/fred/ Valentina Presutti, Francesco Draicchio, Aldo Gangemi: Knowledge Extraction Based on Discourse Representation Theory and Linguistic Frames. EKAW 2012: 114-129
  • 45. KP-based machine reading with FRED http://wit.istc.cnr.it/stlab-tools/fred/ The New York Times reported that John McCarthy died. He invented the programming language LISP. From natural language to linked data graphs, which are designed including event- and frame-based patterns 40
  • 46. Relation discovery and property generation http://wit.istc.cnr.it/kore-dev/legalo 41 f-measure=.83 Exploiting event- and frame-based patterns for relation discovery Valentina Presutti et al. Uncovering the semantics of Wikipedia pagelinks. EKAW 2014.
  • 47. Overimposing sentic frames on event- and frame-based linked data graphs representing opinions, for sentiment analysis Sentic frames from text http://wit.istc.cnr.it/stlab-tools/sentilo 42
  • 48. Overimposing sentic frames on event- and frame-based linked data graphs representing opinions, for sentiment analysis Sentic frames from text http://wit.istc.cnr.it/stlab-tools/sentilo 42
  • 49. Overimposing sentic frames on event- and frame-based linked data graphs representing opinions, for sentiment analysis Sentic frames from text http://wit.istc.cnr.it/stlab-tools/sentilo 42
  • 50. • Hybridisation is the common factor of these methods • Still far from solving the pattern alignment problem • KP-based design of knowledge sources can support easier procedure for pattern alignment 43
  • 51. Back to pattern alignment
  • 52. KP hypothesis 45 Independently of the specific data structure or knowledge representation format, certain patterns share a same intensional meaning
  • 53. Building a KP distributed system Event extraction Events 46 Ontology Matching Social Network Analysis Frame detection Leveraging different techniques for knowledge extraction Data Mining Graph Mining Rules Correspondence patterns Unusual records Frames Association rules Frequent subgraphs Anomalies Frequent itemset Unifying their results by representing them as KPs KP distributed system The KP system starts with potentially approximate and incomplete patterns and evolves to become more and more robust and accurate thanks to continuous feedback
  • 54. Knowledge pattern system • Inspired by Minsky’s frame-systems • Statistical methods can help to identify relations between KPs: • co-occurrence, causality, triggering, etc. 47 KPs KPs KPs KPs KPs KPs KPs
  • 55. Knowledge pattern system • Inspired by Minsky’s frame-systems • Statistical methods can help to identify relations between KPs: • co-occurrence, causality, triggering, etc. 47 KPs KPs KPs KPs KPs KPs KPs
  • 56. A reviewing complaint case • Imagine someone gets a paper rejection … • … and comments on Facebook …
  • 57. If we want to enable smart reasoning on heterogeneous sources we need a way to relate data like this paper’s review with this FB status
  • 58. KP entailment E.g. Patrick Pantel’s “Verb Ocean” reject [can-result-in] argue :: 11.634112 fn:Respond_to_proposal vo:can-result-in fn:Quarreling
  • 59. reject ⊑ Respond_to_proposal argue ⊑ Quarreling x ∈ Interlocutor.respond_to_proposal y ∈ Speaker.respond_to_proposal z ∈ Proposal.respond_to_proposal k ∈ Arguer1.quarreling m ∈ Arguer2.quarreling n ∈ Issue.quarreling = = ≈ ⊢ reject(r,x,y,z,…) entails argue(s,k,m,n,…)
  • 60. However… • Automatic methods are never 100% accurate • Regularities can emerge for statistical significance even if they are not relevant • We need procedure and metrics for validating KPs http://tylervigen.com/ 52
  • 61. Patterns vs KP • A pattern is a motivated structure that is proposed by experts or emerges from inductive methods • A KP formalises the intensional description of a class of situations, events, cases, etc. • When a proposed or emerging pattern is a KP? • Real data are dirty: spurious correlations • How to single out spurious ones?
  • 62. “Human is the measure of all things.” –Protagoras, ~450 B.C. 54
  • 63. We need humans in the cycle 55 K KP K K K K K Correspondence patterns Unusual records Frames Association rules Frequent subgraphs Anomalies Frequent itemset Events Ontology Matching Social Network Analysis Frame detection Data Mining Graph Mining Rules Event extraction Crowdsourcing methods
  • 64. We need humans in the cycle 55 K KP K K K K K Correspondence patterns Unusual records Frames Association rules Frequent subgraphs Anomalies Frequent itemset Events Ontology Matching Social Network Analysis Frame detection Data Mining Graph Mining Rules Event extraction Crowdsourcing methods Marco Fossati, Claudio Giuliano, Sara Tonelli: Outsourcing FrameNet to the Crowd. ACL (2) 2013: 742-747 VideoGames with a purpose applied to semantic tasks http://knowledgeforge.org/, Roberto Navigli
  • 65. Conclusion • We are less than half-way for implementing the original Semantic Web scenario • A significant step ahead is introducing semantic interoperability at pattern level • This requires the hybridisation of knowledge extraction methods as well as the reconciliation of patterns having different provenance (data mining, graph mining, ontology patterns, etc.) • Knowledge Patterns are key element for enabling such hybridisation • Knowledge Patterns should be organised as a distributed linked system where links are relations enabling smart reasoning • A distributed KP system is a resource evolving by a feeding cycle, which includes human computation 56
  • 66. Special thanks to: Aldo Gangemi, Malvina Nissim, Misael Mongiovì, Claudia d’Amato for their help and inspiring discussions.