Semantics at Scale:
A Distributional Approach
André Freitas
UFRJ
Rio de Janeiro, March 2015
Motivation
Semantic Computing for coping with the
long tail of data variety
[Figure: long tail of data variety. Axes: frequency of use vs. # of entities and attributes. Regions: relational, NoSQL, schema-less, unstructured. Annotation: more knowledge]
Full data coverage
Full automation
Full knowledge
Structure/Semantics
Unstructured Data: easy to generate
Structured Data: consistent, comparable, processable, easy to analyze
Semantic Computing
Distributional
Semantics
Robust Semantic Model
• Semantic intelligent behavior is highly dependent on knowledge scale (commonsense, semantic)
Semantics = Formal meaning representation model (lots of data) + inference model
Robust Semantic Model
• Not scalable!
1st Hard problem: Acquisition
2nd Hard problem: Consistency
3rd Hard problem: Performance
Semantics for a Complex World
• “Most semantic models have dealt with particular types of constructions, and have been carried out under very simplifying assumptions, in true lab conditions.”
• “If these idealizations are removed it is not clear at all that modern semantics can give a full account of all but the simplest models/statements.”
Formal World / Real World
Baroni et al. 2013
Distributional Semantic Models
• Semantic model with low acquisition effort (automatically built from text)
- Simplification of the representation
• Enables the construction of comprehensive commonsense/semantic KBs
• What is the cost?
- Some level of noise (semantic best-effort)
- Limited semantic model
Distributional Hypothesis
“Words occurring in similar (linguistic) contexts tend to be semantically similar”
• “He filled the wampimuk with the substance, passed it around and we all drunk some”
McDonald & Ramscar, 2001; Baroni & Boleda, 2010; Harris, 1954
Distributional Semantic Models (DSMs)
“The dog barked in the park. The owner of the dog put him on the leash since he barked.”
contexts = nouns and verbs in the same sentence
[Figure labels: bark, dog, park, leash]
Context counts for dog: bark : 2, park : 1, leash : 1, owner : 1
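The counting step on this slide can be sketched in a few lines of Python. This is a minimal illustration, not the author's implementation: the lemmatised noun and verb lists are written by hand (an assumption; a real pipeline would use a POS tagger and a lemmatiser), and the function name `cooccurrence_vectors` is hypothetical. The sketch also counts the verb "put", which the slide does not list.

```python
from collections import Counter, defaultdict

# Hand-lemmatised nouns and verbs for the two example sentences.
# Assumption: in practice these would come from a POS tagger and lemmatiser.
SENTENCES = [
    ["dog", "bark", "park"],
    ["owner", "dog", "put", "leash", "bark"],
]


def cooccurrence_vectors(sentences):
    """Map each word to a Counter of the words that share a sentence with it."""
    vectors = defaultdict(Counter)
    for words in sentences:
        for i, target in enumerate(words):
            for j, context in enumerate(words):
                if i != j:  # a word is not its own context
                    vectors[target][context] += 1
    return vectors


vectors = cooccurrence_vectors(SENTENCES)
print(dict(vectors["dog"]))
```

Each word ends up with a sparse count vector over its contexts, which is the representation the following slides compare geometrically.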
Semantic Relatedness
[Figure: word vectors in a vector space, with the angle θ between two vectors; labels: car, dog, cat, bark, run, leash]
DSMs as Commonsense Reasoning
[Figure: the same vector space and angle θ; labels: car, dog, cat, bark, run, leash]
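A common way to turn the angle θ into a relatedness score is cosine similarity. The sketch below is a minimal illustration under stated assumptions: the context dimensions (bark, run, leash) and all counts are invented for the example, and the slides do not say which similarity measure the author's systems use.

```python
import math


def cosine(u, v):
    """Cosine of the angle between two sparse vectors stored as dicts."""
    dot = sum(u.get(k, 0) * v.get(k, 0) for k in u.keys() | v.keys())
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0  # an empty vector is related to nothing
    return dot / (norm_u * norm_v)


# Invented context counts, for illustration only.
dog = {"bark": 2, "run": 1, "leash": 1}
cat = {"run": 1, "leash": 1}
car = {"run": 2}

print("dog vs cat:", round(cosine(dog, cat), 3))
print("dog vs car:", round(cosine(dog, car), 3))
```

With these toy counts, dog comes out closer to cat than to car, which is the kind of graded judgement the slides call semantic relatedness.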
First Application
Schema-agnostic
Query Approach
Shift in the Database Landscape
• Very large and dynamic “schemas”.
before 2000: 10s-100s attributes
circa 2015: 1,000s-1,000,000s attributes
Brodie & Liu, 2010
Databases for a Complex World
How do you query data in this scenario?
Schema-agnosticism
Abstraction Layer
Vocabulary Problem for Databases
Query: Who is the daughter of Bill Clinton?
Data: Bill Clinton, child, Chelsea Clinton
Query: Who is the daughter of Bill Clinton married to?
Semantic Gap
Schema-agnostic query mechanisms
• Abstraction level differences
• Lexical variation
• Structural (compositional) differences
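The lexical gap between the query term "daughter" and the dataset property "child" can be bridged by ranking candidate properties by distributional relatedness. The following is a minimal sketch, not the Treo or Ƭ-Space implementation: the three-dimensional vectors are invented for illustration, and `rank_properties` is a hypothetical helper.

```python
import math

# Toy distributional vectors, invented for illustration only.
# A real system would derive them from a large text corpus.
VECTORS = {
    "daughter": [0.9, 0.8, 0.1],
    "child": [0.8, 0.9, 0.2],
    "spouse": [0.7, 0.1, 0.6],
    "birthPlace": [0.1, 0.2, 0.9],
}


def cosine(u, v):
    """Cosine similarity between two dense vectors of equal length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def rank_properties(query_term, properties, threshold=0.0):
    """Return (property, score) pairs at or above threshold, best first."""
    scored = [(p, cosine(VECTORS[query_term], VECTORS[p])) for p in properties]
    kept = [pair for pair in scored if pair[1] >= threshold]
    return sorted(kept, key=lambda pair: pair[1], reverse=True)


ranking = rank_properties("daughter", ["spouse", "birthPlace", "child"])
for prop, score in ranking:
    print(f"{prop}: {score:.3f}")
```

The user never needs to know that the dataset calls the relation `child`; the ranking picks it as the closest property to the query term.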
Proposed Approach
Who is the daughter of Bill Clinton married to?
• Abstraction level differences
• Lexical variation
• Structural (compositional) differences
Ƭ-Space: Hybrid Distributional-Relational
Semantic Model
A Distributional Structured Semantic Space for
Querying RDF Graph Data, IJSC 2012
Approach Overview
[Figure: architecture diagram. Labels: Schema-agnostic Query, Query Analysis, Query Features, Query Planner, Query Plan, Ƭ (Ƭ-Space), Large-scale unstructured data, Database]
Addressing the Vocabulary Problem for
Databases (with Distributional Semantics)
Gaelic: direction
Dataset (DBpedia 3.6 + YAGO classes):
45,768 properties
288,316 classes
9,434,677 instances
128,071,259 triples
Simple Queries (Video)
More Complex Queries (Video)
Treo Answers Jeopardy Queries (Video)
http://bit.ly/1hWcch9
Relevance
Comparative Analysis
• Better recall and query coverage compared to baselines with equivalent precision.
• More comprehensive semantic matching.
Distributional Semantics vs WordNet
• Distributional semantics provides a more comprehensive semantic matching
A Distributional Approach for Terminological Semantic Search on the Linked Data Web, ACM SAC, 2012
Large-scale Querying
[Figure: long tail of data variety; axes: frequency of use vs. # of entities and attributes; regions: relational, NoSQL, schema-less, unstructured]
Schema-agnostic querying
Schema-agnostic Database will be
released in April 2015
Large-Scale Graph Extraction
Relation/Graph Extraction
• Now that we are schema-agnostic ...
• From Text to Knowledge Graph
• Relations + Context + Entity Linking
• Ontology-agnostic
• RDF serialization
Relation/Graph Extraction
In 2002, GE acquired the wind power assets of Enron.
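One way to picture "Relations + Context + Entity Linking" with an RDF serialization is to attach the temporal context to a reified statement. This is a sketch under several assumptions, not the extractor described in the talk: the relation is written by hand rather than extracted, the `http://example.org/` namespace and the `Relation` and `to_ntriples` names are hypothetical, the linked entity URI is assumed, and standard RDF reification is swapped in as one possible encoding of context.

```python
from dataclasses import dataclass, field

RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
EX = "http://example.org/"  # hypothetical namespace, not from the slides


@dataclass
class Relation:
    subject_uri: str   # linked entity (assumed result of entity linking)
    predicate: str     # relation phrase kept as-is (ontology-agnostic)
    obj: str           # object kept as a plain-text literal
    context: dict = field(default_factory=dict)  # e.g. time, location


def literal(text):
    """Quote a string as an N-Triples literal, escaping \\ and \"."""
    escaped = str(text).replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def to_ntriples(rel, stmt_id):
    """Serialise one relation as a reified RDF statement plus its context."""
    stmt = f"<{EX}stmt/{stmt_id}>"
    lines = [
        f"{stmt} <{RDF}type> <{RDF}Statement> .",
        f"{stmt} <{RDF}subject> <{rel.subject_uri}> .",
        f"{stmt} <{RDF}predicate> <{EX}rel/{rel.predicate}> .",
        f"{stmt} <{RDF}object> {literal(rel.obj)} .",
    ]
    for key, value in rel.context.items():
        lines.append(f"{stmt} <{EX}context/{key}> {literal(value)} .")
    return "\n".join(lines)


# Hand-written reading of the example sentence (not extractor output).
rel = Relation(
    subject_uri="http://dbpedia.org/resource/General_Electric",
    predicate="acquired",
    obj="the wind power assets of Enron",
    context={"time": "2002"},
)
ntriples = to_ntriples(rel, 1)
print(ntriples)
```

Keeping the predicate and object as surface text rather than mapping them to a fixed ontology mirrors the ontology-agnostic stance of the slides; the context keys are free-form for the same reason.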
Relation/Graph Extraction
General Electric Company, or GE, is an American multinational conglomerate corporation incorporated in Schenectady, New York
A Semantic Best-Effort Approach for Extracting Structured Discourse Graphs from Wikipedia, WoLE 2012
Large-scale Extraction
[Figure: long tail of data variety; axes: frequency of use vs. # of entities and attributes; regions: relational, NoSQL, schema-less, unstructured]
Large-scale Graph Extraction
Approximate &
Selective Reasoning
Commonsense Reasoning
• Coping with KB incompleteness
- Supporting semantic approximation
• Selective (focussed) reasoning
- Selecting the relevant facts in the context of the inference
Acquisition
Scalability
Strategy: Using distributional semantics to solve both the acquisition and scalability problems
Commonsense Reasoning
Does John Smith have a degree?
[Figure, built up over seven slides: a graph linking an instance-level fact to a commonsense KB.
Instance-level: John Smith, occupation, Engineer.
Commonsense KB labels, in order of appearance: Engineer, subjectof, learn; is a, memorization; have or involve, education; at location, university; college; gives, degree.
Slide annotations: Selective reasoning; Coping with KB incompleteness.]
A Distributional Semantics Approach for Selective Reasoning on
Commonsense Graph Knowledge Bases, NLDB 2014.
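The two ideas on these slides, selective reasoning and coping with KB incompleteness, can be sketched as a greedy graph walk guided by relatedness to the target term. This is a simplification for illustration and not the algorithm of the NLDB 2014 paper: the edge directions are assumed from the slide labels, the relatedness scores are invented, and `selective_path` is a hypothetical name. The missing link between "university" and "college" is bridged by an approximate jump, marked `~`.

```python
# Commonsense KB as adjacency lists. Edge directions are an assumption.
EDGES = {
    "engineer": [("subjectof", "learn")],
    "learn": [("is a", "memorization"), ("have or involve", "education")],
    "education": [("at location", "university")],
    "college": [("gives", "degree")],
}

# Invented relatedness scores; a real system would compute them from a DSM.
REL = {
    frozenset(("learn", "degree")): 0.5,
    frozenset(("memorization", "degree")): 0.2,
    frozenset(("education", "degree")): 0.7,
    frozenset(("university", "degree")): 0.8,
    frozenset(("university", "college")): 0.9,
}


def relatedness(a, b):
    return 1.0 if a == b else REL.get(frozenset((a, b)), 0.0)


def selective_path(start, target, max_steps=10, jump_threshold=0.8):
    """Greedily follow the edge whose end node is most related to target.

    If the current node has no unvisited outgoing edge, jump to the most
    related KB node with outgoing edges (semantic approximation).
    """
    path, node, visited = [start], start, {start}
    for _ in range(max_steps):
        if node == target:
            return path
        options = [(relatedness(n, target), label, n)
                   for label, n in EDGES.get(node, []) if n not in visited]
        if options:
            _, label, nxt = max(options)  # selective: keep only the best edge
        else:
            jumps = [(relatedness(node, n), n) for n in EDGES if n not in visited]
            jumps = [j for j in jumps if j[0] >= jump_threshold]
            if not jumps:
                return None  # no edge and no close enough concept
            label, nxt = "~", max(jumps)[1]
        path += [label, nxt]
        visited.add(nxt)
        node = nxt
    return path if node == target else None


path = selective_path("engineer", "degree")
print(" ".join(path))
```

The walk never expands "memorization", because it is less related to "degree" than "education" is; that pruning is what keeps the reasoning selective on a large KB.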
Programming in a Schema-agnostic World
Towards An Approximative Ontology-Agnostic Approach for Logic
Programs, FOIKS 2014.
Semantics at Scale: When Distributional Semantics meets Logic
Programming, ALP Newsletter, 2014
Programming in a Schema-agnostic World
[Figure: long tail of data variety; axes: frequency of use vs. # of entities and attributes; regions: relational, NoSQL, schema-less, unstructured]
Schema-agnostic programs
Concluding Remarks
• Existing semantic technologies can address today's major data management problems
• Multi-disciplinarity is one key:
- NLP + IR + Semantic Web + Databases
• Schema-agnosticism is a central property/functionality/goal!
• Distributional Semantics + semantics of structured data = schema-agnosticism
• Schema-agnosticism has a major impact on information systems.
• We can tame the long tail of data variety!
• The wave is just starting. Be a part of it!
Take-away Message
Want to play with Distributional
Semantics?
http://easy-esa.org
AI & Scientific Discovery in Oncology: Opportunities, Challenges & Trends
 
AI Systems @ Manchester
AI Systems @ ManchesterAI Systems @ Manchester
AI Systems @ Manchester
 
Open IE tutorial 2018
Open IE tutorial 2018Open IE tutorial 2018
Open IE tutorial 2018
 
WiSS Challenge - Day 2
WiSS Challenge - Day 2WiSS Challenge - Day 2
WiSS Challenge - Day 2
 
WISS QA Do it yourself Question answering over Linked Data
WISS QA Do it yourself Question answering over Linked DataWISS QA Do it yourself Question answering over Linked Data
WISS QA Do it yourself Question answering over Linked Data
 
A Semantic Web Platform for Automating the Interpretation of Finite Element ...
A Semantic Web Platform for Automating the Interpretation of Finite Element ...A Semantic Web Platform for Automating the Interpretation of Finite Element ...
A Semantic Web Platform for Automating the Interpretation of Finite Element ...
 
How Semantic Technologies can help to cure Hearing Loss?
How Semantic Technologies can help to cure Hearing Loss?How Semantic Technologies can help to cure Hearing Loss?
How Semantic Technologies can help to cure Hearing Loss?
 
Introduction to Distributional Semantics
Introduction to Distributional SemanticsIntroduction to Distributional Semantics
Introduction to Distributional Semantics
 
Question Answering over Linked Data: Challenges, Approaches & Trends (Tutoria...
Question Answering over Linked Data: Challenges, Approaches & Trends (Tutoria...Question Answering over Linked Data: Challenges, Approaches & Trends (Tutoria...
Question Answering over Linked Data: Challenges, Approaches & Trends (Tutoria...
 
Natural Language Queries over Heterogeneous Linked Data Graphs: A Distributio...
Natural Language Queries over Heterogeneous Linked Data Graphs: A Distributio...Natural Language Queries over Heterogeneous Linked Data Graphs: A Distributio...
Natural Language Queries over Heterogeneous Linked Data Graphs: A Distributio...
 

Dernier

Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...amitlee9823
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...amitlee9823
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823
 

Dernier (20)

Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 

Semantics at Scale: A Distributional Approach

  • 1. Semantics at Scale: A Distributional Approach André Freitas UFRJ Rio de Janeiro, March 2015
  • 3. Semantic Computing for coping with the long tail of data variety. [Figure: frequency of use vs. # of entities and attributes; regions: relational, NoSQL, schema-less, unstructured; annotations: more knowledge, full data coverage, full automation, full knowledge]
  • 4. Structure/Semantics: Unstructured Data (easy to generate) vs. Structured Data (easy to analyze); consistent, comparable, processable; Semantic Computing.
  • 6. Robust Semantic Model: Semantic intelligent behavior is highly dependent on knowledge scale (commonsense, semantic). Semantics = formal meaning representation model (lots of data) + inference model.
  • 7. Robust Semantic Model: Not scalable! 1st hard problem: Acquisition. Semantics = formal meaning representation model (lots of data) + inference model.
  • 8. Robust Semantic Model: Not scalable! 2nd hard problem: Consistency. Semantics = formal meaning representation model (lots of data) + inference model.
  • 9. Robust Semantic Model: Not scalable! 3rd hard problem: Performance. Semantics = formal meaning representation model (lots of data) + inference model.
  • 10. Semantics for a Complex World (Formal World vs. Real World): “Most semantic models have dealt with particular types of constructions, and have been carried out under very simplifying assumptions, in true lab conditions.” “If these idealizations are removed it is not clear at all that modern semantics can give a full account of all but the simplest models/statements.” (Baroni et al. 2013)
  • 11. Distributional Semantic Models: Semantic model with low acquisition effort (automatically built from text); simplification of the representation; enables the construction of comprehensive commonsense/semantic KBs. What is the cost? Some level of noise (semantic best-effort); limited semantic model.
  • 12. Distributional Hypothesis: “Words occurring in similar (linguistic) contexts tend to be semantically similar.” Example: “He filled the wampimuk with the substance, passed it around and we all drunk some.” (Harris, 1954; McDonald & Ramscar, 2001; Baroni & Boleda, 2010)
  • 13. Distributional Semantic Models (DSMs): “The dog barked in the park. The owner of the dog put him on the leash since he barked.” Contexts = nouns and verbs in the same sentence.
  • 14. Distributional Semantic Models (DSMs): “The dog barked in the park. The owner of the dog put him on the leash since he barked.” Contexts = nouns and verbs in the same sentence (bark, dog, park, leash). Context counts for “dog”: bark: 2, park: 1, leash: 1, owner: 1.
  • 17. DSMs as Commonsense Reasoning. [Figure: vector space with labels car, dog, cat, bark, run, leash and angle θ]
  • 19. Shift in the Database Landscape: Very large and dynamic “schemas”: 10s-100s of attributes before 2000; 1,000s-1,000,000s of attributes circa 2015 (Brodie & Liu, 2010).
  • 20. Databases for a Complex World: How do you query data in this scenario?
  • 21. Schema-agnosticism: Abstraction Layer. Query: “Who is the daughter of Bill Clinton?” Data: Bill Clinton, child, Chelsea Clinton.
  • 22. Vocabulary Problem for Databases: “Who is the daughter of Bill Clinton married to?” Semantic gap for schema-agnostic query mechanisms: abstraction level differences; lexical variation; structural (compositional) differences.
  • 23. Proposed Approach: “Who is the daughter of Bill Clinton married to?” Abstraction level differences; lexical variation; structural (compositional) differences.
  • 24. Ƭ-Space: Hybrid Distributional-Relational Semantic Model (A Distributional Structured Semantic Space for Querying RDF Graph Data, IJSC 2012)
  • 25. Approach Overview. [Diagram labels: Schema-agnostic Query, Query Analysis, Query Features, Query Planner, Query Plan, Ƭ, Large-scale unstructured data, Database]
  • 26. Addressing the Vocabulary Problem for Databases (with Distributional Semantics) Gaelic: direction
  • 27. Dataset (DBpedia 3.6 + YAGO classes): 45,768 properties; 288,316 classes; 9,434,677 instances; 128,071,259 triples
  • 29. More Complex Queries (Video)
  • 30. Treo Answers Jeopardy Queries (Video) http://bit.ly/1hWcch9
  • 32. Comparative Analysis: Better recall and query coverage compared to baselines with equivalent precision; more comprehensive semantic matching.
  • 33. Distributional Semantics vs WordNet: Distributional semantics provides a more comprehensive semantic matching (A Distributional Approach for Terminological Semantic Search on the Linked Data Web, ACM SAC, 2012)
  • 34. Large-scale Querying: Schema-agnostic querying. [Figure: frequency of use vs. # of entities and attributes; regions: relational, NoSQL, schema-less, unstructured]
  • 35. Schema-agnostic Database will be released in April 2015
  • 37. Relation/Graph Extraction: Now that we are schema-agnostic ... From text to knowledge graph; relations + context + entity linking; ontology-agnostic; RDF serialization.
  • 38. Relation/Graph Extraction: “In 2002, GE acquired the wind power assets of Enron.”
  • 39. Relation/Graph Extraction: “General Electric Company, or GE, is an American multinational conglomerate corporation incorporated in Schenectady, New York.” (A Semantic Best-Effort Approach for Extracting Structured Discourse Graphs from Wikipedia, WoLE 2012)
  • 40. Large-scale Extraction: Large-scale graph extraction. [Figure: frequency of use vs. # of entities and attributes; regions: relational, NoSQL, schema-less, unstructured]
  • 42. Commonsense Reasoning: Coping with KB incompleteness (supporting semantic approximation); selective (focussed) reasoning (selecting the relevant facts in the context of the inference). Strategy: using distributional semantics to solve both the acquisition and scalability problems.
  • 43. Commonsense Reasoning: “Does John Smith have a degree?” [Graph labels, instance level: John Smith, Engineer, occupation]
  • 44. Commonsense Reasoning: “Does John Smith have a degree?” [Graph labels, instance level: John Smith, Engineer, occupation; Commonsense KB: Engineer, learn, subjectof]
  • 45. Selective Reasoning: “Does John Smith have a degree?” [Graph labels, instance level: John Smith, Engineer, occupation; Commonsense KB: Engineer, learn, subjectof, memorization, is a]
  • 46. Commonsense Reasoning: “Does John Smith have a degree?” [Graph labels, instance level: John Smith, Engineer, occupation; Commonsense KB: Engineer, learn, subjectof, memorization, is a, education, have or involve]
  • 47. Commonsense Reasoning: “Does John Smith have a degree?” [Graph labels, instance level: John Smith, Engineer, occupation; Commonsense KB: Engineer, learn, subjectof, memorization, is a, education, have or involve, university, at location]
  • 48. Coping with Incompleteness: “Does John Smith have a degree?” [Graph labels, instance level: John Smith, Engineer, occupation; Commonsense KB: Engineer, learn, subjectof, memorization, is a, education, have or involve, university, at location, college]
  • 49. Commonsense Reasoning: “Does John Smith have a degree?” [Graph labels, instance level: John Smith, Engineer, occupation; Commonsense KB: Engineer, learn, subjectof, memorization, is a, education, have or involve, university, at location, college, degree, gives] (A Distributional Semantics Approach for Selective Reasoning on Commonsense Graph Knowledge Bases, NLDB 2014)
  • 50. Programming in a Schema-agnostic World (Towards An Approximative Ontology-Agnostic Approach for Logic Programs, FOIKS 2014; Semantics at Scale: When Distributional Semantics meets Logic Programming, ALP Newsletter, 2014)
  • 51. Programming in a Schema-agnostic World: Schema-agnostic programs. [Figure: frequency of use vs. # of entities and attributes; regions: relational, NoSQL, schema-less, unstructured]
  • 53. Take-away Message: Existing semantic technologies can address major data management problems today. Multi-disciplinarity is one key: NLP + IR + Semantic Web + Databases. Schema-agnosticism is a central property/functionality/goal! Distributional semantics + semantics of structured data = schema-agnosticism. Schema-agnosticism brings major impact for information systems. We can tame the long tail of data variety! The wave is just starting. Be a part of it!
  • 54. Want to play with Distributional Semantics? http://easy-esa.org
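
The counting idea from slides 13, 14 and 17 can be sketched in a few lines of Python. This is only an illustrative toy, not the model used in the talk: it assumes the two example sentences have already been reduced by hand to lemmatized nouns and verbs (the `sentences` list is that assumption), and the helper names `build_vectors` and `cosine` are hypothetical. Note that this hand-made reduction also keeps the verb "put", which the counts on slide 14 do not list.

```python
from collections import Counter, defaultdict
from math import sqrt

# Assumption: nouns and verbs of the slide's two sentences, lemmatized by hand.
sentences = [
    ["dog", "bark", "park"],
    ["owner", "dog", "put", "leash", "bark"],
]


def build_vectors(sentences):
    """Count, for each word, the other words sharing a sentence with it."""
    vectors = defaultdict(Counter)
    for sentence in sentences:
        for i, target in enumerate(sentence):
            for j, context in enumerate(sentence):
                if i != j:  # a word is not its own context
                    vectors[target][context] += 1
    return vectors


def cosine(u, v):
    """Cosine of the angle between two sparse count vectors."""
    dot = sum(u[k] * v[k] for k in u if k in v)
    norm_u = sqrt(sum(x * x for x in u.values()))
    norm_v = sqrt(sum(x * x for x in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


vectors = build_vectors(sentences)
print(dict(vectors["dog"]))  # context counts for "dog"
print(round(cosine(vectors["dog"], vectors["bark"]), 3))  # → 0.5
```

With a large corpus instead of two sentences, the same cosine score is what lets a query term such as "daughter" be matched against a dataset term such as "child" without a hand-built mapping; the sketch above only shows the mechanics.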