SlideShare une entreprise Scribd logo
1  sur  69
Télécharger pour lire hors ligne
BACK TO THE DRAWING BOARD
The Myth of Data-Driven NLU and How
to go Forward (Again)
WALID S. SABA, PhD
Principal AI Scientist
Astound.ai
BACK TO THE DRAWING BOARD
The Myth of Data-Driven NLU and How
to go Forward (Again)
WALID S. SABA, PhD
Principal AI Scientist
Neva.ai
SRI International
November 2, 2017
Every thing in nature, in the inanimate as well
as in the animate world, happens according to
some rules, though we do not always know
them …
I reject the contention that an important
theoretical difference exists between formal
and natural languages …
One can assume a theory of the world that
is isomorphic to the way we talk about it …
in this case, semantics becomes very
nearly trivial
about the resurgence of
and the currently dominant
paradigm in ‘AI’ …
the availability of huge amounts data, coupled with advances
in computer hardware and distributed computing resulted in
some advances in certain types of (data-centric) problems
(image, speech, fraud detection, text categorization, etc.)
But …
many problems in AI require
understanding that is beyond
discovering patterns in data
Identifying an
adult female in an
image might be a
data-centric
problem
However, picking up the
photo with the teacher and
the photo with the mother
requires information that
is not in the data
Identifying an
adult female in an
image might be a
data-centric
problem
where’s the
musical band?
where’s the
musical band?
Musician?
a person who plays a musical
instrument?
So what is
at issue
here?
So what is
at issue
here?
The issue here is that there are no musicians,
teachers, lawyers, or even mothers! What
exists, ontologically (metaphysically), are
humans, and the above concepts are logical
concepts that are, sometimes, true of a
certain human
So what is
at issue
here?
The issue here is that there are no musicians,
teachers, lawyers, or even mothers! What
exists, ontologically (metaphysically), are
humans, and the above concepts are logical
concepts that are, sometimes, true of a
certain human
Data-driven (quantitative) approaches can
only reason with (detect, infer, recognize)
objects that are of an ontological type, but
not logical concepts, that form the
majority of objects of (human) thought
human lawyer
dancer
ONTOLOGIACL CONCEPTS LOGIACL CONCEPTS
teacher
mother
...
Can a ‘lawyer’ or a ‘dancer’ be identified by data-analysis only?
failure to distinguish between logical and
ontological concepts is not only a flaw in
data-driven approaches
logical/formal semantics also failed to
provide adequate models for natural
language and for exactly the same reason
but data-driven approaches are
inadequate for natural language
understanding for other reasons
besides their inability to
distinguish between logical and
ontological concepts
(1) The trophy did not fit in the brown suitcase
because it was too
a. big
b. small
(2) Dr. Spok told Jon that he should soon be done
a. writing his thesis
b. reading his thesis
No Statistical Significance in the Data
(1) The trophy did not fit in the brown suitcase
because it was too
a. big
b. small
(2) Dr. Spok told Jon that he should soon be done
a. writing his thesis
b. reading his thesis
No Statistical Significance in the Data
In a data-driven
approach it is not
clear how such
decisions can be
made since the only
difference between
the sentence-pairs in
(1) and (2) are words
that co-occur with
equal probabilities
(1) The trophy did not fit in the brown suitcase
because it was too
a. big
b. small
(2) Dr. Spok told Jon that he should soon be done
a. writing his thesis
b. reading his thesis
No Statistical Significance in the Data
How many labeled
examples would be
required to learn how
to resolve such
references?
(1) Don’t worry, Simon is [as solid as] a rock.
(2) The [person sitting at the] corner table wants another beer.
(3) Carlos likes to play [the game] bridge.
(4) Mary enjoyed [watching] the movie.
(5) Carl owns a [different] house on every street in the village.
It’s not even in the data
challenges in the computational comprehension of ordinary text is often
due to quite a bit of missing text – text which is not explicitly stated but
is often assumed as shared knowledge among a community of language
users
Innate Ontological Structure?
The readings in (1a) and (2a) are clearly preferred over the readings in (1b)
and (2b), although there are no structural/syntactic rules that speakers of
ordinary language seem to be following. What makes the AORs phenomenon
even more intriguing is the fact that these preferences are also consistently
made across multiple languages.
Our grade school teachers once
told us that
256 = 16
Reasoning
with data only
in NL leads to
absurdities
Our grade school teachers once
told us that
256 = 16
But is this (always) correct?
?
Reasoning with data only in NL
leads to absurdities
?
Reasoning with data only in NL
leads to absurdities
If 16 always equals (and
is replaceable by) the
square root of 256 then
we can alter reality and
restate the above as
If logical/formal semantics
failed, and if data-driven
approaches are inadequate,
then what?
logical semantics failed us (thus
far) because they did not
differentiate between logical
and ontological concepts
as such, logical semantics were mere symbol
manipulation systems adequate for mathematics,
but not for natural language
what happens
when we do not
distinguish
between
ontological and
logical concepts?
what happens
when we do not
distinguish
between
ontological and
logical concepts?
CARL GUSTAV HEMPEL
Consider the following
That is, we have the hypothesis H1 that ‘All ravens are black’.
This hypothesis, however, is logically equivalent to the
hypothesis H2 that ‘All non-black things are not ravens’.
Consider the following
That is, we have the hypothesis H1 that ‘All ravens are black’.
This hypothesis, however, is logically equivalent to the
hypothesis H2 that ‘All non-black things are not ravens’.
In FOPL, we have:
Observing non-black and non-raven
objects confirms hypothesis H2 (that
‘All non-black things are not ravens’).
Observing black ravens confirms
hypothesis H1, namely that ‘All
ravens are black’.
H1 H2
and here’s the paradox
H1 is equivalent to H2, and so observing a red apple now
confirms the hypothesis that ‘All ravens are black’
Observing non-black and non-raven
objects confirms hypothesis H2 (that
‘All non-black things are not ravens’).
Observing black ravens confirms
hypothesis H1, namely that ‘All
ravens are black’.
H1 H2
and here’s the solution
First, let us make our logic strongly-typed:
• in addition to a scope, a variable also has a type.
• the type associated with a variable is the most general type
for which the predicate makes sense
and here’s the solution
First, let us make our logic strongly-typed:
• in addition to a scope, a variable also has a type.
• the type associated with a variable is the most general type
for which the predicate makes sense
and here’s the solution
First, let us make our logic strongly-typed:
• in addition to a scope, a variable also has a type.
• the type associated with a variable is the most general type
for which the predicate makes sense
The predicate IMMINENT is applicable
to objects of type event (or, it makes
sense to say IMMINENT x when x is an
event or any of its subtypes)
1. (8x)(raven(x) ¾ black(x))
the traditional representation in FOPL
All ravens are black
)
1. (8x)(raven(x) ¾ black(x))
2. (8x)(raven(x :: raven) ¾ black(x :: physical))
types are associated with
variables in the context of
the predicates in which
they are used
All ravens are black
)
1. (8x)(raven(x) ¾ black(x))
2. (8x)(raven(x :: raven) ¾ black(x :: physical))
3. (8x)(raven(x :: raven) ¾ black(x :: (physical ² raven)))
x is associated with more
than one type in the same
scope, requiring a type
unification
All ravens are black
)
1. (8x)(raven(x) ¾ black(x))
2. (8x)(raven(x :: raven) ¾ black(x :: physical))
3. (8x)(raven(x :: raven) ¾ black(x :: (physical ² raven)))
4. (8x)(raven(x :: raven) ¾ black(x :: raven))
since raven v physical,
the type unification here is
simple, resulting in raven
All ravens are black
)
1. (8x)(raven(x) ¾ black(x))
2. (8x)(raven(x :: raven) ¾ black(x :: physical))
3. (8x)(raven(x :: raven) ¾ black(x :: (physical ² raven)))
4. (8x)(raven(x :: raven) ¾ black(x :: raven))
5. (8x :: raven)(true ¾ black(x))
(i) since x is now associated with one type throughout
its scope, the type can be moved to where x was
introduced; and (ii) the predicate raven(x :: raven)
is replaced by true
All ravens are black
)
1. (8x)(raven(x) ¾ black(x))
2. (8x)(raven(x :: raven) ¾ black(x :: physical))
3. (8x)(raven(x :: raven) ¾ black(x :: (physical ² raven)))
4. (8x)(raven(x :: raven) ¾ black(x :: raven))
5. (8x :: raven)(true ¾ black(x))
6. (8x :: raven)(black(x))
All ravens are black
)
All ravens are black ) (8x :: raven)(black(x))
ontological concept logical concept
1. (8x)(:black(x) ¾ :raven(x))
the traditional representation in FOPL
All non-black things are not ravens
)
1. (8x)(:black(x) ¾ :raven(x))
2. (8x)(:black(x :: physical) ¾ :raven(x :: raven))
types are associated with
variables in the context of
the predicates in which
they are used
All non-black things are not ravens
)
1. (8x)(:black(x) ¾ :raven(x))
2. (8x)(:black(x :: physical) ¾ :raven(x :: raven))
3. (8x)(:black(x :: (physical ² raven)) ¾ :raven(x :: raven))
x is associated with more
than one type in the same
scope, requiring a type
unification
All non-black things are not ravens
)
1. (8x)(:black(x) ¾ :raven(x))
2. (8x)(:black(x :: physical) ¾ :raven(x :: raven))
3. (8x)(:black(x :: (physical ² raven)) ¾ :raven(x :: raven))
4. (8x)(:black(x :: raven) ¾ :raven(x :: raven))
since raven v physical,
the type unification here is
simple, resulting in raven
All non-black things are not ravens
)
1. (8x)(:black(x) ¾ :raven(x))
2. (8x)(:black(x :: physical) ¾ :raven(x :: raven))
3. (8x)(:black(x :: (physical ² raven)) ¾ :raven(x :: raven))
4. (8x)(:black(x :: raven) ¾ :raven(x :: raven))
5. (8x :: raven)(:black(x) ¾ :true)
All non-black things are not ravens
)
(i) since x is now associated with one type throughout
its scope, the type can be moved to where x was
introduced; and (ii) the predicate raven(x :: raven)
is replaced by true
1. (8x)(:black(x) ¾ :raven(x))
2. (8x)(:black(x :: physical) ¾ :raven(x :: raven))
3. (8x)(:black(x :: (physical ² raven)) ¾ :raven(x :: raven))
4. (8x)(:black(x :: raven) ¾ :raven(x :: raven))
5. (8x :: raven)(:black(x) ¾ :true)
6. (8x :: raven)(black(x))
All non-black things are not ravens
)
simplifying we get (6) – we used the facts:
(P ¾ Q) = (:P _ Q)
:true = false
(P _ false) = P
1. (8x)(:black(x) ¾ :raven(x))
2. (8x)(:black(x :: physical) ¾ :raven(x :: raven))
3. (8x)(:black(x :: (physical ² raven)) ¾ :raven(x :: raven))
4. (8x)(:black(x :: raven) ¾ :raven(x :: raven))
5. (8x :: raven)(:black(x) ¾ :true)
6. (8x :: raven)(black(x))
All non-black things are not ravens
)
All non-black things are not ravens ) (8x :: raven)(black(x))
ontological concept
logical concept
All non-black things are not ravens ) (8x :: raven)(black(x))
what paradox of the ravens?
All ravens are black ) (8x :: raven)(black(x))
It seems that making a clear distinction between logical
and ontological concepts shows that the problem was in
the representation and there is no so-called ‘paradox of
the ravens’
logical
vs.
ontological
concepts
let‘s put the typed ontologic to use
let‘s put the typed ontologic to use
Type unification is required
let‘s put the typed ontologic to use
Type unification is required
Final representation (we now know Sheba must be a human)
let‘s put the typed ontologic to use
Two type unifications are required
Final representation
let‘s put the typed ontologic to use
How is it that ‘beautiful’ could
be meant to describe Sara or
her dancing?
let‘s put the typed ontologic to use
Two type unifications are required
let‘s put the typed ontologic to use
Final interpretation
Since ‘beautiful’ can be said of activities as well as
humans, the sentence remains ambiguous. But how about
‘Sara is a tall dancer’ ?
let‘s put the typed ontologic to use
Two type unifications are required
let‘s put the typed ontologic to use
Since we finally get
Unlike the case of ‘Sara is a beautiful dancer’, in this case
type unification rejected the possibility of a ‘tall dancing’
Adjective-ordering restrictions
John owns a beautiful red car
John owns a red beautiful car
Adjective-ordering restrictions
John owns a beautiful red car
John owns a red beautiful car
Downcasting is undecidable so (a) is acceptable but (b) is not
Adjective-ordering restrictions
John owns a beautiful red car
John owns a red beautiful car
The second order requires us to go up, and then down again!
Adjective-ordering restrictions
John owns a red and beautiful car
)
(John owns a red car)
and (John owns a beautiful car)
We can, though we rarely do, say two separate statements!
Lexical disambiguation
Type unifications relevant to ‘party’
Type unification will remove the
‘political group’ meaning of ‘party’
Discovering ‘missing text’ - metonymy
Discovering ‘missing text’ - metonymy
x, introduced as a HamSandwich is also
considered to be an object of type Human
in the context of a want relation
Discovering ‘missing text’ - metonymy
The unification of two types
that are not on the same path
is some salient relationship
between the two types
thank you
Data-driven approaches to NLU are inadequate for several reasons, most
important of which is the fact that understanding ordinary spoken language
involves discovering the missing text that is not even explicitly stated in the data
Logical/formal semantics treated logical and ontological concepts the same, as a
consequence, they were for the most part symbol manipulation systems devoid of
any ontological content
Typed logical semantics (embedded with ontological types) seem to suggest very
simple solutions to many challenging problems in language understanding

Contenu connexe

Tendances

Question Answering as Search - the Anserini Pipeline and Other Stories
Question Answering as Search - the Anserini Pipeline and Other StoriesQuestion Answering as Search - the Anserini Pipeline and Other Stories
Question Answering as Search - the Anserini Pipeline and Other StoriesSujit Pal
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language Processingrohitnayak
 
Random forest and decision tree
Random forest and decision treeRandom forest and decision tree
Random forest and decision treeAAKANKSHA JAIN
 
Benchmark comparison of Large Language Models
Benchmark comparison of Large Language ModelsBenchmark comparison of Large Language Models
Benchmark comparison of Large Language ModelsMatej Varga
 
Vertex AI: Pipelines for your MLOps workflows
Vertex AI: Pipelines for your MLOps workflowsVertex AI: Pipelines for your MLOps workflows
Vertex AI: Pipelines for your MLOps workflowsMárton Kodok
 
NLP Bootcamp 2018 : Representation Learning of text for NLP
NLP Bootcamp 2018 : Representation Learning of text for NLPNLP Bootcamp 2018 : Representation Learning of text for NLP
NLP Bootcamp 2018 : Representation Learning of text for NLPAnuj Gupta
 

Tendances (7)

Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Question Answering as Search - the Anserini Pipeline and Other Stories
Question Answering as Search - the Anserini Pipeline and Other StoriesQuestion Answering as Search - the Anserini Pipeline and Other Stories
Question Answering as Search - the Anserini Pipeline and Other Stories
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language Processing
 
Random forest and decision tree
Random forest and decision treeRandom forest and decision tree
Random forest and decision tree
 
Benchmark comparison of Large Language Models
Benchmark comparison of Large Language ModelsBenchmark comparison of Large Language Models
Benchmark comparison of Large Language Models
 
Vertex AI: Pipelines for your MLOps workflows
Vertex AI: Pipelines for your MLOps workflowsVertex AI: Pipelines for your MLOps workflows
Vertex AI: Pipelines for your MLOps workflows
 
NLP Bootcamp 2018 : Representation Learning of text for NLP
NLP Bootcamp 2018 : Representation Learning of text for NLPNLP Bootcamp 2018 : Representation Learning of text for NLP
NLP Bootcamp 2018 : Representation Learning of text for NLP
 

Similaire à BACK TO THE DRAWING BOARD - The Myth of Data-Driven NLU and How to go Forward (Again)

Deep misconceptions and the myth of data driven NLU
Deep misconceptions and the myth of data driven NLUDeep misconceptions and the myth of data driven NLU
Deep misconceptions and the myth of data driven NLUWalid Saba
 
___ __ Newlanguage evolution ___ BernabeuatLangUE.pdf
___ __ Newlanguage evolution ___  BernabeuatLangUE.pdf___ __ Newlanguage evolution ___  BernabeuatLangUE.pdf
___ __ Newlanguage evolution ___ BernabeuatLangUE.pdftkobelt
 
Corpora, Blogs and Linguistic Variation (Paderborn)
Corpora, Blogs and Linguistic Variation (Paderborn)Corpora, Blogs and Linguistic Variation (Paderborn)
Corpora, Blogs and Linguistic Variation (Paderborn)Cornelius Puschmann
 
ESSLLI2016 DTS Lecture Day 5-2: Proof-theoretic Turn
ESSLLI2016 DTS Lecture Day 5-2: Proof-theoretic TurnESSLLI2016 DTS Lecture Day 5-2: Proof-theoretic Turn
ESSLLI2016 DTS Lecture Day 5-2: Proof-theoretic TurnDaisuke BEKKI
 
Theoretical Issues In Pragmatics And Discourse Analysis
Theoretical Issues In Pragmatics And Discourse AnalysisTheoretical Issues In Pragmatics And Discourse Analysis
Theoretical Issues In Pragmatics And Discourse AnalysisLouis de Saussure
 
Wonderful Critical Essay Example Thatsnotus
Wonderful Critical Essay Example  ThatsnotusWonderful Critical Essay Example  Thatsnotus
Wonderful Critical Essay Example ThatsnotusJaclyn Creedon
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language ProcessingYasir Khan
 
semantics ppt 2.pptx
semantics ppt 2.pptxsemantics ppt 2.pptx
semantics ppt 2.pptxMaya
 
An Outline Of Type-Theoretical Approaches To Lexical Semantics
An Outline Of Type-Theoretical Approaches To Lexical SemanticsAn Outline Of Type-Theoretical Approaches To Lexical Semantics
An Outline Of Type-Theoretical Approaches To Lexical SemanticsTye Rausch
 
Theories related to semantic 2
Theories related to semantic 2Theories related to semantic 2
Theories related to semantic 2Gustina Savhira
 
Adding Semantics to Ontologies
Adding Semantics to OntologiesAdding Semantics to Ontologies
Adding Semantics to OntologiesCristiano Longo
 
II-SDV 2017: Artificial Intelligence is not a Matter of Strength but of Intel...
II-SDV 2017: Artificial Intelligence is not a Matter of Strength but of Intel...II-SDV 2017: Artificial Intelligence is not a Matter of Strength but of Intel...
II-SDV 2017: Artificial Intelligence is not a Matter of Strength but of Intel...Dr. Haxel Consult
 
Language and the body
Language and the bodyLanguage and the body
Language and the bodySabiq Hafidz
 
Knowledge = Information + Context
Knowledge = Information + ContextKnowledge = Information + Context
Knowledge = Information + ContextStefan Gradmann
 
Natural Language Processing with Python
Natural Language Processing with PythonNatural Language Processing with Python
Natural Language Processing with PythonBenjamin Bengfort
 

Similaire à BACK TO THE DRAWING BOARD - The Myth of Data-Driven NLU and How to go Forward (Again) (20)

Deep misconceptions and the myth of data driven NLU
Deep misconceptions and the myth of data driven NLUDeep misconceptions and the myth of data driven NLU
Deep misconceptions and the myth of data driven NLU
 
___ __ Newlanguage evolution ___ BernabeuatLangUE.pdf
___ __ Newlanguage evolution ___  BernabeuatLangUE.pdf___ __ Newlanguage evolution ___  BernabeuatLangUE.pdf
___ __ Newlanguage evolution ___ BernabeuatLangUE.pdf
 
A Bridge Not too Far
A Bridge Not too FarA Bridge Not too Far
A Bridge Not too Far
 
Corpora, Blogs and Linguistic Variation (Paderborn)
Corpora, Blogs and Linguistic Variation (Paderborn)Corpora, Blogs and Linguistic Variation (Paderborn)
Corpora, Blogs and Linguistic Variation (Paderborn)
 
ESSLLI2016 DTS Lecture Day 5-2: Proof-theoretic Turn
ESSLLI2016 DTS Lecture Day 5-2: Proof-theoretic TurnESSLLI2016 DTS Lecture Day 5-2: Proof-theoretic Turn
ESSLLI2016 DTS Lecture Day 5-2: Proof-theoretic Turn
 
Theoretical Issues In Pragmatics And Discourse Analysis
Theoretical Issues In Pragmatics And Discourse AnalysisTheoretical Issues In Pragmatics And Discourse Analysis
Theoretical Issues In Pragmatics And Discourse Analysis
 
Wonderful Critical Essay Example Thatsnotus
Wonderful Critical Essay Example  ThatsnotusWonderful Critical Essay Example  Thatsnotus
Wonderful Critical Essay Example Thatsnotus
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
semantics ppt 2.pptx
semantics ppt 2.pptxsemantics ppt 2.pptx
semantics ppt 2.pptx
 
An Outline Of Type-Theoretical Approaches To Lexical Semantics
An Outline Of Type-Theoretical Approaches To Lexical SemanticsAn Outline Of Type-Theoretical Approaches To Lexical Semantics
An Outline Of Type-Theoretical Approaches To Lexical Semantics
 
Theories related to semantic 2
Theories related to semantic 2Theories related to semantic 2
Theories related to semantic 2
 
Advances In Wsd Acl 2005
Advances In Wsd Acl 2005Advances In Wsd Acl 2005
Advances In Wsd Acl 2005
 
Advances In Wsd Aaai 2005
Advances In Wsd Aaai 2005Advances In Wsd Aaai 2005
Advances In Wsd Aaai 2005
 
Advances In Wsd Aaai 2005
Advances In Wsd Aaai 2005Advances In Wsd Aaai 2005
Advances In Wsd Aaai 2005
 
Meghyn slides-hse-2014
Meghyn slides-hse-2014Meghyn slides-hse-2014
Meghyn slides-hse-2014
 
Adding Semantics to Ontologies
Adding Semantics to OntologiesAdding Semantics to Ontologies
Adding Semantics to Ontologies
 
II-SDV 2017: Artificial Intelligence is not a Matter of Strength but of Intel...
II-SDV 2017: Artificial Intelligence is not a Matter of Strength but of Intel...II-SDV 2017: Artificial Intelligence is not a Matter of Strength but of Intel...
II-SDV 2017: Artificial Intelligence is not a Matter of Strength but of Intel...
 
Language and the body
Language and the bodyLanguage and the body
Language and the body
 
Knowledge = Information + Context
Knowledge = Information + ContextKnowledge = Information + Context
Knowledge = Information + Context
 
Natural Language Processing with Python
Natural Language Processing with PythonNatural Language Processing with Python
Natural Language Processing with Python
 

Dernier

My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 

Dernier (20)

My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 

BACK TO THE DRAWING BOARD - The Myth of Data-Driven NLU and How to go Forward (Again)

  • 1. BACK TO THE DRAWING BOARD The Myth of Data-Driven NLU and How to go Forward (Again) WALID S. SABA, PhD Principal AI Scientist Astound.ai
  • 2. BACK TO THE DRAWING BOARD The Myth of Data-Driven NLU and How to go Forward (Again) WALID S. SABA, PhD Principal AI Scientist Neva.ai SRI International November 2, 2017
  • 3. Every thing in nature, in the inanimate as well as in the animate world, happens according to some rules, though we do not always know them … I reject the contention that an important theoretical difference exists between formal and natural languages … One can assume a theory of the world that is isomorphic to the way we talk about it … in this case, semantics becomes very nearly trivial
  • 4.
  • 5. about the resurgence of and the currently dominant paradigm in ‘AI’ …
  • 6. the availability of huge amounts data, coupled with advances in computer hardware and distributed computing resulted in some advances in certain types of (data-centric) problems (image, speech, fraud detection, text categorization, etc.)
  • 7. But … many problems in AI require understanding that is beyond discovering patterns in data
  • 8. Identifying an adult female in an image might be a data-centric problem
  • 9. However, picking up the photo with the teacher and the photo with the mother requires information that is not in the data Identifying an adult female in an image might be a data-centric problem
  • 11. where’s the musical band? Musician? a person who plays a musical instrument?
  • 12. So what is at issue here?
  • 13. So what is at issue here? The issue here is that there are no musicians, teachers, lawyers, or even mothers! What exists, ontologically (metaphysically), are humans, and the above concepts are logical concepts that are, sometimes, true of a certain human
  • 14. So what is at issue here? The issue here is that there are no musicians, teachers, lawyers, or even mothers! What exists, ontologically (metaphysically), are humans, and the above concepts are logical concepts that are, sometimes, true of a certain human Data-driven (quantitative) approaches can only reason with (detect, infer, recognize) objects that are of an ontological type, but not logical concepts, that form the majority of objects of (human) thought
  • 15. human lawyer dancer ONTOLOGIACL CONCEPTS LOGIACL CONCEPTS teacher mother ... Can a ‘lawyer’ or a ‘dancer’ be identified by data-analysis only?
  • 16. failure to distinguish between logical and ontological concepts is not only a flaw in data-driven approaches logical/formal semantics also failed to provide adequate models for natural language and for exactly the same reason
  • 17. but data-driven approaches are inadequate for natural language understanding for other reasons besides their inability to distinguish between logical and ontological concepts
  • 18. (1) The trophy did not fit in the brown suitcase because it was too a. big b. small (2) Dr. Spok told Jon that he should soon be done a. writing his thesis b. reading his thesis No Statistical Significance in the Data
  • 19. (1) The trophy did not fit in the brown suitcase because it was too a. big b. small (2) Dr. Spok told Jon that he should soon be done a. writing his thesis b. reading his thesis No Statistical Significance in the Data In a data-driven approach it is not clear how such decisions can be made since the only difference between the sentence-pairs in (1) and (2) are words that co-occur with equal probabilities
  • 20. (1) The trophy did not fit in the brown suitcase because it was too a. big b. small (2) Dr. Spok told Jon that he should soon be done a. writing his thesis b. reading his thesis No Statistical Significance in the Data How many labeled examples would be required to learn how to resolve such references?
  • 21. (1) Don’t worry, Simon is [as solid as] a rock. (2) The [person sitting at the] corner table wants another beer. (3) Carlos likes to play [the game] bridge. (4) Mary enjoyed [watching] the movie. (5) Carl owns a [different] house on every street in the village. It’s not even in the data challenges in the computational comprehension of ordinary text is often due to quite a bit of missing text – text which is not explicitly stated but is often assumed as shared knowledge among a community of language users
  • 22. Innate Ontological Structure? The readings in (1a) and (2a) are clearly preferred over the readings in (1b) and (2b), although there are no structural/syntactic rules that speakers of ordinary language seem to be following. What makes the AORs phenomenon even more intriguing is the fact that these preferences are also consistently made across multiple languages.
  • 23. Our grade school teachers once told us that 256 = 16 Reasoning with data only in NL leads to absurdities Our grade school teachers once told us that 256 = 16 But is this (always) correct?
  • 24. ? Reasoning with data only in NL leads to absurdities
  • 25. ? Reasoning with data only in NL leads to absurdities If 16 always equals (and is replaceable by) the square root of 256 then we can alter reality and restate the above as
  • 26. If logical/formal semantics failed, and if data-driven approaches are inadequate, then what?
  • 27. logical semantics failed us (thus far) because they did not differentiate between logical and ontological concepts as such, logical semantics were mere symbol manipulation systems adequate for mathematics, but not for natural language
  • 28. what happens when we do not distinguish between ontological and logical concepts?
  • 29. what happens when we do not distinguish between ontological and logical concepts? CARL GUSTAV HEMPEL
  • 30. Consider the following That is, we have the hypothesis H1 that ‘All ravens are black’. This hypothesis, however, is logically equivalent to the hypothesis H2 that ‘All non-black things are not ravens’.
  • 31. Consider the following That is, we have the hypothesis H1 that ‘All ravens are black’. This hypothesis, however, is logically equivalent to the hypothesis H2 that ‘All non-black things are not ravens’. In FOPL, we have:
  • 32. Observing non-black and non-raven objects confirms hypothesis H2 (that ‘All non-black things are not ravens’). Observing black ravens confirms hypothesis H1, namely that ‘All ravens are black’. H1 H2
  • 33. and here’s the paradox H1 is equivalent to H2, and so observing a red apple now confirms the hypothesis that ‘All ravens are black’ Observing non-black and non-raven objects confirms hypothesis H2 (that ‘All non-black things are not ravens’). Observing black ravens confirms hypothesis H1, namely that ‘All ravens are black’. H1 H2
  • 34. and here’s the solution First, let us make our logic strongly-typed: • in addition to a scope, a variable also has a type. • the type associated with a variable is the most general type for which the predicate makes sense
  • 35. and here’s the solution First, let us make our logic strongly-typed: • in addition to a scope, a variable also has a type. • the type associated with a variable is the most general type for which the predicate makes sense
  • 36. and here’s the solution First, let us make our logic strongly-typed: • in addition to a scope, a variable also has a type. • the type associated with a variable is the most general type for which the predicate makes sense The predicate IMMINENT is applicable to objects of type event (or, it makes sense to say IMMINENT x when x is an event or any of its subtypes)
  • 37. 1. (8x)(raven(x) ¾ black(x)) the traditional representation in FOPL All ravens are black )
  • 38. 1. (8x)(raven(x) ¾ black(x)) 2. (8x)(raven(x :: raven) ¾ black(x :: physical)) types are associated with variables in the context of the predicates in which they are used All ravens are black )
  • 39. 1. (8x)(raven(x) ¾ black(x)) 2. (8x)(raven(x :: raven) ¾ black(x :: physical)) 3. (8x)(raven(x :: raven) ¾ black(x :: (physical ² raven))) x is associated with more than one type in the same scope, requiring a type unification All ravens are black )
  • 40. 1. (8x)(raven(x) ¾ black(x)) 2. (8x)(raven(x :: raven) ¾ black(x :: physical)) 3. (8x)(raven(x :: raven) ¾ black(x :: (physical ² raven))) 4. (8x)(raven(x :: raven) ¾ black(x :: raven)) since raven v physical, the type unification here is simple, resulting in raven All ravens are black )
  • 41. 1. (8x)(raven(x) ¾ black(x)) 2. (8x)(raven(x :: raven) ¾ black(x :: physical)) 3. (8x)(raven(x :: raven) ¾ black(x :: (physical ² raven))) 4. (8x)(raven(x :: raven) ¾ black(x :: raven)) 5. (8x :: raven)(true ¾ black(x)) (i) since x is now associated with one type throughout its scope, the type can be moved to where x was introduced; and (ii) the predicate raven(x :: raven) is replaced by true All ravens are black )
  • 42. 1. (8x)(raven(x) ¾ black(x)) 2. (8x)(raven(x :: raven) ¾ black(x :: physical)) 3. (8x)(raven(x :: raven) ¾ black(x :: (physical ² raven))) 4. (8x)(raven(x :: raven) ¾ black(x :: raven)) 5. (8x :: raven)(true ¾ black(x)) 6. (8x :: raven)(black(x)) All ravens are black ) All ravens are black ) (8x :: raven)(black(x)) ontological concept logical concept
  • 43. 1. (8x)(:black(x) ¾ :raven(x)) the traditional representation in FOPL All non-black things are not ravens )
  • 44. 1. (8x)(:black(x) ¾ :raven(x)) 2. (8x)(:black(x :: physical) ¾ :raven(x :: raven)) types are associated with variables in the context of the predicates in which they are used All non-black things are not ravens )
  • 45. 1. (8x)(:black(x) ¾ :raven(x)) 2. (8x)(:black(x :: physical) ¾ :raven(x :: raven)) 3. (8x)(:black(x :: (physical ² raven)) ¾ :raven(x :: raven)) x is associated with more than one type in the same scope, requiring a type unification All non-black things are not ravens )
  • 46. 1. (8x)(:black(x) ¾ :raven(x)) 2. (8x)(:black(x :: physical) ¾ :raven(x :: raven)) 3. (8x)(:black(x :: (physical ² raven)) ¾ :raven(x :: raven)) 4. (8x)(:black(x :: raven) ¾ :raven(x :: raven)) since raven v physical, the type unification here is simple, resulting in raven All non-black things are not ravens )
  • 47. 1. (8x)(:black(x) ¾ :raven(x)) 2. (8x)(:black(x :: physical) ¾ :raven(x :: raven)) 3. (8x)(:black(x :: (physical ² raven)) ¾ :raven(x :: raven)) 4. (8x)(:black(x :: raven) ¾ :raven(x :: raven)) 5. (8x :: raven)(:black(x) ¾ :true) All non-black things are not ravens ) (i) since x is now associated with one type throughout its scope, the type can be moved to where x was introduced; and (ii) the predicate raven(x :: raven) is replaced by true
  • 48. 1. (8x)(:black(x) ¾ :raven(x)) 2. (8x)(:black(x :: physical) ¾ :raven(x :: raven)) 3. (8x)(:black(x :: (physical ² raven)) ¾ :raven(x :: raven)) 4. (8x)(:black(x :: raven) ¾ :raven(x :: raven)) 5. (8x :: raven)(:black(x) ¾ :true) 6. (8x :: raven)(black(x)) All non-black things are not ravens ) simplifying we get (6) – we used the facts: (P ¾ Q) = (:P _ Q) :true = false (P _ false) = P
  • 49. 1. (8x)(:black(x) ¾ :raven(x)) 2. (8x)(:black(x :: physical) ¾ :raven(x :: raven)) 3. (8x)(:black(x :: (physical ² raven)) ¾ :raven(x :: raven)) 4. (8x)(:black(x :: raven) ¾ :raven(x :: raven)) 5. (8x :: raven)(:black(x) ¾ :true) 6. (8x :: raven)(black(x)) All non-black things are not ravens ) All non-black things are not ravens ) (8x :: raven)(black(x)) ontological concept logical concept
  • 50. All non-black things are not ravens ) (8x :: raven)(black(x)) what paradox of the ravens? All ravens are black ) (8x :: raven)(black(x)) It seems that making a clear distinction between logical and ontological concepts shows that the problem was in the representation and there is no so-called ‘paradox of the ravens’
  • 52. let‘s put the typed ontologic to use
  • 53. let‘s put the typed ontologic to use Type unification is required
  • 54. let‘s put the typed ontologic to use Type unification is required Final representation (we now know Sheba must be a human)
  • 55. let‘s put the typed ontologic to use Two type unifications are required Final representation
  • 56. let‘s put the typed ontologic to use How is it that ‘beautiful’ could be meant to describe Sara or her dancing?
  • 57. let‘s put the typed ontologic to use Two type unifications are required
  • 58. let‘s put the typed ontologic to use Final interpretation Since ‘beautiful’ can be said of activities as well as humans, the sentence remains ambiguous. But how about ‘Sara is a tall dancer’ ?
  • 59. let‘s put the typed ontologic to use Two type unifications are required
  • 60. let‘s put the typed ontologic to use Since we finally get Unlike the case of ‘Sara is a beautiful dancer’, in this case type unification rejected the possibility of a ‘tall dancing’
  • 61. Adjective-ordering restrictions John owns a beautiful red car John owns a red beautiful car
  • 62. Adjective-ordering restrictions John owns a beautiful red car John owns a red beautiful car Downcasting is undecidable so (a) is acceptable but (b) is not
  • 63. Adjective-ordering restrictions John owns a beautiful red car John owns a red beautiful car The second order requires us to go up, and then down again!
  • 64. Adjective-ordering restrictions John owns a red and beautiful car ) (John owns a red car) and (John owns a beautiful car) We can, though we rarely do, say two separate statements!
  • 65. Lexical disambiguation Type unifications relevant to ‘party’ Type unification will remove the ‘political group’ meaning of ‘party’
  • 67. Discovering ‘missing text’ - metonymy x, introduced as a HamSandwich is also considered to be an object of type Human in the context of a want relation
  • 68. Discovering ‘missing text’ - metonymy The unification of two types that are not on the same path is some salient relationship between the two types
  • 69. thank you Data-driven approaches to NLU are inadequate for several reasons, most important of which is the fact that understanding ordinary spoken language involves discovering the missing text that is not even explicitly stated in the data Logical/formal semantics treated logical and ontological concepts the same, as a consequence, they were for the most part symbol manipulation systems devoid of any ontological content Typed logical semantics (embedded with ontological types) seem to suggest very simple solutions to many challenging problems in language understanding