Lesson 26

Avijit Kumar
Avijit KumarHTML 5 Programmer à Mahiti Infotech Pvt. Ltd.
Module
                10
       Reasoning with
          Uncertainty -
Probabilistic reasoning

              Version 2 CSE IIT,Kharagpur
10.1 Instructional Objective
•   The students should understand the role of uncertainty in knowledge representation
•   Students should learn the use of probability theory to represent uncertainty
•   Students should understand the basic of probability theory, including
       o Probability distributions
       o Joint probability
       o Marginal probability
       o Conditional probability
       o Independence
       o Conditional independence
•   Should learn inference mechanisms in probability theory including
       o Bayes rule
       o Product rule
•   Should be able to convert natural language statements into probabilistic statements
    and apply inference rules
•   Students should understand Bayesian networks as a data structure to represent
    conditional independence
•   Should understand the syntax and semantics of Bayes net
•   Should understand inferencing mechanisms in Bayes net
•   Should understand efficient inferencing techniques like variable ordering
•   Should understand the concept of d-separation
•   Should understand inference mechanism for the special case of polytrees
•   Students should have idea about approximate inference techniques in Bayesian
    networks

At the end of this lesson the student should be able to do the following:
    • Represent a problem in terms of probabilistic statemenst
    • Apply Bayes rule and product rule for inferencing
    • Represent a problem using Bayes net
    • Perform probabilistic inferencing using Bayes net.




                                                             Version 2 CSE IIT,Kharagpur
Lesson
                26
      Reasoning with
Uncertain information
             Version 2 CSE IIT,Kharagpur
10. 2 Probabilistic Reasoning
Using logic to represent and reason we can represent knowledge about the world with
facts and rules, like the following ones:

bird(tweety).
fly(X) :- bird(X).

We can also use a theorem-prover to reason about the world and deduct new facts about
the world, for e.g.,
?- fly(tweety).
Yes

However, this often does not work outside of toy domains - non-tautologous certain
rules are hard to find.

A way to handle knowledge representation in real problems is to extend logic by using
certainty factors.

In other words, replace
IF condition THEN fact
with
IF condition with certainty x THEN fact with certainty f(x)

Unfortunately cannot really adapt logical inference to probabilistic inference, since the
latter is not context-free.

Replacing rules with conditional probabilities makes inferencing simpler.

Replace
smoking -> lung cancer
or
lotsofconditions, smoking -> lung cancer
with
P(lung cancer | smoking) = 0.6

Uncertainty is represented explicitly and quantitatively within probability theory, a
formalism that has been developed over centuries.

A probabilistic model describes the world in terms of a set S of possible states - the
sample space. We don’t know the true state of the world, so we (somehow) come up with
a probability distribution over S which gives the probability of any state being the true
one. The world usually described by a set of variables or attributes.

Consider the probabilistic model of a fictitious medical expert system. The ‘world’ is
described by 8 binary valued variables:

                                                              Version 2 CSE IIT,Kharagpur
Visit to Asia? A
Tuberculosis? T
Either tub. or lung cancer? E
Lung cancer? L
Smoking? S
Bronchitis? B
Dyspnoea? D
Positive X-ray? X

We have 28 = 256 possible states or configurations and so 256 probabilities to find.

10.3 Review of Probability Theory
The primitives in probabilistic reasoning are random variables. Just like primitives in
Propositional Logic are propositions. A random variable is not in fact a variable, but a
function from a sample space S to another space, often the real numbers.

For example, let the random variable Sum (representing outcome of two die throws) be
defined thus:
Sum(die1, die2) = die1 +die2

Each random variable has an associated probability distribution determined by the
underlying distribution on the sample space

Continuing our example : P(Sum = 2) = 1/36,
P(Sum = 3) = 2/36, . . . , P(Sum = 12) = 1/36

Consdier the probabilistic model of the fictitious medical expert system mentioned
before. The sample space is described by 8 binary valued variables.

Visit to Asia? A
Tuberculosis? T
Either tub. or lung cancer? E
Lung cancer? L
Smoking? S
Bronchitis? B
Dyspnoea? D
Positive X-ray? X

There are 28 = 256 events in the sample space. Each event is determined by a joint
instantiation of all of the variables.

S = {(A = f, T = f,E = f,L = f, S = f,B = f,D = f,X = f),
(A = f, T = f,E = f,L = f, S = f,B = f,D = f,X = t), . . .
(A = t, T = t,E = t,L = t, S = t,B = t,D = t,X = t)}

                                                             Version 2 CSE IIT,Kharagpur
Since S is defined in terms of joint instantations, any distribution defined on it is called a
joint distribution. ll underlying distributions will be joint distributions in this module. The
variables {A,T,E, L,S,B,D,X} are in fact random variables, which ‘project’ values.

L(A = f, T = f,E = f,L = f, S = f,B = f,D = f,X = f) = f
L(A = f, T = f,E = f,L = f, S = f,B = f,D = f,X = t) = f
L(A = t, T = t,E = t,L = t, S = t,B = t,D = t,X = t) = t

Each of the random variables {A,T,E,L,S,B,D,X} has its own distribution, determined by
the underlying joint distribution. This is known as the margin distribution. For example,
the distribution for L is denoted P(L), and this distribution is defined by the two
probabilities P(L = f) and P(L = t). For example,

P(L = f)
= P(A = f, T = f,E = f,L = f, S = f,B = f,D = f,X = f)
+ P(A = f, T = f,E = f,L = f, S = f,B = f,D = f,X = t)
+ P(A = f, T = f,E = f,L = f, S = f,B = f,D = t,X = f)
...
P(A = t, T = t,E = t,L = f, S = t,B = t,D = t,X = t)

P(L) is an example of a marginal distribution.

Here’s a joint distribution over two binary value variables A and B




We get the marginal distribution over B by simply adding up the different possible values
of A for any value of B (and put the result in the “margin”).




In general, given a joint distribution over a set of variables, we can get the marginal
distribution over a subset by simply summing out those variables not in the subset.


                                                               Version 2 CSE IIT,Kharagpur
In the medical expert system case, we can get the marginal distribution over, say, A,D by
simply summing out the other variables:




However, computing marginals is not an easy task always. For example,
P(A = t,D = f)
= P(A = t, T = f,E = f,L = f, S = f,B = f,D = f,X = f)
+ P(A = t, T = f,E = f,L = f, S = f,B = f,D = f,X = t)
+ P(A = t, T = f,E = f,L = f, S = f,B = t,D = f,X = f)
+ P(A = t, T = f,E = f,L = f, S = f,B = t,D = f,X = t)
...
P(A = t, T = t,E = t,L = t, S = t,B = t,D = f,X = t)

This has 64 summands! Each of whose value needs to be estimated from empirical data.
For the estimates to be of good quality, each of the instances that appear in the summands
should appear sufficiently large number of times in the empirical data. Often such a large
amount of data is not available.

However, computation can be simplified for certain special but common conditions. This
is the condition of independence of variables.

Two random variables A and B are independent iff

P(A,B) = P(A)P(B)

i.e. can get the joint from the marginals

This is quite a strong statement: It means for any value x of A and any value y of B

P(A = x,B = y) = P(A = x)P(B = y)

Note that the independence of two random variables is a property of a the underlying



probability distribution. We can have

Conditional probability is defined as:




It means for any value x of A and any value y of B

                                                            Version 2 CSE IIT,Kharagpur
If A and B are independent then



Conditional probabilities can represent causal relationships in both directions.

From cause to (probable) effects




From effect to (probable) cause




                                                             Version 2 CSE IIT,Kharagpur

Recommandé

8709508 par
87095088709508
8709508Fahrul Cool
3.5K vues42 diapositives
Fuzzy logic Notes AI CSE 8th Sem par
Fuzzy logic Notes AI CSE 8th SemFuzzy logic Notes AI CSE 8th Sem
Fuzzy logic Notes AI CSE 8th SemDigiGurukul
5.1K vues98 diapositives
Correspondence and Isomorphism Theorems for Intuitionistic fuzzy subgroups par
Correspondence and Isomorphism Theorems for Intuitionistic fuzzy subgroupsCorrespondence and Isomorphism Theorems for Intuitionistic fuzzy subgroups
Correspondence and Isomorphism Theorems for Intuitionistic fuzzy subgroupsjournal ijrtem
124 vues9 diapositives
Fuzzy Logic par
Fuzzy LogicFuzzy Logic
Fuzzy LogicRishikese MR
3.8K vues26 diapositives
Theory of computing par
Theory of computingTheory of computing
Theory of computingBipul Roy Bpl
3K vues15 diapositives
Theory of Computation "Chapter 1, introduction" par
Theory of Computation "Chapter 1, introduction"Theory of Computation "Chapter 1, introduction"
Theory of Computation "Chapter 1, introduction"Ra'Fat Al-Msie'deen
2.7K vues111 diapositives

Contenu connexe

Tendances

Ev4301897903 par
Ev4301897903Ev4301897903
Ev4301897903IJERA Editor
228 vues7 diapositives
A Method for Solving Balanced Intuitionistic Fuzzy Assignment Problem par
A  Method  for  Solving  Balanced  Intuitionistic  Fuzzy  Assignment  Problem A  Method  for  Solving  Balanced  Intuitionistic  Fuzzy  Assignment  Problem
A Method for Solving Balanced Intuitionistic Fuzzy Assignment Problem Navodaya Institute of Technology
710 vues7 diapositives
Classical Sets & fuzzy sets par
Classical Sets & fuzzy setsClassical Sets & fuzzy sets
Classical Sets & fuzzy setsDr.Ashvini Chaudhari Bhongade
40.1K vues35 diapositives
Fuzzy logic and application in AI par
Fuzzy logic and application in AIFuzzy logic and application in AI
Fuzzy logic and application in AIIldar Nurgaliev
9.4K vues46 diapositives
Fuzzy logic and fuzzy time series edited par
Fuzzy logic and fuzzy time series   editedFuzzy logic and fuzzy time series   edited
Fuzzy logic and fuzzy time series editedProf Dr S.M.Aqil Burney
2.2K vues84 diapositives
Fuzzy Set Theory par
Fuzzy Set TheoryFuzzy Set Theory
Fuzzy Set TheoryAMIT KUMAR
60.9K vues45 diapositives

Tendances(20)

Fuzzy Set Theory par AMIT KUMAR
Fuzzy Set TheoryFuzzy Set Theory
Fuzzy Set Theory
AMIT KUMAR60.9K vues
Handling missing data with expectation maximization algorithm par Loc Nguyen
Handling missing data with expectation maximization algorithmHandling missing data with expectation maximization algorithm
Handling missing data with expectation maximization algorithm
Loc Nguyen208 vues
Random Variables for discrete case par AliZarAwan
Random Variables for discrete caseRandom Variables for discrete case
Random Variables for discrete case
AliZarAwan137 vues
Mc0079 computer based optimization methods--phpapp02 par Rabby Bhatt
Mc0079 computer based optimization methods--phpapp02Mc0079 computer based optimization methods--phpapp02
Mc0079 computer based optimization methods--phpapp02
Rabby Bhatt2.3K vues
Master of Computer Application (MCA) – Semester 4 MC0079 par Aravind NC
Master of Computer Application (MCA) – Semester 4  MC0079Master of Computer Application (MCA) – Semester 4  MC0079
Master of Computer Application (MCA) – Semester 4 MC0079
Aravind NC2.2K vues
Fuzzy relations par naugariya
Fuzzy relationsFuzzy relations
Fuzzy relations
naugariya32.4K vues
Bc0052 theory of computer science par smumbahelp
Bc0052   theory of computer scienceBc0052   theory of computer science
Bc0052 theory of computer science
smumbahelp87 vues

En vedette

Lesson 22 par
Lesson 22Lesson 22
Lesson 22Avijit Kumar
233 vues9 diapositives
Lesson 28 par
Lesson 28Lesson 28
Lesson 28Avijit Kumar
159 vues10 diapositives
Lesson 24 par
Lesson 24Lesson 24
Lesson 24Avijit Kumar
169 vues7 diapositives
Lesson 25 par
Lesson 25Lesson 25
Lesson 25Avijit Kumar
224 vues15 diapositives
Lesson 21 par
Lesson 21Lesson 21
Lesson 21Avijit Kumar
255 vues8 diapositives
Lesson 20 par
Lesson 20Lesson 20
Lesson 20Avijit Kumar
159 vues7 diapositives

Similaire à Lesson 26

Fuzzy logic andits Applications par
Fuzzy logic andits ApplicationsFuzzy logic andits Applications
Fuzzy logic andits ApplicationsDrATAMILARASIMCA
215 vues93 diapositives
Probability cheatsheet par
Probability cheatsheetProbability cheatsheet
Probability cheatsheetJoachim Gwoke
148 vues8 diapositives
Statistics (1): estimation, Chapter 1: Models par
Statistics (1): estimation, Chapter 1: ModelsStatistics (1): estimation, Chapter 1: Models
Statistics (1): estimation, Chapter 1: ModelsChristian Robert
2.5K vues44 diapositives
A PROBABILISTIC ALGORITHM OF COMPUTING THE POLYNOMIAL GREATEST COMMON DIVISOR... par
A PROBABILISTIC ALGORITHM OF COMPUTING THE POLYNOMIAL GREATEST COMMON DIVISOR...A PROBABILISTIC ALGORITHM OF COMPUTING THE POLYNOMIAL GREATEST COMMON DIVISOR...
A PROBABILISTIC ALGORITHM OF COMPUTING THE POLYNOMIAL GREATEST COMMON DIVISOR...ijscmcj
9 vues15 diapositives
Boolean Algebra.pdf par
Boolean  Algebra.pdfBoolean  Algebra.pdf
Boolean Algebra.pdfSTUDY INNOVATIONS
8 vues11 diapositives
Numerical_PDE_Paper par
Numerical_PDE_PaperNumerical_PDE_Paper
Numerical_PDE_PaperWilliam Ruys
1K vues26 diapositives

Similaire à Lesson 26(20)

Statistics (1): estimation, Chapter 1: Models par Christian Robert
Statistics (1): estimation, Chapter 1: ModelsStatistics (1): estimation, Chapter 1: Models
Statistics (1): estimation, Chapter 1: Models
Christian Robert2.5K vues
A PROBABILISTIC ALGORITHM OF COMPUTING THE POLYNOMIAL GREATEST COMMON DIVISOR... par ijscmcj
A PROBABILISTIC ALGORITHM OF COMPUTING THE POLYNOMIAL GREATEST COMMON DIVISOR...A PROBABILISTIC ALGORITHM OF COMPUTING THE POLYNOMIAL GREATEST COMMON DIVISOR...
A PROBABILISTIC ALGORITHM OF COMPUTING THE POLYNOMIAL GREATEST COMMON DIVISOR...
ijscmcj9 vues
Extreme bound analysis based on correlation coefficient for optimal regressio... par Loc Nguyen
Extreme bound analysis based on correlation coefficient for optimal regressio...Extreme bound analysis based on correlation coefficient for optimal regressio...
Extreme bound analysis based on correlation coefficient for optimal regressio...
Loc Nguyen3 vues
Use of the correlation coefficient as a measure of effectiveness of a scoring... par Wajih Alaiyan
Use of the correlation coefficient as a measure of effectiveness of a scoring...Use of the correlation coefficient as a measure of effectiveness of a scoring...
Use of the correlation coefficient as a measure of effectiveness of a scoring...
Wajih Alaiyan222 vues
project par Vishnu V
projectproject
project
Vishnu V4.7K vues
The Fundamental theorem of calculus par AhsanIrshad8
The Fundamental theorem of calculus The Fundamental theorem of calculus
The Fundamental theorem of calculus
AhsanIrshad8146 vues

Plus de Avijit Kumar

Lesson 27 par
Lesson 27Lesson 27
Lesson 27Avijit Kumar
132 vues5 diapositives
Lesson 29 par
Lesson 29Lesson 29
Lesson 29Avijit Kumar
226 vues15 diapositives
Lesson 30 par
Lesson 30Lesson 30
Lesson 30Avijit Kumar
208 vues5 diapositives
Lesson 31 par
Lesson 31Lesson 31
Lesson 31Avijit Kumar
146 vues6 diapositives
Lesson 32 par
Lesson 32Lesson 32
Lesson 32Avijit Kumar
300 vues11 diapositives
Lesson 33 par
Lesson 33Lesson 33
Lesson 33Avijit Kumar
177 vues9 diapositives

Dernier

CloudStack and GitOps at Enterprise Scale - Alex Dometrius, Rene Glover - AT&T par
CloudStack and GitOps at Enterprise Scale - Alex Dometrius, Rene Glover - AT&TCloudStack and GitOps at Enterprise Scale - Alex Dometrius, Rene Glover - AT&T
CloudStack and GitOps at Enterprise Scale - Alex Dometrius, Rene Glover - AT&TShapeBlue
81 vues34 diapositives
CloudStack Managed User Data and Demo - Harikrishna Patnala - ShapeBlue par
CloudStack Managed User Data and Demo - Harikrishna Patnala - ShapeBlueCloudStack Managed User Data and Demo - Harikrishna Patnala - ShapeBlue
CloudStack Managed User Data and Demo - Harikrishna Patnala - ShapeBlueShapeBlue
68 vues13 diapositives
The Power of Heat Decarbonisation Plans in the Built Environment par
The Power of Heat Decarbonisation Plans in the Built EnvironmentThe Power of Heat Decarbonisation Plans in the Built Environment
The Power of Heat Decarbonisation Plans in the Built EnvironmentIES VE
67 vues20 diapositives
Why and How CloudStack at weSystems - Stephan Bienek - weSystems par
Why and How CloudStack at weSystems - Stephan Bienek - weSystemsWhy and How CloudStack at weSystems - Stephan Bienek - weSystems
Why and How CloudStack at weSystems - Stephan Bienek - weSystemsShapeBlue
172 vues13 diapositives
GDG Cloud Southlake 28 Brad Taylor and Shawn Augenstein Old Problems in the N... par
GDG Cloud Southlake 28 Brad Taylor and Shawn Augenstein Old Problems in the N...GDG Cloud Southlake 28 Brad Taylor and Shawn Augenstein Old Problems in the N...
GDG Cloud Southlake 28 Brad Taylor and Shawn Augenstein Old Problems in the N...James Anderson
142 vues32 diapositives
Igniting Next Level Productivity with AI-Infused Data Integration Workflows par
Igniting Next Level Productivity with AI-Infused Data Integration Workflows Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows Safe Software
373 vues86 diapositives

Dernier(20)

CloudStack and GitOps at Enterprise Scale - Alex Dometrius, Rene Glover - AT&T par ShapeBlue
CloudStack and GitOps at Enterprise Scale - Alex Dometrius, Rene Glover - AT&TCloudStack and GitOps at Enterprise Scale - Alex Dometrius, Rene Glover - AT&T
CloudStack and GitOps at Enterprise Scale - Alex Dometrius, Rene Glover - AT&T
ShapeBlue81 vues
CloudStack Managed User Data and Demo - Harikrishna Patnala - ShapeBlue par ShapeBlue
CloudStack Managed User Data and Demo - Harikrishna Patnala - ShapeBlueCloudStack Managed User Data and Demo - Harikrishna Patnala - ShapeBlue
CloudStack Managed User Data and Demo - Harikrishna Patnala - ShapeBlue
ShapeBlue68 vues
The Power of Heat Decarbonisation Plans in the Built Environment par IES VE
The Power of Heat Decarbonisation Plans in the Built EnvironmentThe Power of Heat Decarbonisation Plans in the Built Environment
The Power of Heat Decarbonisation Plans in the Built Environment
IES VE67 vues
Why and How CloudStack at weSystems - Stephan Bienek - weSystems par ShapeBlue
Why and How CloudStack at weSystems - Stephan Bienek - weSystemsWhy and How CloudStack at weSystems - Stephan Bienek - weSystems
Why and How CloudStack at weSystems - Stephan Bienek - weSystems
ShapeBlue172 vues
GDG Cloud Southlake 28 Brad Taylor and Shawn Augenstein Old Problems in the N... par James Anderson
GDG Cloud Southlake 28 Brad Taylor and Shawn Augenstein Old Problems in the N...GDG Cloud Southlake 28 Brad Taylor and Shawn Augenstein Old Problems in the N...
GDG Cloud Southlake 28 Brad Taylor and Shawn Augenstein Old Problems in the N...
James Anderson142 vues
Igniting Next Level Productivity with AI-Infused Data Integration Workflows par Safe Software
Igniting Next Level Productivity with AI-Infused Data Integration Workflows Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Safe Software373 vues
Digital Personal Data Protection (DPDP) Practical Approach For CISOs par Priyanka Aash
Digital Personal Data Protection (DPDP) Practical Approach For CISOsDigital Personal Data Protection (DPDP) Practical Approach For CISOs
Digital Personal Data Protection (DPDP) Practical Approach For CISOs
Priyanka Aash103 vues
State of the Union - Rohit Yadav - Apache CloudStack par ShapeBlue
State of the Union - Rohit Yadav - Apache CloudStackState of the Union - Rohit Yadav - Apache CloudStack
State of the Union - Rohit Yadav - Apache CloudStack
ShapeBlue218 vues
"Surviving highload with Node.js", Andrii Shumada par Fwdays
"Surviving highload with Node.js", Andrii Shumada "Surviving highload with Node.js", Andrii Shumada
"Surviving highload with Node.js", Andrii Shumada
Fwdays49 vues
Backup and Disaster Recovery with CloudStack and StorPool - Workshop - Venko ... par ShapeBlue
Backup and Disaster Recovery with CloudStack and StorPool - Workshop - Venko ...Backup and Disaster Recovery with CloudStack and StorPool - Workshop - Venko ...
Backup and Disaster Recovery with CloudStack and StorPool - Workshop - Venko ...
ShapeBlue114 vues
Live Demo Showcase: Unveiling Dell PowerFlex’s IaaS Capabilities with Apache ... par ShapeBlue
Live Demo Showcase: Unveiling Dell PowerFlex’s IaaS Capabilities with Apache ...Live Demo Showcase: Unveiling Dell PowerFlex’s IaaS Capabilities with Apache ...
Live Demo Showcase: Unveiling Dell PowerFlex’s IaaS Capabilities with Apache ...
ShapeBlue52 vues
DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti... par ShapeBlue
DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti...DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti...
DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti...
ShapeBlue69 vues
2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue par ShapeBlue
2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue
2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue
ShapeBlue75 vues
Elevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlue par ShapeBlue
Elevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlueElevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlue
Elevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlue
ShapeBlue149 vues
Setting Up Your First CloudStack Environment with Beginners Challenges - MD R... par ShapeBlue
Setting Up Your First CloudStack Environment with Beginners Challenges - MD R...Setting Up Your First CloudStack Environment with Beginners Challenges - MD R...
Setting Up Your First CloudStack Environment with Beginners Challenges - MD R...
ShapeBlue105 vues
Data Integrity for Banking and Financial Services par Precisely
Data Integrity for Banking and Financial ServicesData Integrity for Banking and Financial Services
Data Integrity for Banking and Financial Services
Precisely76 vues

Lesson 26

  • 1. Module 10 Reasoning with Uncertainty - Probabilistic reasoning Version 2 CSE IIT,Kharagpur
  • 2. 10.1 Instructional Objective • The students should understand the role of uncertainty in knowledge representation • Students should learn the use of probability theory to represent uncertainty • Students should understand the basic of probability theory, including o Probability distributions o Joint probability o Marginal probability o Conditional probability o Independence o Conditional independence • Should learn inference mechanisms in probability theory including o Bayes rule o Product rule • Should be able to convert natural language statements into probabilistic statements and apply inference rules • Students should understand Bayesian networks as a data structure to represent conditional independence • Should understand the syntax and semantics of Bayes net • Should understand inferencing mechanisms in Bayes net • Should understand efficient inferencing techniques like variable ordering • Should understand the concept of d-separation • Should understand inference mechanism for the special case of polytrees • Students should have idea about approximate inference techniques in Bayesian networks At the end of this lesson the student should be able to do the following: • Represent a problem in terms of probabilistic statemenst • Apply Bayes rule and product rule for inferencing • Represent a problem using Bayes net • Perform probabilistic inferencing using Bayes net. Version 2 CSE IIT,Kharagpur
  • 3. Lesson 26 Reasoning with Uncertain information Version 2 CSE IIT,Kharagpur
  • 4. 10. 2 Probabilistic Reasoning Using logic to represent and reason we can represent knowledge about the world with facts and rules, like the following ones: bird(tweety). fly(X) :- bird(X). We can also use a theorem-prover to reason about the world and deduct new facts about the world, for e.g., ?- fly(tweety). Yes However, this often does not work outside of toy domains - non-tautologous certain rules are hard to find. A way to handle knowledge representation in real problems is to extend logic by using certainty factors. In other words, replace IF condition THEN fact with IF condition with certainty x THEN fact with certainty f(x) Unfortunately cannot really adapt logical inference to probabilistic inference, since the latter is not context-free. Replacing rules with conditional probabilities makes inferencing simpler. Replace smoking -> lung cancer or lotsofconditions, smoking -> lung cancer with P(lung cancer | smoking) = 0.6 Uncertainty is represented explicitly and quantitatively within probability theory, a formalism that has been developed over centuries. A probabilistic model describes the world in terms of a set S of possible states - the sample space. We don’t know the true state of the world, so we (somehow) come up with a probability distribution over S which gives the probability of any state being the true one. The world usually described by a set of variables or attributes. Consider the probabilistic model of a fictitious medical expert system. The ‘world’ is described by 8 binary valued variables: Version 2 CSE IIT,Kharagpur
  • 5. Visit to Asia? A Tuberculosis? T Either tub. or lung cancer? E Lung cancer? L Smoking? S Bronchitis? B Dyspnoea? D Positive X-ray? X We have 28 = 256 possible states or configurations and so 256 probabilities to find. 10.3 Review of Probability Theory The primitives in probabilistic reasoning are random variables. Just like primitives in Propositional Logic are propositions. A random variable is not in fact a variable, but a function from a sample space S to another space, often the real numbers. For example, let the random variable Sum (representing outcome of two die throws) be defined thus: Sum(die1, die2) = die1 +die2 Each random variable has an associated probability distribution determined by the underlying distribution on the sample space Continuing our example : P(Sum = 2) = 1/36, P(Sum = 3) = 2/36, . . . , P(Sum = 12) = 1/36 Consdier the probabilistic model of the fictitious medical expert system mentioned before. The sample space is described by 8 binary valued variables. Visit to Asia? A Tuberculosis? T Either tub. or lung cancer? E Lung cancer? L Smoking? S Bronchitis? B Dyspnoea? D Positive X-ray? X There are 28 = 256 events in the sample space. Each event is determined by a joint instantiation of all of the variables. S = {(A = f, T = f,E = f,L = f, S = f,B = f,D = f,X = f), (A = f, T = f,E = f,L = f, S = f,B = f,D = f,X = t), . . . (A = t, T = t,E = t,L = t, S = t,B = t,D = t,X = t)} Version 2 CSE IIT,Kharagpur
  • 6. Since S is defined in terms of joint instantations, any distribution defined on it is called a joint distribution. ll underlying distributions will be joint distributions in this module. The variables {A,T,E, L,S,B,D,X} are in fact random variables, which ‘project’ values. L(A = f, T = f,E = f,L = f, S = f,B = f,D = f,X = f) = f L(A = f, T = f,E = f,L = f, S = f,B = f,D = f,X = t) = f L(A = t, T = t,E = t,L = t, S = t,B = t,D = t,X = t) = t Each of the random variables {A,T,E,L,S,B,D,X} has its own distribution, determined by the underlying joint distribution. This is known as the margin distribution. For example, the distribution for L is denoted P(L), and this distribution is defined by the two probabilities P(L = f) and P(L = t). For example, P(L = f) = P(A = f, T = f,E = f,L = f, S = f,B = f,D = f,X = f) + P(A = f, T = f,E = f,L = f, S = f,B = f,D = f,X = t) + P(A = f, T = f,E = f,L = f, S = f,B = f,D = t,X = f) ... P(A = t, T = t,E = t,L = f, S = t,B = t,D = t,X = t) P(L) is an example of a marginal distribution. Here’s a joint distribution over two binary value variables A and B We get the marginal distribution over B by simply adding up the different possible values of A for any value of B (and put the result in the “margin”). In general, given a joint distribution over a set of variables, we can get the marginal distribution over a subset by simply summing out those variables not in the subset. Version 2 CSE IIT,Kharagpur
  • 7. In the medical expert system case, we can get the marginal distribution over, say, A,D by simply summing out the other variables: However, computing marginals is not an easy task always. For example, P(A = t,D = f) = P(A = t, T = f,E = f,L = f, S = f,B = f,D = f,X = f) + P(A = t, T = f,E = f,L = f, S = f,B = f,D = f,X = t) + P(A = t, T = f,E = f,L = f, S = f,B = t,D = f,X = f) + P(A = t, T = f,E = f,L = f, S = f,B = t,D = f,X = t) ... P(A = t, T = t,E = t,L = t, S = t,B = t,D = f,X = t) This has 64 summands! Each of whose value needs to be estimated from empirical data. For the estimates to be of good quality, each of the instances that appear in the summands should appear sufficiently large number of times in the empirical data. Often such a large amount of data is not available. However, computation can be simplified for certain special but common conditions. This is the condition of independence of variables. Two random variables A and B are independent iff P(A,B) = P(A)P(B) i.e. can get the joint from the marginals This is quite a strong statement: It means for any value x of A and any value y of B P(A = x,B = y) = P(A = x)P(B = y) Note that the independence of two random variables is a property of a the underlying probability distribution. We can have Conditional probability is defined as: It means for any value x of A and any value y of B Version 2 CSE IIT,Kharagpur
  • 8. If A and B are independent then Conditional probabilities can represent causal relationships in both directions. From cause to (probable) effects From effect to (probable) cause Version 2 CSE IIT,Kharagpur