Belief Networks I (Lecture 23)
(Chapter 15.1-2)
Artificial Intelligence I
Autumn 2001
Henry Kautz
Outline
Conditional independence
Bayesian networks: syntax and semantics
Exact inference
Approximate inference
Independence
Two random variables $A$ and $B$ are (absolutely) independent iff
$P(A, B) = P(A)\,P(B)$
or
$P(A \mid B) = P(A)$
e.g., $A$ and $B$ are two coin tosses
If $n$ Boolean variables are independent, the full joint is
$P(X_1, \ldots, X_n) = \prod_i P(X_i)$
hence can be specified by just $n$ numbers
Absolute independence is a very strong requirement, seldom met
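A runnable sketch of absolute independence (the coin probabilities are illustrative, not from the slides): for two independent coin tosses $A$ and $B$, the full joint is recovered from the marginals alone, so $n$ independent Boolean variables need only $n$ numbers instead of $2^n - 1$.

```python
from itertools import product

p_a = {True: 0.5, False: 0.5}   # P(A), one number suffices
p_b = {True: 0.5, False: 0.5}   # P(B)

# Full joint recovered from the marginals: P(A, B) = P(A) * P(B)
joint = {(a, b): p_a[a] * p_b[b] for a, b in product([True, False], repeat=2)}

assert abs(sum(joint.values()) - 1.0) < 1e-12   # a proper distribution
assert joint[(True, False)] == p_a[True] * p_b[False]
```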
Conditional independence
Consider the dentist problem with three random variables: $Toothache$, $Cavity$, $Catch$ (steel probe catches in my tooth)
The full joint distribution has $2^3 - 1 = 7$ independent entries
If I have a cavity, the probability that the probe catches in it
doesn't depend on whether I have a toothache:
(1) $P(Catch \mid Toothache, Cavity) = P(Catch \mid Cavity)$
i.e., $Catch$ is conditionally independent of $Toothache$ given $Cavity$
The same independence holds if I haven't got a cavity:
(2) $P(Catch \mid Toothache, \lnot Cavity) = P(Catch \mid \lnot Cavity)$
Conditional independence contd.
Product rule:
$P(Toothache, Catch, Cavity) = P(Toothache \mid Catch, Cavity)\,P(Catch \mid Cavity)\,P(Cavity)$
Conditional independence:
$= P(Toothache \mid Cavity)\,P(Catch \mid Cavity)\,P(Cavity)$
The full joint distribution now requires only 5 independent numbers (instead of 7)
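The 5-number bookkeeping can be made concrete (CPT values here are assumed for illustration): $P(Cavity)$ is 1 number, and $P(Toothache \mid Cavity)$ and $P(Catch \mid Cavity)$ are 2 numbers each, yet they determine all 8 joint entries.

```python
from itertools import product

p_cavity = 0.2                            # 1 number
p_toothache = {True: 0.6, False: 0.1}     # 2 numbers: P(toothache | cavity), P(toothache | ¬cavity)
p_catch = {True: 0.9, False: 0.2}         # 2 numbers: P(catch | cavity), P(catch | ¬cavity)

def joint(toothache, catch, cavity):
    """P(t, c, cav) = P(t | cav) P(c | cav) P(cav) by conditional independence."""
    p = p_cavity if cavity else 1 - p_cavity
    p *= p_toothache[cavity] if toothache else 1 - p_toothache[cavity]
    p *= p_catch[cavity] if catch else 1 - p_catch[cavity]
    return p

# The 5 numbers induce a proper distribution over all 8 worlds
total = sum(joint(t, c, v) for t, c, v in product([True, False], repeat=3))
assert abs(total - 1.0) < 1e-12
```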
Belief networks
A simple, graphical notation for conditional independence assertions
and hence for compact specification of full joint distributions
Syntax:
a set of nodes, one per variable
a directed, acyclic graph (link $\approx$ ``directly influences'')
a conditional distribution for each node given its parents: $P(X_i \mid Parents(X_i))$
In the simplest case, conditional distribution represented as
a conditional probability table (CPT)
Example
I'm at work, neighbor John calls to say my alarm is ringing, but neighbor Mary doesn't call. Sometimes it's set off by minor earthquakes. Is there a burglar?
Variables: $Burglary$, $Earthquake$, $Alarm$, $JohnCalls$, $MaryCalls$
Network topology reflects ``causal'' knowledge:
[Figure: burglary network topology with CPTs]
Note: if each of the $n$ variables has at most $k$ parents, the network requires $O(n \cdot 2^k)$ numbers vs. $O(2^n)$ for the full joint
Semantics
``Global'' semantics defines the full joint distribution as
the product of the local conditional distributions:
$P(x_1, \ldots, x_n) = \prod_{i=1}^{n} P(x_i \mid parents(X_i))$
``Local'' semantics: each node is conditionally independent
of its nondescendants given its parents
Theorem: Local semantics $\Leftrightarrow$ global semantics
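The global semantics can be exercised directly. The CPT values below are the course textbook's burglary-network numbers (the figure carrying them did not survive conversion, so treat them as assumed here); any entry of the full joint is a product of one CPT entry per node.

```python
# Textbook burglary-network CPTs (assumed values; figure lost in conversion)
P_B = 0.001                      # P(Burglary)
P_E = 0.002                      # P(Earthquake)
P_A = {(True, True): 0.95, (True, False): 0.94,    # P(Alarm | B, E)
       (False, True): 0.29, (False, False): 0.001}
P_J = {True: 0.90, False: 0.05}  # P(JohnCalls | Alarm)
P_M = {True: 0.70, False: 0.01}  # P(MaryCalls | Alarm)

def joint(b, e, a, j, m):
    """P(b, e, a, j, m) = product of each node's CPT entry given its parents."""
    p = (P_B if b else 1 - P_B) * (P_E if e else 1 - P_E)
    p *= P_A[(b, e)] if a else 1 - P_A[(b, e)]
    p *= P_J[a] if j else 1 - P_J[a]
    p *= P_M[a] if m else 1 - P_M[a]
    return p

# P(j, m, a, ¬b, ¬e) = 0.90 * 0.70 * 0.001 * 0.999 * 0.998 ≈ 0.00063
print(joint(False, False, True, True, True))
```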
Markov blanket
Each node is conditionally independent of all others given its
Markov blanket: parents + children + children's parents
[Figure: Markov blanket of a node]
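Reading off a Markov blanket is purely structural. A minimal sketch over the burglary network's topology (parent lists only, no CPTs needed):

```python
# Network structure as a child -> parents map (burglary example)
parents = {"Burglary": [], "Earthquake": [],
           "Alarm": ["Burglary", "Earthquake"],
           "JohnCalls": ["Alarm"], "MaryCalls": ["Alarm"]}

def markov_blanket(node):
    """Parents + children + children's other parents, excluding the node itself."""
    children = [c for c, ps in parents.items() if node in ps]
    blanket = set(parents[node]) | set(children)
    for c in children:
        blanket |= set(parents[c])   # children's other parents
    blanket.discard(node)
    return blanket

print(markov_blanket("Alarm"))
```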
Constructing belief networks
Need a method such that a series of locally testable assertions of
conditional independence guarantees the required global semantics
1. Choose an ordering of variables
$X_1, \ldots, X_n$
2. For $i$ = 1 to $n$
add $X_i$ to the network
select parents from
$X_1, \ldots, X_{i-1}$ such that
$P(X_i \mid Parents(X_i)) = P(X_i \mid X_1, \ldots, X_{i-1})$
This choice of parents guarantees the global semantics:
$P(X_1, \ldots, X_n) = \prod_{i=1}^{n} P(X_i \mid X_1, \ldots, X_{i-1})$ (chain rule)
$= \prod_{i=1}^{n} P(X_i \mid Parents(X_i))$ (by construction)
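The construction loop can be sketched by brute force: answer each conditional-independence query against a small full joint over (Cavity, Toothache, Catch), and pick the smallest predecessor set that shields each variable. The CPT numbers and the enumeration-based test are illustrative assumptions, not part of the slides.

```python
from itertools import combinations, product

order = ["Cavity", "Toothache", "Catch"]   # chosen variable ordering

def joint(cav, t, c):
    """Illustrative full joint with Catch ⟂ Toothache | Cavity."""
    p = 0.2 if cav else 0.8
    p *= (0.6 if t else 0.4) if cav else (0.1 if t else 0.9)
    p *= (0.9 if c else 0.1) if cav else (0.2 if c else 0.8)
    return p

worlds = list(product([True, False], repeat=3))

def cond(i, given):
    """P(X_i = True | the assignment 'given' to predecessor indices)."""
    den = sum(joint(*w) for w in worlds if all(w[j] == v for j, v in given.items()))
    num = sum(joint(*w) for w in worlds if w[i] and all(w[j] == v for j, v in given.items()))
    return num / den

def minimal_parents(i):
    """Smallest S among predecessors with P(X_i | S) = P(X_i | X_1..X_{i-1})."""
    preds = list(range(i))
    for r in range(len(preds) + 1):
        for S in combinations(preds, r):
            if all(abs(cond(i, {j: vals[j] for j in S}) -
                       cond(i, dict(zip(preds, vals)))) < 1e-9
                   for vals in product([True, False], repeat=len(preds))):
                return [order[j] for j in S]

for i, name in enumerate(order):
    print(name, "<-", minimal_parents(i))
```

With this causal ordering, Cavity gets no parents and both symptoms get Cavity alone, matching the conditional independence built into the joint.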
Example
Suppose we choose the ordering $M$, $J$, $A$, $B$, $E$
[Figure: incremental construction of the network under this ordering]
$P(J \mid M) = P(J)$? No
$P(A \mid J, M) = P(A \mid J)$? $P(A \mid J, M) = P(A)$? No
$P(B \mid A, J, M) = P(B \mid A)$? Yes
$P(B \mid A, J, M) = P(B)$? No
$P(E \mid B, A, J, M) = P(E \mid A)$? No
$P(E \mid B, A, J, M) = P(E \mid A, B)$? Yes
Example: Car diagnosis
Initial evidence: engine won't start
Testable variables (thin ovals), diagnosis variables (thick ovals)
Hidden variables (shaded) ensure sparse structure, reduce parameters
[Figure: car diagnosis network]
Example: Car insurance
Predict claim costs (medical, liability, property)
given data on application form (other unshaded nodes)
[Figure: car insurance network]
Compact conditional distributions
CPT grows exponentially with no. of parents
CPT becomes infinite with continuous-valued parent or child
Solution: canonical distributions that are defined compactly
Deterministic nodes are the simplest case: $X = f(Parents(X))$ for some function $f$
E.g., Boolean functions: $NorthAmerican \Leftrightarrow Canadian \lor US \lor Mexican$
E.g., numerical relationships among continuous variables
Compact conditional distributions contd.
Noisy-OR distributions model multiple noninteracting causes:
1) Parents $U_1, \ldots, U_k$ include all causes (can add leak node)
2) Independent failure probability $q_i$ for each cause alone
$\Rightarrow P(X \mid U_1, \ldots, U_j, \lnot U_{j+1}, \ldots, \lnot U_k) = 1 - \prod_{i=1}^{j} q_i$
Cold | Flu | Malaria | P(Fever) | P($\lnot$ Fever)
F | F | F | 0.0 | 1.0
F | F | T | 0.9 | 0.1
F | T | F | 0.8 | 0.2
F | T | T | 0.98 | 0.02 = 0.2 x 0.1
T | F | F | 0.4 | 0.6
T | F | T | 0.94 | 0.06 = 0.6 x 0.1
T | T | F | 0.88 | 0.12 = 0.6 x 0.2
T | T | T | 0.988 | 0.012 = 0.6 x 0.2 x 0.1
Number of parameters linear in number of parents
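The whole table above collapses to three numbers under noisy-OR: the per-cause failure probabilities $q_{cold} = 0.6$, $q_{flu} = 0.2$, $q_{malaria} = 0.1$. A sketch:

```python
# Per-cause failure (inhibition) probabilities for the fever example
q = {"Cold": 0.6, "Flu": 0.2, "Malaria": 0.1}

def p_fever(active_causes):
    """Noisy-OR: fever fails only if every present cause is inhibited."""
    p_not_fever = 1.0
    for cause in active_causes:
        p_not_fever *= q[cause]
    return 1.0 - p_not_fever

print(p_fever({"Cold", "Flu", "Malaria"}))  # 1 - 0.6*0.2*0.1
```

With no leak node, no causes present gives probability 0, matching the table's first row.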
Naive Bayes
Very simple but surprisingly useful model: all findings conditionally independent given the cause
$P(Cause, Effect_1, \ldots, Effect_n) = P(Cause) \prod_i P(Effect_i \mid Cause)$
Therefore the cause $c$ that maximizes
$P(c \mid e_1, \ldots, e_n)$ is
just the one that maximizes $P(c) \prod_i P(e_i \mid c)$
Pathfinder: first BN medical diagnosis system. Using naive Bayes it outperformed doctors! The full BN version saved 1 life in 1000:
1) Better at incorporating prior probabilities of different diseases
2) Uses all evidence -- humans focus on only 7-9 pieces
CPCS: internal diseases -- 448 nodes, 906 edges
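The naive Bayes maximization is a one-liner. A sketch with toy causes, findings, and probabilities (all illustrative assumptions, not Pathfinder's actual model):

```python
from math import prod

# Toy priors P(c) and likelihoods P(finding | c); values are invented
priors = {"flu": 0.1, "cold": 0.3, "healthy": 0.6}
likelihood = {
    "fever": {"flu": 0.9, "cold": 0.2, "healthy": 0.01},
    "cough": {"flu": 0.7, "cold": 0.8, "healthy": 0.05},
}

def diagnose(findings):
    """Return the cause c maximizing P(c) * prod_i P(e_i | c)."""
    return max(priors, key=lambda c: priors[c] * prod(likelihood[f][c] for f in findings))

print(diagnose(["fever", "cough"]))
```

Normalizing by $P(e_1, \ldots, e_n)$ is unnecessary for the argmax, which is why the unnormalized product suffices.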