• 검색 결과가 없습니다.

Directed Graphical Models

N/A
N/A
Protected

Academic year: 2021

Share "Directed Graphical Models"

Copied!
19
0
0

로드 중.... (전체 텍스트 보기)

전체 글

(1)

Directed Graphical Models

Seung-Hoon Na

Chonbuk National University

(2)

Directed Graphical Models

• a GM whose graph is a DAG

• Bayesian networks

• Belief networks

• Causal networks

– But, nothing inherently “Bayesian”, “subjective”

(belief), or “causal”

(3)

Directed Graphical Models

• The ordered Markov property

– Node only depends on its immediate parents, not on all predecessors in the ordering

(4)

Directed Graphical Models

(5)

Directed Graphical Models: Examples

• Naive Bayes classifiers

(6)

Directed Graphical Models: Examples

• Markov and hidden Markov models

(7)

Directed Graphical Models: Examples

• Directed Gaussian graphical models

– Also, called Gaussian Bayes net – Use a linear gaussian CPD:

– The directed GGM means the joint distribution:

– We can derive 𝝁, 𝚺 from the CPD parameters

(8)

Inference

• Computing the posterior distribution of the unknowns given the knowns:

• Marginalizing out the nuisance variables, when only some of the hidden variables are of

interest to us:

(9)

Learning

• Computing a MAP estimate of the parameters given data:

• With a Bayesian view, the parameters are

unknown variables and should also be inferred.

 No distinction between inference and earning

(10)

Learning from complete data

• If all the variables are fully observed in each case, so there is no missing data and there are no hidden variables, we say the data is complete.

• Assume that the prior factorizes:

• Then, the posterior also factorizes

(11)

Learning from complete data: Example

• All CPDs are tabular, formally given:

• Prior:

• Posterior:

the sufficient statistics

(12)

Learning from complete data: Example

(13)

Learning from complete data: Example

• For the t=4 node under a Dirichlet prior with

(14)

Learning from missing and/or latent variables

• Bayesian inference of the parameters is even harder

• Approximate inference techniques are required

– Variational inference – MCMC

(15)

Conditional independence properties of DGMs

– if A is independent of B given C in the graph G

• 𝐼 𝐺 : the set of all such CI statements encoded by 𝐺

• 𝐼 𝑝 : the set of all CI statements that hold for 𝑝

• G is an I-map for 𝑝:

– The graph G does not make any assertions of CI that are not true of the distribution p.

(16)

Conditional independence properties of DGMs

• G is a minimal I-map of p: G is an I-map of p, and if there is no 𝐺′ ⊆ 𝐺 which is an I-map of p

• Define the CI properties of a DAG using d- separation

𝐴 is d-separated from 𝐵 given 𝐸 Directed global Markov property (G)

(17)

Conditional independence properties of DGMs

• The directed local Markov property (L)

• Ordered Markov property (O)

The non-descendants of a node t

(18)

Conditional independence properties of DGMs

• All these Markov properties (G, L, O) are equivalent

although it looks less obvious

(19)

D-separation: Example

참조

관련 문서

If a path is possible (i.e. , has positive probability), we want the hedge to work along that path. The actual value of the probability is irrelevant. There are no

When static keyword is used with the variables and the methods, it signifies that these members belongs to the class and these members are shared by all the objects of

If there are no results in each case, then your query result is identical to the answer query result, and therefore your query is correct (or at least, it is good enough that

In the heating method by line heating, the initial properties of steel are changed by variables such as temperature, time, and speed. The experimental data

Purpose: Calcaneal fracture is a rare fracture, which accounts for about 2% of all fractures, but is one of the most common fractures in the ankle bone.. There is

There are various legal grounds about finance bills and the budgetary system. For example, there are some nations of which the authority of the legislature is simply

There are several factors that make proper English article use difficult for Korean speakers.. First, there is no article system

Note that the number of adjacent cells is the same as the number of input variables since it is equal to the number of bits... If we don’t use DC terms, the logic function f