Skip to main content

Markov Random Fields

Markov Blanket (MB): the set of nodes that makes XiX_i conditionally independent of the other nodes.

Undirected graphical models are called Markov Random Fields (MRFs) which are the models with dependencies described by an undirected graph

  • the edges of undirected model represent probabilistic interactions between neighbors
  • a clique is a a subset of nodes such that every two vertices in the subset are connected by an edge.
  • a maximal clique is a clique that cannot be extended by adding a new vertex.

Let's make generalization. Let X=(X1,X2,,Xm)X = (X_1, X_2, \dots, X_m) be a random vector in our graph GG and let C\mathcal{C} be the set of all maximal cliques of GG. Then we have the distribution pp of XX factorizes with respect to GG if p(x)CCψC(xC)p(x) \propto \prod_{C \in \mathcal{C}} \psi_C(x_C) where ψC\psi_C is a nonnegative potential function where xC=(xi)iCx_C = (x_i)_{i\in C}.

  • The MRF on GG represents the distributions that factorize w.r.t. GG.

Hammersley-Clifford Theorem

If x,p(x)>0\forall x, p(x) > 0, the following are equivalent:

  • p(x)CCψC(xC)p(x) \propto \prod_{C \in \mathcal{C}} \psi_C(x_C) where ψC\psi_C is a nonnegative potential function
  • Global Markov Properties: XAXBXSX_A\perp X_B | X_S if sets AA and BB are disjoint(separated) by SS (i.e. each path from AA to BB goes through SS).

In particular,

  • iji\ne j are not conneced by an edge, then XiXjXrestX_i\perp X_j|X_{\text{rest}}

Exponetial Family

The general form we have is that p(xθ)=1Z(θ)cCψc(xCθc),θ=(θC)CCp(x|\theta) = \frac{1}{Z(\theta)}\prod_{c\in \mathcal{C}}\psi_c(x_C|\theta_c), \theta = (\theta_C)_{C\in\mathcal{C}}.

Sometimes, we can write this in an exponential form: p(xθ)=exp(CCϕc(xCθC)logZ(θ))p(x|\theta) = \exp(\sum_{C\in \mathcal{C}}\phi_c(x_C|\theta_C) - \log Z(\theta)). Suppose the potentials have a log-linear form logϕc(xCθC)=θCTϕc(xC)\log \phi_c(x_C|\theta_C) = \theta_C^T\phi_c(x_C), then we have p(xθ)=exp(CCθCTϕC(xC)logZ(θ))p(x|\theta) = \exp(\sum_{C\in \mathcal{C}}\theta_C^T\phi_C(x_C) - \log Z(\theta))