Bayesian/Graphoid intersection property for factorisation spaces

Gr\'egoire Sergeant-Perthuis

arXiv:1903.06026·math.ST·May 25, 2021

Bayesian/Graphoid intersection property for factorisation spaces

Gr\'egoire Sergeant-Perthuis

PDF

Open Access

TL;DR

This paper generalizes the intersection property in Bayesian networks to factorisation spaces, providing new proofs and extending classical theorems like Hammersley-Clifford to broader settings, including non-finite graphs.

Contribution

It introduces a general intersection property for factorisation spaces, offers a novel proof of existing results, and extends the Hammersley-Clifford theorem to non-finite graphs.

Findings

01

Generalized intersection property for factorisation spaces

02

New proof of the Hammersley-Clifford theorem

03

Extension of decomposition into interaction subspaces to vector spaces

Abstract

We remark that Pearl's Graphoid intersection property, also called intersection property in Bayesian networks, is a particular case of a general intersection property, in the sense of intersection of coverings, for factorisation spaces, also coined as factorisation models, factor graphs or by Lauritzen in his reference book 'Graphical Models' as hierarchical model subspaces. A particular case of this intersection property appears in Lauritzen's book as a consequence of the decomposition into interaction subspaces; the novel proof that we give of this result allows us to extend it in the most general setting. It also allows us to give a direct and new proof of the Hammersley-Clifford theorem transposing and reducing it to a corresponding statement for graphs, justifying formally the geometric intuition of independency, and extending it to non finite graphs. This intersection property is…

Equations152

X\mathchoice{\mathrel{\hbox to0.0pt{$\displaystyle\perp$\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{$\textstyle\perp$\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{$\scriptstyle\perp$\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{$\scriptscriptstyle\perp$\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}Y|(Z,W)\text{ and }X\mathchoice{\mathrel{\hbox to0.0pt{$\displaystyle\perp$\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{$\textstyle\perp$\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{$\scriptstyle\perp$\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{$\scriptscriptstyle\perp$\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}W|(Z,Y)\implies X\mathchoice{\mathrel{\hbox to0.0pt{$\displaystyle\perp$\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{$\textstyle\perp$\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{$\scriptstyle\perp$\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{$\scriptscriptstyle\perp$\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}(Y,W)|Z

X\mathchoice{\mathrel{\hbox to0.0pt{$\displaystyle\perp$\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{$\textstyle\perp$\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{$\scriptstyle\perp$\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{$\scriptscriptstyle\perp$\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}Y|(Z,W)\text{ and }X\mathchoice{\mathrel{\hbox to0.0pt{$\displaystyle\perp$\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{$\textstyle\perp$\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{$\scriptstyle\perp$\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{$\scriptscriptstyle\perp$\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}W|(Z,Y)\implies X\mathchoice{\mathrel{\hbox to0.0pt{$\displaystyle\perp$\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{$\textstyle\perp$\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{$\scriptstyle\perp$\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{$\scriptscriptstyle\perp$\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}(Y,W)|Z

P_{X, Y, Z} (x, y, z) = \frac{P _{X, Z} ( x , z ) P _{Y, Z} ( y , z )}{P _{Z} ( z )}

P_{X, Y, Z} (x, y, z) = \frac{P _{X, Z} ( x , z ) P _{Y, Z} ( y , z )}{P _{Z} ( z )}

P_{X, Y, Z, W} \in F_{W, X, Z} F_{W, Y, Z} \cap F_{X, Y, Z} F_{W, Y, Z} ⟹ P_{X, Y, Z, W} \in F_{X, Z} F_{W, Y, Z}

P_{X, Y, Z, W} \in F_{W, X, Z} F_{W, Y, Z} \cap F_{X, Y, Z} F_{W, Y, Z} ⟹ P_{X, Y, Z, W} \in F_{X, Z} F_{W, Y, Z}

F_{A} = {f \in R_{> 0}^{E_{I}} : \exists (f_{a} \in R_{> 0}^{E_{a}}, a \in A), \forall x \in E_{I} f = a \in A \prod f_{a} (x_{a})}

F_{A} = {f \in R_{> 0}^{E_{I}} : \exists (f_{a} \in R_{> 0}^{E_{a}}, a \in A), \forall x \in E_{I} f = a \in A \prod f_{a} (x_{a})}

\hat{A} = {a \in P (I) : \exists b \in A, a \leq b}

\hat{A} = {a \in P (I) : \exists b \in A, a \leq b}

f (x) = k \in [1, n] \prod f_{k} (x_{a_{k}})

f (x) = k \in [1, n] \prod f_{k} (x_{a_{k}})

F_{A} = F_{\hat{A}}

F_{A} = F_{\hat{A}}

j \in J ⋂ F_{\hat{A}_{j}} = F_{j \in J ⋂ \hat{A}_{j}} .

j \in J ⋂ F_{\hat{A}_{j}} = F_{j \in J ⋂ \hat{A}_{j}} .

ln P = a \in A \sum f_{a}

ln P = a \in A \sum f_{a}

P (x) = \frac{e ^{\sum_{a \in A} ϕ_{a} (x_{a})}}{\sum _{y \in E_{I}} e ^{\sum_{a \in A} ϕ_{a} (y_{a})}}

P (x) = \frac{e ^{\sum_{a \in A} ϕ_{a} (x_{a})}}{\sum _{y \in E_{I}} e ^{\sum_{a \in A} ϕ_{a} (y_{a})}}

X_{v}\mathchoice{\mathrel{\hbox to0.0pt{$\displaystyle\perp$\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{$\textstyle\perp$\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{$\scriptstyle\perp$\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{$\scriptscriptstyle\perp$\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}X_{u}|X_{k}

X_{v}\mathchoice{\mathrel{\hbox to0.0pt{$\displaystyle\perp$\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{$\textstyle\perp$\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{$\scriptstyle\perp$\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{$\scriptscriptstyle\perp$\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}X_{u}|X_{k}

X_{i}\mathchoice{\mathrel{\hbox to0.0pt{$\displaystyle\perp$\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{$\textstyle\perp$\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{$\scriptstyle\perp$\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{$\scriptscriptstyle\perp$\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}X_{j}|X_{I\setminus\{i,j\}}.

X_{i}\mathchoice{\mathrel{\hbox to0.0pt{$\displaystyle\perp$\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{$\textstyle\perp$\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{$\scriptstyle\perp$\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{$\scriptscriptstyle\perp$\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}X_{j}|X_{I\setminus\{i,j\}}.

X_{i}\mathchoice{\mathrel{\hbox to0.0pt{$\displaystyle\perp$\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{$\textstyle\perp$\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{$\scriptstyle\perp$\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{$\scriptscriptstyle\perp$\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}X_{I\setminus(i\cup\partial i)}|X_{\partial i}

X_{i}\mathchoice{\mathrel{\hbox to0.0pt{$\displaystyle\perp$\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{$\textstyle\perp$\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{$\scriptstyle\perp$\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{$\scriptscriptstyle\perp$\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}X_{I\setminus(i\cup\partial i)}|X_{\partial i}

⟨ f, g ⟩ = x \in E_{I} \sum f (x) g (x)

⟨ f, g ⟩ = x \in E_{I} \sum f (x) g (x)

U (A) = ln F_{A}

U (A) = ln F_{A}

U ({a}) = b \subseteq a ⨁ S_{b}

U ({a}) = b \subseteq a ⨁ S_{b}

P_{X} \in P (G) ⟺ P_{X} \in L (G) ⟺ P_{X} \in F_{C} .

P_{X} \in P (G) ⟺ P_{X} \in L (G) ⟺ P_{X} \in F_{C} .

A_{P} = (i, j) : i \in / \partial j ⋂ [i, j]

A_{P} = (i, j) : i \in / \partial j ⋂ [i, j]

\hat{A}_{L} = \hat{A}_{P} = C

\hat{A}_{L} = \hat{A}_{P} = C

A R B ⟺ \forall a \in A, \exists b \in B, a \subseteq b

A R B ⟺ \forall a \in A, \exists b \in B, a \subseteq b

A ⊓ B = {a \cap b ∣ a \in A, b \in B}

A ⊓ B = {a \cap b ∣ a \in A, b \in B}

A ⊓ B = B ⊓ A, (A \cup B) ⊓ C = (A ⊓ C) \cup (B ⊓ C), A ⊓ B \leq A .

A ⊓ B = B ⊓ A, (A \cup B) ⊓ C = (A ⊓ C) \cup (B ⊓ C), A ⊓ B \leq A .

[A \leq C \land B \leq D] ⟹ A \cup B \leq C \cup D .

[A \leq C \land B \leq D] ⟹ A \cup B \leq C \cup D .

[A \leq C \land B \leq D] ⟹ A ⊓ B \leq C ⊓ D .

[A \leq C \land B \leq D] ⟹ A ⊓ B \leq C ⊓ D .

\forall a \in A, \exists b \in B, a \subseteq b \forall b \in B, \exists c \in C, b \subseteq c .

\forall a \in A, \exists b \in B, a \subseteq b \forall b \in B, \exists c \in C, b \subseteq c .

\exists a \in A, \exists b \in B, x = a \cap b ⟺ \exists a \in B, \exists b \in A, x = a \cap b

\exists a \in A, \exists b \in B, x = a \cap b ⟺ \exists a \in B, \exists b \in A, x = a \cap b

(A \cup B) ⊓ C = (a, c) \in (A \cup B) \times C ⋃ {a \cap c} = (a, c) \in A \times C or (a, c) \in B \times C ⋃ {a \cap c} = (a, c) \in A \times C ⋃ {a \cap c} \cup (b, c) \in B \times C ⋃ {b \cap c} .

(A \cup B) ⊓ C = (a, c) \in (A \cup B) \times C ⋃ {a \cap c} = (a, c) \in A \times C or (a, c) \in B \times C ⋃ {a \cap c} = (a, c) \in A \times C ⋃ {a \cap c} \cup (b, c) \in B \times C ⋃ {b \cap c} .

A \sim B ⟺ [A \leq B] \land [B \leq A] .

A \sim B ⟺ [A \leq B] \land [B \leq A] .

\begin{array}[]{ccccc}p&:&\mathscr{P}^{2}(I)&\to&\overline{\mathscr{P}^{2}(I)}\\ &&A&\mapsto&[A]\\ \end{array}

\begin{array}[]{ccccc}p&:&\mathscr{P}^{2}(I)&\to&\overline{\mathscr{P}^{2}(I)}\\ &&A&\mapsto&[A]\\ \end{array}

[A] \overline{\leq} [B] ⟺ A \leq B .

[A] \overline{\leq} [B] ⟺ A \leq B .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference

Full text

Bayesian/Graphoid intersection property for factorisation spaces.

Grégoire Sergeant-Perthuislabel=e1][email protected] [ IMJ-PRG, Université de Paris

Abstract

We remark that Pearl’s Graphoid intersection property, also called intersection property in Bayesian networks, is a particular case of a general intersection property, in the sense of intersection of coverings, for factorisation spaces, also coined as factorisation models, factor graphs or by Lauritzen in his reference book Graphical Models as hierarchical model subspaces. A particular case of this intersection property appears in Lauritzen’s book as a consequence of the decomposition into interaction subspaces; the novel proof that we give of this result allows us to extend it in the most general setting. It also allows us to give a direct and new proof of the Hammersley-Clifford theorem transposing and reducing it to a corresponding statement for graphs, justifying formally the geometric intuition of independency, and extending it to non finite graphs. This intersection property is the starting point for a generalization of the decomposition into interaction subspaces to collections of vector spaces [7].

62H22,

00X00,

06F25,

Hammersley-Clifford,

Graphical models,

keywords:

[class=MSC2020]

keywords:

\SetNewAudience

long

1 Introduction

1.1 Intersection property

1.1.1 Intersection property and graphoids

To describe the structure of dependencies of a set of random variables, as well said by Judea Pearl in Chapter 3 of [6], one can introduce a ternary operator corresponding to the conditional independence:

”The notion of informational relevance is given […] through the device of conditional independence, which successfully captures our intuition about how dependencies should change in response to news facts”.

For any three random variables with discrete values, we will note $X\mathchoice{\mathrel{\hbox to0.0pt{$ \displaystyle\perp $\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{$ \textstyle\perp $\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{$ \scriptstyle\perp $\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{$ \scriptscriptstyle\perp $\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}Y|_{P}Z$ the fact that $X$ is independent of $Y$ conditionally to $Z$ (see Section 5.2 Equation 5.2); in the previous expression $P$ will be omitted from now on, as in literature.

The intersection property in Bayesian networks, as found in [9] (Chapter 2 Proposition 2.12) or [6] (Chapter 3 Theorem 1), is the following proposition.

Proposition 5.1 (Intersection property).

Let $W,X,Y,Z$ be four random variables that take values in a finite set and for which the probability density $P_{W,X,Y,Z}$ is stricly positive, then,

[TABLE]

Semi-graphoids and graphoids were introduced to give a formal set of axioms, on $\mathchoice{\mathrel{\hbox to0.0pt{$ \displaystyle\perp $\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{$ \textstyle\perp $\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{$ \scriptstyle\perp $\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{$ \scriptscriptstyle\perp $\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}$ , for conditional independence (see [3], [6]); in this context Proposition 5.1 is called the intersection axiom.

Definition 1.1 (Semi-graphoid, graphoid [6][5]).

A semi-graphoid structure on a collection $I=\coprod_{J}\{X_{j}\}$ is a ternary relation on subsets of $I$ , that we shall note as $X\mathchoice{\mathrel{\hbox to0.0pt{$ \displaystyle\perp $\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{$ \textstyle\perp $\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{$ \scriptstyle\perp $\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{$ \scriptscriptstyle\perp $\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}Y|Z$ , such that, for any, $X,Y,Z,W$ , disjoint subsets of $I$ ,

if $X\mathchoice{\mathrel{\hbox to0.0pt{$ \displaystyle\perp $\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{$ \textstyle\perp $\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{$ \scriptstyle\perp $\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{$ \scriptscriptstyle\perp $\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}Y|Z$ then $Y\mathchoice{\mathrel{\hbox to0.0pt{$ \displaystyle\perp $\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{$ \textstyle\perp $\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{$ \scriptstyle\perp $\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{$ \scriptscriptstyle\perp $\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}X|Z$ ; 2. 2.

if $X\mathchoice{\mathrel{\hbox to0.0pt{$ \displaystyle\perp $\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{$ \textstyle\perp $\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{$ \scriptstyle\perp $\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{$ \scriptscriptstyle\perp $\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}Y|Z$ and $U\subseteq X$ , then $U\mathchoice{\mathrel{\hbox to0.0pt{$ \displaystyle\perp $\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{$ \textstyle\perp $\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{$ \scriptstyle\perp $\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{$ \scriptscriptstyle\perp $\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}Y|Z$ ; 3. 3.

if $X\mathchoice{\mathrel{\hbox to0.0pt{$ \displaystyle\perp $\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{$ \textstyle\perp $\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{$ \scriptstyle\perp $\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{$ \scriptscriptstyle\perp $\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}Y|Z$ and $U\subseteq X$ , then $X\mathchoice{\mathrel{\hbox to0.0pt{$ \displaystyle\perp $\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{$ \textstyle\perp $\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{$ \scriptstyle\perp $\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{$ \scriptscriptstyle\perp $\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}Y|Z\cup U$ ; 4. 4.

if $X\mathchoice{\mathrel{\hbox to0.0pt{$ \displaystyle\perp $\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{$ \textstyle\perp $\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{$ \scriptstyle\perp $\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{$ \scriptscriptstyle\perp $\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}Y|Z$ and $X\mathchoice{\mathrel{\hbox to0.0pt{$ \displaystyle\perp $\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{$ \textstyle\perp $\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{$ \scriptstyle\perp $\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{$ \scriptscriptstyle\perp $\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}W|Y\cup Z$ then $X\mathchoice{\mathrel{\hbox to0.0pt{$ \displaystyle\perp $\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{$ \textstyle\perp $\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{$ \scriptstyle\perp $\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{$ \scriptscriptstyle\perp $\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}W\cup Y|Z$ .

It is a graphoid if furthermore it satisfies the intersection axiom.

1.1.2 Factorisation spaces

Let us suppose that $X,Y,Z$ take values respectively in finite spaces $\Omega_{X}$ , $\Omega_{Y}$ , $\Omega_{Z}$ . The fact that $X$ is independent of $Y$ conditionally to $Z$ can be restated as a factorisation property on $P_{X,Y,Z}$ ; for simplicity let us assume that $P_{X,Y,Z}$ is sticly positive, then $X\mathchoice{\mathrel{\hbox to0.0pt{$ \displaystyle\perp $\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{$ \textstyle\perp $\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{$ \scriptstyle\perp $\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{$ \scriptscriptstyle\perp $\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}Y|Z$ if and only if for any $(x,y,z)\in\Omega_{X}\times\Omega_{Y}\times\Omega_{Z}$ ,

[TABLE]

where $P_{X,Z}$ , $P_{Y,Z}$ , $P_{Z}$ are repectively the marginal probabilities of $(X,Z)$ , $(Y,Z)$ and $Z$ .

If one notes $\mathscr{F}_{Y,Z}$ the set of strictly positive functions on $\Omega_{X}\times\Omega_{Y}\times\Omega_{Z}$ that only depend on $(Y,Z)$ and $\mathscr{F}_{X,Z}\mathscr{F}_{Y,Z}$ the set of functions that are the product of a function of $\mathscr{F}_{X,Z}$ and a function of $\mathscr{F}_{Y,Z}$ , then the intersection property can be restated as, for any strictly positive proability law $P_{X,Y,Z,W}$ ,

[TABLE]

We are interested in generalizing this result to intersections of factorisation spaces that we will define now.

Definition 1.2 (Factorisation space).

Let $I$ be a finite set, let $\mathscr{A}\subseteq\mathscr{P}(I)$ , where $\mathscr{P}(I)$ is the set of subsets of $I$ . Let $(E_{i},i\in I)$ be a collection of sets, let $E_{a}=\prod_{i\in a}E_{i}$ for any $a\in\mathscr{P}(I)$ ; for $x\in E_{I}$ , we will denote $x_{a}$ its projection onto $E_{a}$ . The factorisation space over $\mathscr{A}$ is defined as follows,

[TABLE]

*Notation 1**.*

From now on we shall note $\mathscr{P}\mathscr{P}(I)$ as $\mathscr{P}^{2}(I)$ .

One can extend the previous definition to the case where $\mathscr{A}$ is non finite. To do so let us introduce a notation; for any $\mathscr{A}\subseteq\mathscr{P}(I)$ , let,

[TABLE]

*Notation 2**.*

$\hat{\mathscr{A}}$ is called the lower set of $\mathscr{A}$ and the set of lower sets will be denoted as $\operatorname{\mathscr{U}}(\mathscr{P}(I))$ , i.e. $\operatorname{\mathscr{U}}(\mathscr{P}(I))=\{\hat{\mathscr{A}}|\mathscr{A}\subseteq\mathscr{P}(I)\}$ .

Definition 1.3 (Generalized factorisation spaces).

Let $I$ be any set and let $\mathscr{A}\subseteq\mathscr{P}(I)$ , any $f\in\mathbb{R}_{>0}^{E_{I}}$ is in $\mathscr{F}_{\mathscr{A}}$ if and only if there is $n\in\mathbb{N}$ , a collection $(a_{k}\in\hat{\mathscr{A}},k\in[1,n])$ and a collection $(f_{k}\in\mathbb{R}^{E_{a_{k}}})$ such that for any $k\in[1,n]$ , $|a_{k}|<\infty$ and for any $x\in E_{I}$ ,

[TABLE]

In particular,

[TABLE]

1.1.3 Main theorem

The result we want to emphasize in this document is that an intersection property still holds for factorisation spaces.

Theorem 4.1 (Intersection property for factorisation spaces).

Let $I$ be any set, let $(E_{i},i\in I)$ be any collection of sets. For any family $(\mathscr{A}_{j})_{j\in J}$ of elements of $\mathscr{P}^{2}(I)$ ,

[TABLE]

A particular case of the intersection property appears in Lauritzen’s Graphical Models [5] in Appendix B Proposition B.5 as a consequence of the decomposition into interaction subspaces, that we will introduce in the next subsection. The proof we give of this result holds in a more general setting and is a direct one that does not rely on the decomposition into interaction subspaces. In fact in [7] we show the converse statement that Equation 1.8 is a structure property that characterizes collections of vector spaces that can be decomposed into direct sums of subspaces, similarly to the decomposition into interaction subspaces, in other words satisfying the intersection property implies that this collection has such decomposition.

A direct consequence of Theorem 4.1 is that there is a complete lattice morphism between $(\operatorname{\mathscr{U}}(\mathscr{P}(I)),\subseteq)$ and factorisation spaces. This remark enable us to prove the Hammersley-Clifford Theorem in a direct and novel manner, pushing properties of the graph of dependencies directly on its graphical model, that we will now sketch and allows us to give a generalization of the Hammersley-Clifford theorem.

1.2 Hammersley-Clifford Theorem

1.2.1 Graphical models: Markov fields and Gibbs fields

A graphical model is a way to express the interactions of random variables through the properties of a graph. For example, let $I$ be a finite set and let $(X_{i},i\in I)$ be a collection of random variables. Let us associate to each random variable a vertex of an undirected graph $G=(I,A)$ , where $I$ is its set of vertices and $A$ its set of edges; one could say that two random variables are in interaction if their vertices are nearest-neighbours in $G$ and expect that there is a collection $(f_{a}\in\mathbb{R}^{E_{a}},a\in A)$ such that,

[TABLE]

This is an example of a Gibbs state with respect to a potential. The adjacent elements of $i\in G$ will be denoted as $\partial i$ .

Definition 1.4 (Gibbs States).

Let $I$ be a finite set and $(E_{i},i\in I)$ be a collection of finite sets, let $\mathscr{A}\in\mathscr{P}^{2}(I)$ and let $\Phi=(\phi_{a}\in\mathbb{R}^{E_{a}},a\in\mathscr{A})$ be a collection of interactions, which we shall call a potentiel; a Gibbs state with respect to a potential $\Phi$ is defined as follows, for any $x\in E$ ,

[TABLE]

*Remark 1.1**.*

Any probability law on $E$ is a Gibbs state; furthermore if there is a potential $\Phi=(\phi_{a}\in\mathbb{R}^{E_{a}},a\in\mathscr{A})$ such that a probability law $P$ is a Gibbs state with respect to $\Phi$ , then $P$ is in the factorisation space over $\mathscr{A}$ .

There is an other way to specify the interactions of the random variables from the properties of a graph. For example on can imagine that if two vertices $v,u$ are connected only through a third vertex $k$ , i.e. any path from $v$ to $u$ pass by $k$ , this would mean that the corresponding random variables, $X_{v}$ , $X_{u}$ are dependent only through $X_{k}$ , i.e. that,

[TABLE]

This is a particular case of spatial Markov property for the probability law of the random variables. There are several, a priori, different way to translate conditional connectedness properties of the graph into conditional independence properties, let us define two of such. Let for $a\subseteq I$ , $X_{a}$ denote $(X_{i})_{i\in a}=X_{|a}$ .

Definition 1.5 (Markov properties).

Let $G=(I,A)$ be a finite graph, a stricly positive probability $P_{X}$ on a finite set $E=\underset{i\in I}{\prod}E_{i}$ obeys,

$(P)$ the pairwise Markov property relative to $G$ , if for any pair $(i,j)$ of non-adjacent vertices

[TABLE] 2. 2.

$(L)$ the local Markov property relative to $G$ , if for any vectex $i\in V$ ,

[TABLE]

And we call the respective sets $P(G)$ , $L(G)$ .

As we will see the Hammersley-Clifford theorem asserts that the two points of view for reading the interactions from a graph, the Gibbs state and Markov property point of views, are in fact equivalent for a strictly positve probability law. One of the ways to prove the Hammersley-Clifford theorem is to build a decomposition into interaction subspaces of the factorisation spaces [8], we shall therefore give a brief presentation of this decomposition even though we shall not be using it in the rest of this document.

1.2.2 The decomposition into interaction subspaces

Let $I$ be a finite set and let $(E_{i},i\in I)$ be a collection of finite sets. Let us consider the canonical scalar product on $\mathbb{R}^{E_{I}}$ , i.e. for any $f,g\in\mathbb{R}^{E_{I}}$ ,

[TABLE]

Let for any $\mathscr{A}\in\mathscr{P}^{2}(I)$ ,

[TABLE]

Theorem 1.1 (Decomposition into interaction subspaces).

There is a collection of vector subspaces of $\mathbb{R}^{E_{I}}$ , $(S_{a},a\subseteq\mathscr{P}(I))$ , such that, for any $a\subseteq\mathscr{P}(I)$ ,

[TABLE]

and any two $S_{a},S_{b}$ , with $a\neq b$ , are orthogonal to one another.

Several proofs of this result can be found in [8].

1.2.3 A new proof of the Hammersley-Clifford Theorem

The Hammersley-Clifford theorem states that any Markov condition for a stricly positive probability law can be restated as a condition on the locality of the interactions of its potential, in other words Markov conditions correspond to some factorisation spaces.

Let $G=(I,A)$ be a graph; a clique of $G$ is a subset of $G$ such that every two distinct vertices are adjacent. We will note $\mathscr{C}$ the set of its cliques.

Theorem 1.2 (Hammersley-Clifford).

Let $G=(I,A)$ be a finite graph. For all $P_{X}$ strictly positive probability law on a finite set $\prod_{i\in I}E_{i}$ ,

[TABLE]

The intesection property for factorisation spaces enables us to bring back the proof of the Hammersley-Clifford theorem to a general property on graphs, let us sketch the proof that will will present in more details in this document.

Let $(i,j)$ be a pair of elements of $I$ , let $[i,j]=\{I\setminus\{i\}),I\setminus\{j\})\}$ and let,

[TABLE]

Similarly for all $i\in I$ , let $[i]=\{I\setminus i,i\cup\partial i\}$ and let $\mathscr{A}_{L}=\underset{i}{\bigcap}\hat{[i]}$ .

By remarking that,

[TABLE]

and applying the intersection property for factorisation spaces one ends the proof.

1.3 Structure of this document

In Section 2, we will give some general properties on partial coverings and their natural order making it a preorder with join and meet. Proposition 2.3 states that there is an increasing function between the preorder set of partial coverings and the poset of factorisation spaces that preserve the join.

In Section 3 we prove the intersection property (Theorem 3.1) and as a consequence we show that the increasing function also preserves meets. In this section we do not assume the $(E_{i},i\in I)$ to be finite, however we assume $I$ to be finite.

In the next section, Section 4, we extend the intersection property to any sets $I$ , Theorem 4.1.

Finaly in Section 5 we give apply the previous theorems giving new proofs for classical results around factorisation spaces that allow us to extend them, in particular we give a generalization of the Hammersley-Clifford theorem.

2 Order on partial coverings and factorisation spaces

In this section $I$ is a finite set, let $E=\prod_{i\in I}E_{i}$ be a product of any sets. For $a,b\in\mathscr{P}(I)$ such that $b\subseteq a$ let $\pi^{a}_{b}:E_{a}\to E_{b}$ be the projection of $E_{a}$ onto $E_{b}$ where by convention $\pi^{a}_{\emptyset}:E_{a}\to\ast$ is the projection on the set with one element $\ast$ ; for $x\in E$ , we shall note $\pi_{a}(x)$ as $x_{a}$ . In particular $\mathscr{F}_{\emptyset}$ is the set of stricly positive constant functions and for any $a\in\mathscr{P}(I)$ we note $\mathscr{F}_{\{a\}}$ as $\mathscr{F}_{a}$ .

$\mathbb{R}_{>0}$ can be seen as a vector space for the product law and the exponentiation and similarly for the product of these spaces. In this section we keep the, unusual, product convention to stay closer to the spirit of factorisation.

2.1 Order on partial coverings

Any subset $\mathscr{A}\subseteq\mathscr{P}(I)$ can be seen as a partial covering of $I$ of support $\cup_{a\in\mathscr{A}}a$ . The order for partial covering that we will now introduce is a direct extension of the usual one for coverings.

Definition 2.1.

Let us define an intersection $\sqcap$ and a relation R on $\mathscr{P}^{2}(I)$ . For all $\mathscr{A},\mathscr{B}\in\mathscr{P}^{2}(I)$ ,

[TABLE]

Proposition 2.1.

R* is pre-order that we will note $\leq$ and for $\mathscr{A},\mathscr{B},\mathscr{C},\mathscr{D}\in\mathscr{P}^{2}(I)$ ,*

[TABLE]

where $\wedge$ is the logic operator ”and”.

Proof.

Let $\mathscr{A},\mathscr{B},\mathscr{C}\in\mathscr{P}^{2}(I)$ . For all $a\in\mathscr{A}$ , $a\subseteq a$ . Therefore $\mathscr{A}\leq\mathscr{A}$ . Assume, $\mathscr{A}\leq\mathscr{B}$ and $\mathscr{B}\leq C$ , then,

[TABLE]

For $a\in\mathscr{A}$ there is $b\in\mathscr{B}$ and $c\in\mathscr{C}$ such that $a\subseteq b\subseteq c$ so $a\subseteq c$ and $[\mathscr{A}\leq\mathscr{B}\wedge\mathscr{B}\leq\mathscr{C}]\implies\mathscr{A}\leq\mathscr{C}$ . Therefore $\leq$ is a pre-order.

[TABLE]

So $\mathscr{A}\sqcap\mathscr{B}=\mathscr{B}\sqcap\mathscr{A}$ .

[TABLE]

So $(\mathscr{A}\cup\mathscr{B})\sqcap\mathscr{C}=(\mathscr{A}\sqcap\mathscr{C})\cup(\mathscr{B}\sqcap\mathscr{C})$ .

Let $c\in\mathscr{A}\sqcap\mathscr{B}$ then there is $a\in A$ , $b\in\mathscr{B}$ such that, $c\subseteq a\cap b\subseteq a$ . So, $[\mathscr{A}\sqcap\mathscr{B}\leq\mathscr{A}]\wedge[\mathscr{A}\sqcap\mathscr{B}\leq\mathscr{B}]$ .

Assume $\mathscr{A}\leq\mathscr{C}$ and $\mathscr{B}\leq\mathscr{D}$ then for all $a\in\mathscr{A}$ there is $c\in\mathscr{C}$ such that $a\subseteq c$ , for all $b\in\mathscr{B}$ there is $d\in\mathscr{D}$ such that $b\subseteq d$ . So for $x\in\mathscr{A}\cup\mathscr{B}$ there is $c\in\mathscr{C}$ such that $x\subseteq c$ or $d\in\mathscr{D}$ such that $x\subseteq d$ . However $c$ and $d\in\mathscr{C}\cup\mathscr{D}$ so $\mathscr{A}\cup\mathscr{B}\leq\mathscr{C}\cup\mathscr{D}$ . The last is proven the same way noting that $a\subseteq c$ , $b\subseteq d$ implies $a\cap b\subseteq c\cap d$ .

∎

Definition 2.2.

Let us introduce the usual equivalence relation for a pre-order (see E.III.3 [1]), for all $\mathscr{A},\mathscr{B}\in\mathscr{P}^{2}(I)$ ,

[TABLE]

Let $q:\mathscr{P}^{2}(I)\to J$ , with $J$ any poset, be a pre-order morphism, in the sense that for any $a,b\in\mathscr{P}^{2}(I)$ such that $a\leq b$ , $q(a)\leq q(b)$ . $q$ is said to preserve the equivalence relation when for all $\mathscr{A},\mathscr{B}\in\mathscr{P}^{2}(I)$ , $\left[\mathscr{A}\sim\mathscr{B}\implies q(\mathscr{A})=q(\mathscr{B})\right]$ . In what follows we supporse that $q$ preserves the equivalence relation.

If, for any $f:\mathscr{P}^{2}(I)\to K$ , with $K$ a poset, that is a pre-order morphism and that preserves the equivalence relation, there is a unique $\overline{f}$ that is a poset morphism such that $f=\overline{f}\circ q$ , then we will say that $q$ verifies the universal property $(P)$ .

Let us note $\mathscr{P}^{2}(I)/\sim$ as $\overline{\mathscr{P}^{2}(I)}$ .

Proposition 2.2.

*If two pre-order morphism, $p_{1}:\mathscr{P}^{2}(I)\to J$ , $p_{2}:\mathscr{P}^{2}(I)\to K$ , that preserve the equivalence relation, verify the universal property $(P)$ , then there is a poset isomorphism between $J$ and $K$ .

Let us define $p$ as,

[TABLE]

There is a unique order $\overline{\leq}$ on $\overline{\mathscr{P}^{2}(I)}$ such that $p:(\mathscr{P}^{2}(I),\leq)\to(\overline{\mathscr{P}^{2}(I)},\overline{\leq})$ is a pre-order morphism and verifies $(P)$ . It verifies for all $\mathscr{A},\mathscr{B}\in\mathscr{P}^{2}(I)$ ,

[TABLE]

Furthermore one can define a union on $\overline{\mathscr{P}^{2}(I)}$ and an intersection such for all $\mathscr{A}$ , $\mathscr{B}$ ,

[TABLE]

Equations 2.3,2.4,2.5 stay true on $\overline{\mathscr{P}^{2}(I)}$ . Let us recall them, $\mathscr{A},\mathscr{B},\mathscr{C},\mathscr{D}\in\overline{\mathscr{P}^{2}(I)}$ ,

[TABLE]

Proof.

Let $p_{1}:\mathscr{P}^{2}(I)\to J$ , $p_{2}:\mathscr{P}^{2}(I)\to K$ , that preserve the equivalence relation, verify the universal property $(P)$ . Then there is $\overline{p_{1}}$ , $\overline{p_{2}}$ , two poset morphisms, such that $p_{1}=\overline{p_{1}}\circ p_{2}$ , $p_{2}=\overline{p_{2}}\circ p_{1}$ . So $p_{1}=\overline{p_{1}}\circ\overline{p_{2}}\circ p_{1}$ , in other words the following diagram commutes:

[TABLE]

But $p_{1}=\operatorname{id}\circ p_{1}$ , therefore by the unicity statement in $(P)$ , $\overline{p_{1}}\circ\overline{p_{2}}=\operatorname{id}$ . One also has that $p_{2}=\overline{p_{2}}\circ\overline{p_{1}}\circ p_{2}$ , so $\overline{p_{2}}\circ\overline{p_{1}}=\operatorname{id}$ . Therefore $\overline{p_{1}}$ is a poset isomorphism between $J$ and $K$ .

Le us define the following relation for $x,y\in\overline{\mathscr{P}^{2}(I)}$ ,

[TABLE]

$(\overline{\mathscr{P}^{2}(I)},\overline{\leq})$ is a poset (see E.III.3 [1]).

Let $f:\mathscr{P}^{2}(I)\to K$ , with $K$ a poset, be a pre-order morphism that preserves the equivalence relation. By the universal property for the quotient map,there is a unique $\overline{f}$ such that $f=\overline{f}\circ p$ . For $\mathscr{A},\mathscr{B}\in\mathscr{P}^{2}(I)$ , suppose $[\mathscr{A}]\overline{\leq}[\mathscr{B}]$ , then $\mathscr{A}\leq\mathscr{B}$ and $f(\mathscr{A})\leq f(\mathscr{B})$ . $\overline{f}([\mathscr{A}])=f(\mathscr{A})$ and $\overline{f}([\mathscr{B}])=f(\mathscr{B})$ , so $\overline{f}([\mathscr{A}])\leq\overline{f}([\mathscr{B}])$ . Therefore $\overline{f}$ is a poset morphism.

Suppose that there are two orders $\leq_{1}$ and $\leq_{2}$ on $\overline{\mathscr{P}^{2}(I)}$ such that $p:(\mathscr{P}^{2}(I),\leq)\to(\overline{\mathscr{P}^{2}(I)},\leq_{1})$ and $p:(\mathscr{P}^{2}(I),\leq)\to(\overline{\mathscr{P}^{2}(I)},\leq_{2})$ are pre-order morphism and verify $(P)$ . Then there is $\overline{p}$ , a poset isomorphism, such that $p=\overline{p}\circ p$ . But by the universal property for the quotient map, $\overline{p}=\operatorname{id}$ . Therefore $\operatorname{id}:(\overline{\mathscr{P}^{2}(I)},\leq_{1})\to(\overline{\mathscr{P}^{2}(I)},\leq_{2})$ is a poset isomorphism. For all $x,y\in\overline{\mathscr{P}^{2}(I)}$ ,

[TABLE]

So $\leq_{1}=\leq_{2}$ .

Let $\mathscr{A},\mathscr{B},\mathscr{C},\mathscr{D}\in\mathscr{P}^{2}(I)$ , such that $\mathscr{A}\sim\mathscr{C}$ , $\mathscr{B}\sim\mathscr{D}$ , then by property Eq 2.4, $\mathscr{A}\cup\mathscr{B}\leq\mathscr{C}\cup\mathscr{D}$ and $\mathscr{C}\cup\mathscr{D}\leq\mathscr{A}\cup\mathscr{B}$ , so $\mathscr{A}\cup\mathscr{B}\sim\mathscr{C}\cup\mathscr{D}$ .

Similarly, by property Eq 2.5 $\mathscr{A}\sqcap\mathscr{B}\leq\mathscr{C}\sqcap\mathscr{D}$ and $\mathscr{C}\sqcap\mathscr{D}\leq\mathscr{A}\sqcap\mathscr{B}$ , so $\mathscr{A}\sqcap\mathscr{B}\sim\mathscr{C}\sqcap\mathscr{D}$ . Therefore the union and intersection given by Eq 2.8 are well defined.

For any $\mathscr{A},\mathscr{B},\mathscr{C},\mathscr{D}\in\mathscr{P}^{2}(I)$ ,

[TABLE]

Therefore, $[\mathscr{A}]\sqcap[\mathscr{B}]\leq[\mathscr{A}]$ . And one proceeds similarly for the two other properties.

∎

We will now also note $\overline{\leq}$ as $\leq$ .

*Example 1**.*

Consider $I=\{1,2,3,4\}$ . $\{\{1,2\},\{1,3\}\}\leq\{I\}$ and this is true for any element of $\mathscr{P}^{2}(I)$ .

[TABLE]

*Remark 2.1**.*

By construction, any section of $p$ induces a poset isomorphism. For example the application that sends $\mathscr{A}$ to its lower set induces a section $s:[\mathscr{A}]\mapsto\hat{\mathscr{A}}$ of $p$ ; $\widehat{\mathscr{P}^{2}(I)}$ and $p_{|\operatorname{\mathscr{U}}(\mathscr{P}(I))}$ is a poset isomorphism. On $\operatorname{\mathscr{U}}(\mathscr{P}(I))$ , $\leq$ is equal to the inclusion $\subseteq$ and $\sqcap=\cap$ .

2.2 Increasing function from $\mathscr{P}^{2}(I)$ to the poset of factorisation spaces

Let us denote $\mathscr{F}(I)$ the poset of factorization spaces.

Proposition 2.3.

Let,

[TABLE]

$\overline{\Phi}:(\overline{\mathscr{P}^{2}(I)},\leq)\to(\mathscr{F}(I),\subseteq)$ * is a poset morphism. For all $\mathscr{A},\mathscr{B}\in\mathscr{P}^{2}(I),\Phi(\mathscr{A}\cup\mathscr{B})=\Phi(\mathscr{A}).\Phi(\mathscr{B})$ , $\overline{\Phi}([\mathscr{A}]\cup[\mathscr{B}])=\overline{\Phi}([\mathscr{A}]).\overline{\Phi}([\mathscr{B}])$ .

If for all $i\in I$ , $|E_{i}|\geq 2$ then $\overline{\Phi}$ is injective and is a poset isomorphism.

Let us remark that for all $a,b\subseteq I$ such that $a\subseteq b$ , $\mathscr{F}_{a}\subseteq\mathscr{F}_{b}$ and that for all $a\in\mathscr{A}$ , $\mathscr{F}_{a}\subseteq\mathscr{F}_{\mathscr{A}}$ .

Indeed, $\pi_{a}=\pi^{b}_{a}\circ\pi_{b}$ so for all $f:E_{a}\to\mathbb{R}_{>0}$ , $f\circ\pi_{a}=(f\circ\pi^{b}_{a})\circ\pi_{b}$ , so $f\circ\pi_{a}\in\mathscr{F}_{b}$ . Let us note $1$ the constant function equal to $1$ . For all $a\subseteq I$ , $1\in\mathscr{F}_{\emptyset}\subseteq\mathscr{F}_{a}$ . For $a\in\mathscr{A}$ , $f\in\mathscr{F}_{a}$ , $f=f\underset{b\in\mathscr{A}\setminus\{a\}}{\prod}1$ , so $f\in\mathscr{F}_{\mathscr{A}}$ .

Let us now prove Proposition 2.3.

Proof.

Let $\mathscr{A},\mathscr{B}\in\mathscr{P}^{2}(I)$ such that $\mathscr{A}\leq\mathscr{B}$ and $f\in\mathscr{F}_{\mathscr{A}}$ such that $f=\underset{a\in\mathscr{A}}{\prod}g_{a}$ . For all $a\in\mathscr{A}$ there is $b(a)\in\mathscr{B}$ such that $a\subseteq b(a)$ , so $g_{a}\in\mathscr{F}_{b(a)}\subseteq\mathscr{F}_{\mathscr{B}}$ and $\underset{a\in\mathscr{A}}{\prod}g_{a}\in\mathscr{F}_{\mathscr{B}}$ as $\mathscr{F}_{\mathscr{B}}$ is a vector space. So $\mathscr{F}_{\mathscr{A}}\leq\mathscr{F}_{\mathscr{B}}$ .

Let $\mathscr{A}\leq\mathscr{B}$ and $\mathscr{B}\leq\mathscr{A}$ then $\mathscr{F}_{\mathscr{A}}\subseteq\mathscr{F}_{\mathscr{B}}$ and $\mathscr{F}_{\mathscr{B}}\subseteq\mathscr{F}_{\mathscr{A}}$ , then $\mathscr{F}_{\mathscr{A}}=\mathscr{F}_{\mathscr{B}}$ and $\overline{\Phi}$ is well defined and is a poset morphism.

For all $\mathscr{A},\mathscr{B}\in\mathscr{P}^{2}(I)$ , $\Phi(\mathscr{A})$ and $\Phi(\mathscr{B})$ are subspaces of $\Phi(\mathscr{A}\cup\mathscr{B})$ so $\Phi(\mathscr{A}).\Phi(\mathscr{B})\subseteq\Phi(\mathscr{A}\cup\mathscr{B})$ . For all $a\in\mathscr{A}\cup\mathscr{B}$ , $\mathscr{F}_{a}\subseteq\Phi(\mathscr{A}).\Phi(\mathscr{B})$ ; $\Phi(\mathscr{A}).\Phi(\mathscr{B})$ also being a vector space, $\Phi(\mathscr{A}\cup\mathscr{B})\subseteq\Phi(\mathscr{A}).\Phi(\mathscr{B})$ .

If for all $i\in I$ , $|E_{i}|\geq 2$ , Corollary 2 in [2] stipulates that $\mathscr{F}_{\mathscr{A}}=\mathscr{F}_{\mathscr{B}}$ if and only if $\hat{\mathscr{A}}=\hat{\mathscr{B}}$ but the proof of this results shows that if $\mathscr{F}_{\mathscr{A}}\subseteq\mathscr{F}_{\mathscr{B}}$ then $\hat{\mathscr{A}}\leq\hat{\mathscr{B}}$ . So $\Phi_{|\hat{\mathscr{P}^{2}(I)}}$ is injective therefore so is $\overline{\Phi}$ by remark 2.1. Furthermore $\overline{\Phi}([\mathscr{A}])\subseteq\overline{\Phi}([\mathscr{B}])$ implies $[\mathscr{A}]\leq[\mathscr{B}]$ , so $\overline{\Phi}$ is a poset isomorphism.

∎

*Remark 2.2**.*

Proposition 2.3 is a very general property for any increasing function $\Gamma$ from any poset $(\mathscr{A},\leq)$ to $\operatorname*{\textbf{Gr}}V$ the set of vector subspaces of a vector space $V$ . Indeed let $\operatorname{\mathscr{U}},\mathscr{V}\in\mathscr{P}(\mathscr{A})$ , $\sum\limits_{a\in\operatorname{\mathscr{U}}}\Gamma(a)+\sum\limits_{b\in\mathscr{V}}\Gamma(b)=\sum\limits_{a\in\operatorname{\mathscr{U}}\cup\mathscr{V}}\Gamma(a)$ , and if $\operatorname{\mathscr{U}}\leq\mathscr{V}$ , in the same sense than in Definition 2.2, then $\sum\limits_{a\in\operatorname{\mathscr{U}}}\Gamma(a)\subseteq\sum\limits_{a\in\mathscr{V}}\Gamma(a)$ . We enounced it as a proposition in order to clarify the presentation, as we use it as a know fact in later proofs.

3 Intersection property for factorisations on finite posets

In this section we still assume that $I$ is finite. For $a,b,c\subseteq I$ such that $b\cup c=a$ and $b\cap c=\emptyset$ , the map $\pi^{a}_{(c,b)}:E_{a}\to E_{b}\times E_{c}$ is a bijection. We will note for $u\in E_{b},v\in E_{c}$ , ${\pi^{a}_{(c,d)}}^{-1}(u,v)$ as $uv$ , in particular for $x\in E$ , $x_{a}=\pi^{a}_{b}(x_{a})\pi^{a}_{c}(x_{a})=x_{b}x_{c}$ . Thus we can also write, for any $a,b\subseteq I$ , $x_{a}=x_{a\cap b}x_{a\cap\overline{b}}$ .

Lemma 3.1.

Let $a\subseteq I$ , $\mathscr{B}\in\mathscr{P}^{2}(I)$ ,

[TABLE]

Proof.

Let $f\in\mathscr{F}_{a}$ and $(g_{b})_{b\in\mathscr{B}}\in\underset{b\in\mathscr{B}}{\prod}\mathscr{F}_{b}$ such that for all $x\in E$ ,

[TABLE]

There are $f_{a},(\tilde{g}_{b})_{b\in\mathscr{B}}$ such that for all $x\in E$ , $b\in\mathscr{B}$ , $f(x)=f_{a}(x_{a})$ , $g_{b}(x)=\tilde{g}_{b}(x)$ .

For all $x\in E$ ,

[TABLE]

Let $c_{\overline{a}}\in E_{\overline{a}}$ then, $\pi_{a}(x_{a}c_{\overline{a}})=x_{a}$ and $\pi_{b}(x_{a}c_{\overline{a}})=(x_{b\cap a}c_{b\cap\overline{a}})$ . So,

[TABLE]

Let us pose for all $b\in\mathscr{B}$ , $g_{1,b}(x_{b\cap a})=\tilde{g}_{b}(x_{b\cap a}c_{b\cap\overline{a}})$ , then $f=\prod\limits_{b\in B}g_{1,b}\circ\pi_{b\cap a}$ .

∎

Theorem 3.1 (Intersection property).

*Let $I$ be a finite set and let $(E_{i})_{i\in I}$ be family of non necessarily finite sets.

For $\mathscr{A},\mathscr{B}\in\mathscr{P}^{2}(I)$ , $f\in\mathbb{R}_{>0}^{E}$ , $(f_{a})_{a\in\mathscr{A}}\in\underset{a\in\mathscr{A}}{\prod}\mathbb{R}_{>0}^{E_{a}}$ and $(g_{b})_{b\in\mathscr{B}}\in\underset{b\in\mathscr{B}}{\prod}\mathbb{R}_{>0}^{E_{b}}$ such that, for all $x\in E$ ,

[TABLE]

There is $(h_{a,b})_{(a,b)\in\mathscr{A}\times\mathscr{B}}\in\underset{(a,b)\in\mathscr{A}\times\mathscr{B}}{\prod}\mathbb{R}_{>0}^{E_{a\cap b}}$ such that for all $x\in E$ ,

[TABLE]

Equivalently,

[TABLE]

Proof.

For $\mathscr{A},\mathscr{B}\in\mathscr{P}^{2}(I)$ , $\mathscr{A}\sqcap\mathscr{B}\leq\mathscr{A}$ , $\mathscr{A}\sqcap\mathscr{B}\leq\mathscr{B}$ . Therefore by Proposition.(2.3) $\mathscr{F}_{\mathscr{A}\sqcap\mathscr{B}}\subseteq\mathscr{F}_{\mathscr{A}}\cap\mathscr{F}_{\mathscr{B}}$ .

Let us prove the other inclusion by induction on $|\mathscr{A}|$ .

$|\mathscr{A}|=1$ is the previous Lemma.3.1.

Suppose that for all $\mathscr{A},\mathscr{B}\in\mathscr{P}^{2}(I)$ such that $|\mathscr{A}|=n\in\mathbb{N}$ , $\mathscr{F}_{\mathscr{A}}\cap\mathscr{F}_{\mathscr{B}}\subseteq\mathscr{F}_{\mathscr{A}\sqcap\mathscr{B}}$ .

Let $\mathscr{A}\in\mathscr{P}^{2}(I)$ , $|\mathscr{A}|=n+1$ . Take $\alpha\in\mathscr{A}$ , $|\mathscr{A}\setminus\{\alpha\}|=n$ . Pose $\mathscr{C}=\mathscr{A}\setminus\{\alpha\}$ . Let $f\in\mathscr{F}_{\mathscr{A}}\cap\mathscr{F}_{\mathscr{B}}$ , then there is $h_{1}\in\mathscr{F}_{\alpha}$ , $f_{1}\in\mathscr{F}_{\mathscr{C}}$ , $g\in\mathscr{F}_{\mathscr{B}}$ such that $f=h_{1}.f_{1}=g\quad$ .

So $h_{1}=\frac{g}{f_{1}}$ and $h_{1}\in\mathscr{F}_{\mathscr{C}}.\mathscr{F}_{\mathscr{B}}$ . So by Proposition.(2.3), $h_{1}\in\mathscr{F}_{\mathscr{C}\cup\mathscr{B}}$ . Then by Lemma.3.1 $h_{1}\in\mathscr{F}_{(\mathscr{C}\cup\mathscr{B})\sqcap\{\alpha\}}$ . But $(\mathscr{C}\cup\mathscr{B})\sqcap\{\alpha\}=(\mathscr{C}\sqcap\{\alpha\})\cup(\mathscr{B}\sqcap\{\alpha\})$ .

So $h_{1}\in\mathscr{F}_{\mathscr{C}\sqcap\{\alpha\}}.\mathscr{F}_{\mathscr{B}\sqcap\{\alpha\}}$ . Furthermore $f_{1}\in\mathscr{F}_{\mathscr{C}}$ so $f=h_{1}.f_{1}\in\mathscr{F}_{\mathscr{C}}.\mathscr{F}_{\mathscr{C}\sqcap\{\alpha\}}.\mathscr{F}_{\mathscr{B}\sqcap\{\alpha\}}$ . But $\mathscr{F}_{\mathscr{C}\sqcap\{\alpha\}}\subseteq\mathscr{F}_{\mathscr{C}}$ so $\mathscr{F}_{\mathscr{C}}.\mathscr{F}_{\mathscr{C}\sqcap\{\alpha\}}\subseteq\mathscr{F}_{\mathscr{C}}$ (it is even equal).

So there is $f_{2}\in\mathscr{F}_{\mathscr{C}}$ , $h_{2}\in\mathscr{F}_{\mathscr{B}\sqcap\{\alpha\}}$ such that $g=h_{2}.f_{2}$ . Therefore $f_{2}=\frac{g}{h_{2}}$ . But $\mathscr{F}_{\mathscr{B}\sqcap\{\alpha\}}\subseteq\mathscr{F}_{\mathscr{B}}$ so $f_{2}\in\mathscr{F}_{\mathscr{B}}$ .

Therefore by the induction hypothesis, $f_{2}\in\mathscr{F}_{\mathscr{C}\sqcap\mathscr{B}}$ , and so $f\in\mathscr{F}_{\mathscr{B}\sqcap\{\alpha\}}\mathscr{F}_{\mathscr{C}\sqcap\mathscr{B}}$ . One remarks that $(\{\alpha\}\sqcap\mathscr{B})\cup(\mathscr{C}\sqcap\mathscr{B})=\mathscr{A}\sqcap\mathscr{B}$ so $f\in\mathscr{F}_{\mathscr{A}\sqcap\mathscr{B}}$ . Which ends the proof by induction.

∎

Corollary 3.1.

For all $\mathscr{A},\mathscr{B}\in\mathscr{P}^{2}(I)$ ,

[TABLE]

Which can be rewritten as, for all $\mathscr{A},\mathscr{B}\in\overline{\mathscr{P}^{2}(I)}$ ,

[TABLE]

Proof.

$\mathscr{A}\sqcap\mathscr{B}\leq\mathscr{A}$ and $\mathscr{A}\sqcap\mathscr{B}\leq\mathscr{B}$ therefore $\mathscr{F}_{\mathscr{A}\sqcap\mathscr{B}}\subseteq\mathscr{F}_{\mathscr{A}}$ and $\mathscr{F}_{\mathscr{A}\sqcap\mathscr{B}}\subseteq\mathscr{F}_{\mathscr{B}}$ .

∎

4 Extension for infinite posets

In this section $I$ is any set; let us now use the summation convention instead of the product one. We would like to give a similar definition of $\mathscr{F}_{\mathscr{A}}$ but for infinite posets. If for any $\mathscr{A}\subseteq\mathscr{P}(I)$ , we defined $U(\mathscr{A})$ as $\sum\limits_{\begin{subarray}{c}a\in\mathscr{A}\\ |a|<+\infty\end{subarray}}U(a)$ then $U(I)=0$ . One needs to consider only lower sets in $\operatorname{\mathscr{U}}(\mathscr{P}(I))$ .

Let us call $U=U_{\mathscr{P}(I)}$ and $U(I)$ the poset constituted of the $U_{\mathscr{A}}$ ; let $\Psi$ be such that,

[TABLE]

In particular,

[TABLE]

Corollary 4.1.

For all $\mathscr{A},\mathscr{B}\in\operatorname{\mathscr{U}}(\mathscr{P}(I))$ ,

[TABLE]

Proof.

Let $f\in U_{\mathscr{A}}\cap U_{\mathscr{B}}$ . There are by definition, $\mathscr{C}_{1}\subseteq\mathscr{A}$ , $\mathscr{C}_{2}\subseteq\mathscr{B}$ , that are of finite cardinal, such that $f\in U_{\mathscr{C}_{1}}$ and $f\in U_{\mathscr{C}_{2}}$ . By Corollary 3.1, $f\in U_{\mathscr{C}_{1}\sqcap\mathscr{C}_{2}}$ . As $\mathscr{C}_{1}\sqcap\mathscr{C}_{2}\subseteq\mathscr{A}\cap\mathscr{B}$ , $f\in U_{\mathscr{A}\cap\mathscr{B}}$ . ∎

We will now show that a stronger version of Corollary.(4.1) holds for the intersection on any family of elements of $\operatorname{\mathscr{U}}(\mathscr{P}(I))$ .

Theorem 4.1.

For any family $(\mathscr{A}_{j})_{j\in J}$ of elements of $\operatorname{\mathscr{U}}(\mathscr{P}(I))$ ,

[TABLE]

Before giving a proof of this result, let us first state the following lemma,

Lemma 4.1.

Let $V_{1},V_{2}$ be two vector subspaces of $U$ . If for any finite $a\in\mathscr{P}(I)$ ,

[TABLE]

Then,

[TABLE]

Proof.

Let $v\in V_{1}$ , there is a finite collection of finite subsets of $I$ , $(a_{k})_{1\leq k\leq n}$ , such that, $v\in\underset{1\leq k\leq n}{\sum}U_{a_{k}}$ .

Therefore $v\in U_{(\underset{1\leq k\leq n}{\bigcup}a_{k})}$ . But $\underset{1\leq k\leq n}{\bigcup}a_{k}$ is of finite cardinal. So $v\in V_{2}\cap U_{(\underset{1\leq k\leq n}{\bigcup}a_{k})}\subseteq V_{2}$ .

Therefore $V_{1}\subseteq V_{2}$ . ∎

A direct consequence of Lemma.(4.1) is that if for any finite $a\in\mathscr{P}(I)$ ,

[TABLE]

Then $V_{1}=V_{2}$ .

Proof of the Theorem.(4.1). Let $(\mathscr{A}_{j})_{j\in J}$ be a family of elements of $\operatorname{\mathscr{U}}(\mathscr{P}(I))$ . Let $a\subseteq I$ of finite cardinal.

$\underset{j\in J}{\bigcap}U_{\mathscr{A}_{j}}\cap U_{a}=\underset{j\in J}{\bigcap}\left(U_{\mathscr{A}_{j}}\cap U_{a}\right)$ .

But, $U_{\mathscr{A}_{j}}\cap U_{a}=U_{\mathscr{A}_{j}\cap\widehat{\{a\}}}$ . And $\{U_{\mathscr{A}_{j}\cap\widehat{\{a\}}}:\quad j\in J\}$ is finite, so $\underset{j\in J}{\bigcap}\left(U_{\mathscr{A}_{j}}\cap U_{a}\right)$ can be rewritten as a finite intersection and by Corollary (4.1),

[TABLE]

By Lemma.(4.1),

[TABLE]

The other inclusion is always true (Remark (2.1)) as for any $i\in J$ , $\bigcap\limits_{j\in J}\mathscr{A}_{j}\subseteq\mathscr{A}_{i}$ . ∎

*Remark 4.1**.*

This proposition can also be stated in terms of the $\mathscr{F}_{\mathscr{A}}$ by taking the exponential:

[TABLE]

5 Applications

5.1 Minimal factorisation

In [2] a proof of the existence of a minimum factorisation is given, based on the existence of a decomposition into interaction subspaces, when $E$ is finite and $I$ finite. Let us recall that in a poset $\mathscr{A}$ , $a\in\mathscr{A}$ is said to be a minimum if any $b\in\mathscr{A}$ is such that $a\leq b$ . Let us give a proof of this result using Theorem 4.1 so without assuming that $I$ nor $E$ are finite.

Corollary 5.1.

Let $I$ be any set and $E=\prod_{i\in I}E_{i}$ be the product of any collection of sets; for all $f\in\mathscr{F}$ let us call $\mathscr{F}(f)=\{\mathscr{F}_{\mathscr{A}}|\quad f\in\mathscr{F}_{\mathscr{A}}\}$ . $\mathscr{F}(f)$ admits a minimum and we say that $f$ admits a minimum decomposition.

Proof.

Let $\mathscr{M}(f)=\{\mathscr{A}\in\operatorname{\mathscr{U}}(\mathscr{P}(I))|\quad f\in\mathscr{F}_{\mathscr{A}}\}$ . From Theorem 4.1, one has that,

[TABLE]

Any $\text{K}\in\mathscr{F}(f)$ contains $\bigcap\limits_{\mathscr{A}\in\mathscr{M}(f)}\mathscr{F}_{\mathscr{A}}$ , therefore $\mathscr{F}_{\bigcap\limits_{\mathscr{A}\in\mathscr{M}(f)}\mathscr{A}}$ is the minimum of $\mathscr{F}(f)$ .

∎

5.2 Markov properties and Hammersley-Clifford

Let us consider four random variables $W,X,Y,Z$ taking values respectively in $E_{0}$ , $E_{1}$ , $E_{2}$ , $E_{3}$ finite sets, with strictly positive joint law. Let us recall the law of $X$ conditionally to $Y$ ,

[TABLE]

Conditional independence is usually defined as follows,

[TABLE]

Let us pose $I=\{0,1,2,3\}$ we identify $\underset{i\in I}{\Pi}E_{i}$ with $E_{0}\times E_{1}\times E_{2}\times E_{3}$ by the following $x\mapsto(x(0),x(1),x(2),x(3))$ and then $\mathscr{F}_{\mathscr{A}}$ to sets in $\mathbb{R}^{E_{0}\times E_{1}\times E_{2}\times E_{3}}_{>0}$ . Let $a=\{1,3\}$ , $b=\{2,3\}$ and $\mathscr{A}=\{a,b\}$ ,

[TABLE]

Proposition 5.1 (Bayesian or Graphoid intersection property).

[TABLE]

Proof.

Let $a=\{0,1,3\}$ , $b=\{0,2,3\}$ , $c=\{1,2,3\}$ , $d=\{1,3\}$ and $\mathscr{A}=\{a,b\}$ , $\mathscr{B}=\{b,c\}$ , $\mathscr{C}=\{b,d\}$ . $\mathscr{A}\sqcap\mathscr{B}\equiv\{a\cap c,b\}=\{d,b\}$ so $\mathscr{F}_{\mathscr{A}}\cap\mathscr{F}_{\mathscr{B}}\subseteq\mathscr{F}_{\mathscr{C}}$ . ∎

Corollary 5.2.

*(Hammersley-Clifford)

Let $G=(I,A)$ be a finite graph. For all strictly positive probability law, $P_{X}$ , on a finite $E$ ,

[TABLE]

For any pair $(i,j)$ of elements of $I$ and for all probability law $P$ ,

[TABLE]

Let $\mathscr{A}_{P}=\underset{(i,j):\text{ }i\notin\partial j}{{{{\sqcap}}}}[i,j]$ . Similarly, for all $i\in I$ ,

[TABLE]

Let $\mathscr{A}_{L}=\underset{i}{{{{\sqcap}}}}[i]$ . The following lemma is the version of the Hammersley-Clifford on graphs that we will then translate to graphical models by applying $\Psi$ .

Lemma 5.1.

[TABLE]

Proof.

Firstly, $\hat{\mathscr{A}}_{L}=\bigcap\limits_{(k,l):\text{ }k\notin\partial l}\widehat{[k,l]}$ . Let $a\in\hat{\mathscr{A}}_{L}$ and assume that $a$ is not a clique. So there is $i,j\in a$ such that $i\notin\partial j$ . But $a\in\widehat{[i,j]}$ , so $a\subseteq i\cup(I\setminus\{i,j\})$ or $a\subseteq j\cup(I\setminus\{i,j\})$ . It is not possible as any of these two sets separate $i$ and $j$ . So $a$ must be a clique. In other words, $\{i,j\}\subseteq a$ but $\{i,j\}\not\subseteq i\cup(I\setminus\{i,j\})$ and $\{i,j\}\not\subseteq j\cup(I\setminus\{i,j\})$ ( $\{i,j\}\notin\widehat{[i,j]}$ ). So if $a$ is not a clique of $G$ , $a\not\in\hat{\mathscr{A}}_{L}$ .

Suppose $a$ is a clique of $G$ . Let $i,j\in I$ such that $i\notin\partial j$ . $i\cup(I\setminus\{i,j\})$ and $j\cup(I\setminus\{i,j\})$ separate $i,j$ . So a clique most be in only one of the two sets. To be more formal, for any subset $a$ of $I$ , there is $b\subseteq I\setminus\{i,j\}$ , such that $a=b$ or $a=b\cup i$ or $a=b\cup j$ or $a=b\cup\{i,j\}$ . As $a$ is a clique $\{i,j\}\not\subseteq a$ . So there is $b\subseteq I\setminus\{i,j\}$ , such that $a=b$ or $a=b\cup i$ or $a=b\cup j$ . Which is equivalent to saying that $a\in\hat{[i,j]}$ .

So we proved that,

[TABLE]

For the local case, $\hat{\mathscr{A}}_{L}$ , one has to remark that $a$ is a clique of $G$ if and only if for all $i\in a$ , $a\subseteq\{i,\partial i\}$ (for exemple see slide $6$ [4]).

∎

Proof of Corollary 5.2. Let us remark that $P_{X}\in P(G)$ if and only if $P_{X}\in\bigcap\limits_{(i,j):\text{ }i\notin\partial j}U_{[i,j]}$ and similarly $P_{X}\in L(G)$ if and only if $P_{X}\in\bigcap\limits_{i\in I}U_{[i]}$ .

As $P_{X}$ is stricly positive, by Corollary 3.1,

[TABLE]

∎

Similarly, when $G=(I,D)$ is any graph and $(E_{i})_{i\in I}$ is any collection of sets, Lemma 5.1 still holds and one has the following result which extends the Hammersley-Clifford theorem.

Corollary 5.3.

[TABLE]

Acknowledgement

This work resulted from research supported by the University of Paris. I am very grateful to Daniel Bennequin for our numerous discussions.

Bibliography9

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Bourbaki [1939] Nicolas Bourbaki. Théorie des ensembles . Springer, 1939.
2Chan and Yeung [2011] T. H. Chan and Raymond W. Yeung. Probabilistic inference using function factorization and divergence minimization. In M. Dehmer et al., editor, Towards an Information Theory of Complex Networks: Statistical Methods and Applications , chapter 3, pages 47–74. Springer, 2011.
3Dawid [2001] Alexander Philip Dawid. Separoids: A mathematical framework for conditional independence and irrelevance. Annals of Mathematics and Artificial Intelligence 3 , 2001.
4[4] Helge Langseth. The Hammersley-Clifford Theorem and its impact on modern statistics.
5Lauritzen [1996] Steffen L. Lauritzen. Graphical Models . Oxford Science Publications, 1996.
6Pearl [1988] Judea Pearl. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference . Morgan Kaufmann Publishers, 1988.
7Sergeant-Perthuis [2019] Grégoire Sergeant-Perthuis. Intersection property and interaction decomposition. ar Xiv:1904.09017 v 1, 2019.
8Speed [1979] Terry P. Speed. A note on nearest-neighbour gibbs and markov probabilities. Sankhyā: The Indian Journal of Statistics, Series A , 1979.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Bayesian/Graphoid intersection property for factorisation spaces.

Abstract

keywords:

keywords:

1 Introduction

1.1 Intersection property

1.1.1 Intersection property and graphoids

Proposition 5.1 (Intersection property).

Definition 1.1** (Semi-graphoid, graphoid [6][5]).**

1.1.2 Factorisation spaces

Definition 1.2** (Factorisation space).**

Notation 1*.*

Notation 2*.*

Definition 1.3** (Generalized factorisation spaces).**

1.1.3 Main theorem

Theorem 4.1 (Intersection property for factorisation spaces).

1.2 Hammersley-Clifford Theorem

1.2.1 Graphical models: Markov fields and Gibbs fields

Definition 1.4** (Gibbs States).**

Remark 1.1*.*

Definition 1.5** (Markov properties).**

1.2.2 The decomposition into interaction subspaces

Theorem 1.1** (Decomposition into interaction subspaces).**

1.2.3 A new proof of the Hammersley-Clifford Theorem

Theorem 1.2** (Hammersley-Clifford).**

1.3 Structure of this document

2 Order on partial coverings and factorisation spaces

2.1 Order on partial coverings

Definition 2.1**.**

Proposition 2.1**.**

Proof.

Definition 2.2**.**

Proposition 2.2**.**

Proof.

Example 1*.*

Remark 2.1*.*

2.2 Increasing function from P2(I)\mathscr{P}^{2}(I)P2(I) to the poset of factorisation spaces

Proposition 2.3**.**

Proof.

Remark 2.2*.*

3 Intersection property for factorisations on finite posets

Lemma 3.1**.**

Proof.

Theorem 3.1** (Intersection property).**

Proof.

Corollary 3.1**.**

Proof.

4 Extension for infinite posets

Corollary 4.1**.**

Proof.

Theorem 4.1**.**

Lemma 4.1**.**

Proof.

Remark 4.1*.*

5 Applications

5.1 Minimal factorisation

Corollary 5.1**.**

Proof.

5.2 Markov properties and Hammersley-Clifford

Proposition 5.1** (Bayesian or Graphoid intersection property).**

Proof.

Corollary 5.2**.**

Lemma 5.1**.**

Proof.

Corollary 5.3**.**

Acknowledgement

Definition 1.1 (Semi-graphoid, graphoid [6][5]).

Definition 1.2 (Factorisation space).

*Notation 1**.*

*Notation 2**.*

Definition 1.3 (Generalized factorisation spaces).

Definition 1.4 (Gibbs States).

*Remark 1.1**.*

Definition 1.5 (Markov properties).

Theorem 1.1 (Decomposition into interaction subspaces).

Theorem 1.2 (Hammersley-Clifford).

Definition 2.1.

Proposition 2.1.

Definition 2.2.

Proposition 2.2.

*Example 1**.*

*Remark 2.1**.*

2.2 Increasing function from $\mathscr{P}^{2}(I)$ to the poset of factorisation spaces

Proposition 2.3.

*Remark 2.2**.*

Lemma 3.1.

Theorem 3.1 (Intersection property).

Corollary 3.1.

Corollary 4.1.

Theorem 4.1.

Lemma 4.1.

*Remark 4.1**.*

Corollary 5.1.

Proposition 5.1 (Bayesian or Graphoid intersection property).

Corollary 5.2.

Lemma 5.1.

Corollary 5.3.