Bayesian/Graphoid intersection property for factorisation spaces
Gr\'egoire Sergeant-Perthuis

TL;DR
This paper generalizes the intersection property in Bayesian networks to factorisation spaces, providing new proofs and extending classical theorems like Hammersley-Clifford to broader settings, including non-finite graphs.
Contribution
It introduces a general intersection property for factorisation spaces, offers a novel proof of existing results, and extends the Hammersley-Clifford theorem to non-finite graphs.
Findings
Generalized intersection property for factorisation spaces
New proof of the Hammersley-Clifford theorem
Extension of decomposition into interaction subspaces to vector spaces
Abstract
We remark that Pearl's Graphoid intersection property, also called intersection property in Bayesian networks, is a particular case of a general intersection property, in the sense of intersection of coverings, for factorisation spaces, also coined as factorisation models, factor graphs or by Lauritzen in his reference book 'Graphical Models' as hierarchical model subspaces. A particular case of this intersection property appears in Lauritzen's book as a consequence of the decomposition into interaction subspaces; the novel proof that we give of this result allows us to extend it in the most general setting. It also allows us to give a direct and new proof of the Hammersley-Clifford theorem transposing and reducing it to a corresponding statement for graphs, justifying formally the geometric intuition of independency, and extending it to non finite graphs. This intersection property is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference
Bayesian/Graphoid intersection property for factorisation spaces.
Grégoire Sergeant-Perthuislabel=e1][email protected] [ IMJ-PRG, Université de Paris
Abstract
We remark that Pearl’s Graphoid intersection property, also called intersection property in Bayesian networks, is a particular case of a general intersection property, in the sense of intersection of coverings, for factorisation spaces, also coined as factorisation models, factor graphs or by Lauritzen in his reference book Graphical Models as hierarchical model subspaces. A particular case of this intersection property appears in Lauritzen’s book as a consequence of the decomposition into interaction subspaces; the novel proof that we give of this result allows us to extend it in the most general setting. It also allows us to give a direct and new proof of the Hammersley-Clifford theorem transposing and reducing it to a corresponding statement for graphs, justifying formally the geometric intuition of independency, and extending it to non finite graphs. This intersection property is the starting point for a generalization of the decomposition into interaction subspaces to collections of vector spaces [7].
62H22,
00X00,
06F25,
Hammersley-Clifford,
Graphical models,
keywords:
[class=MSC2020]
keywords:
\SetNewAudience
long
1 Introduction
1.1 Intersection property
1.1.1 Intersection property and graphoids
To describe the structure of dependencies of a set of random variables, as well said by Judea Pearl in Chapter 3 of [6], one can introduce a ternary operator corresponding to the conditional independence:
”The notion of informational relevance is given […] through the device of conditional independence, which successfully captures our intuition about how dependencies should change in response to news facts”.
For any three random variables with discrete values, we will note X\mathchoice{\mathrel{\hbox to0.0pt{\displaystyle\perp\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{\textstyle\perp\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{\scriptstyle\perp\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{\scriptscriptstyle\perp\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}Y|_{P}Z the fact that is independent of conditionally to (see Section 5.2 Equation 5.2); in the previous expression will be omitted from now on, as in literature.
The intersection property in Bayesian networks, as found in [9] (Chapter 2 Proposition 2.12) or [6] (Chapter 3 Theorem 1), is the following proposition.
Proposition 5.1 (Intersection property).
Let be four random variables that take values in a finite set and for which the probability density is stricly positive, then,
[TABLE]
Semi-graphoids and graphoids were introduced to give a formal set of axioms, on \mathchoice{\mathrel{\hbox to0.0pt{\displaystyle\perp\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{\textstyle\perp\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{\scriptstyle\perp\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{\scriptscriptstyle\perp\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}, for conditional independence (see [3], [6]); in this context Proposition 5.1 is called the intersection axiom.
Definition 1.1** (Semi-graphoid, graphoid [6][5]).**
A semi-graphoid structure on a collection is a ternary relation on subsets of , that we shall note as X\mathchoice{\mathrel{\hbox to0.0pt{\displaystyle\perp\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{\textstyle\perp\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{\scriptstyle\perp\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{\scriptscriptstyle\perp\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}Y|Z, such that, for any, , disjoint subsets of ,
if X\mathchoice{\mathrel{\hbox to0.0pt{\displaystyle\perp\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{\textstyle\perp\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{\scriptstyle\perp\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{\scriptscriptstyle\perp\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}Y|Z then Y\mathchoice{\mathrel{\hbox to0.0pt{\displaystyle\perp\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{\textstyle\perp\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{\scriptstyle\perp\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{\scriptscriptstyle\perp\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}X|Z; 2. 2.
if X\mathchoice{\mathrel{\hbox to0.0pt{\displaystyle\perp\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{\textstyle\perp\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{\scriptstyle\perp\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{\scriptscriptstyle\perp\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}Y|Z and , then U\mathchoice{\mathrel{\hbox to0.0pt{\displaystyle\perp\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{\textstyle\perp\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{\scriptstyle\perp\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{\scriptscriptstyle\perp\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}Y|Z; 3. 3.
if X\mathchoice{\mathrel{\hbox to0.0pt{\displaystyle\perp\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{\textstyle\perp\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{\scriptstyle\perp\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{\scriptscriptstyle\perp\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}Y|Z and , then X\mathchoice{\mathrel{\hbox to0.0pt{\displaystyle\perp\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{\textstyle\perp\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{\scriptstyle\perp\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{\scriptscriptstyle\perp\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}Y|Z\cup U; 4. 4.
ifX\mathchoice{\mathrel{\hbox to0.0pt{\displaystyle\perp\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{\textstyle\perp\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{\scriptstyle\perp\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{\scriptscriptstyle\perp\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}Y|Z and X\mathchoice{\mathrel{\hbox to0.0pt{\displaystyle\perp\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{\textstyle\perp\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{\scriptstyle\perp\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{\scriptscriptstyle\perp\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}W|Y\cup Z then X\mathchoice{\mathrel{\hbox to0.0pt{\displaystyle\perp\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{\textstyle\perp\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{\scriptstyle\perp\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{\scriptscriptstyle\perp\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}W\cup Y|Z.
It is a graphoid if furthermore it satisfies the intersection axiom.
1.1.2 Factorisation spaces
Let us suppose that take values respectively in finite spaces , , . The fact that is independent of conditionally to can be restated as a factorisation property on ; for simplicity let us assume that is sticly positive, then X\mathchoice{\mathrel{\hbox to0.0pt{\displaystyle\perp\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{\textstyle\perp\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{\scriptstyle\perp\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{\scriptscriptstyle\perp\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}Y|Z if and only if for any ,
[TABLE]
where , , are repectively the marginal probabilities of , and .
If one notes the set of strictly positive functions on that only depend on and the set of functions that are the product of a function of and a function of , then the intersection property can be restated as, for any strictly positive proability law ,
[TABLE]
We are interested in generalizing this result to intersections of factorisation spaces that we will define now.
Definition 1.2** (Factorisation space).**
Let be a finite set, let , where is the set of subsets of . Let be a collection of sets, let for any ; for , we will denote its projection onto . The factorisation space over is defined as follows,
[TABLE]
Notation 1*.*
From now on we shall note as .
One can extend the previous definition to the case where is non finite. To do so let us introduce a notation; for any , let,
[TABLE]
Notation 2*.*
is called the lower set of and the set of lower sets will be denoted as , i.e. .
Definition 1.3** (Generalized factorisation spaces).**
Let be any set and let , any is in if and only if there is , a collection and a collection such that for any , and for any ,
[TABLE]
In particular,
[TABLE]
1.1.3 Main theorem
The result we want to emphasize in this document is that an intersection property still holds for factorisation spaces.
Theorem 4.1 (Intersection property for factorisation spaces).
Let be any set, let be any collection of sets. For any family of elements of ,
[TABLE]
A particular case of the intersection property appears in Lauritzen’s Graphical Models [5] in Appendix B Proposition B.5 as a consequence of the decomposition into interaction subspaces, that we will introduce in the next subsection. The proof we give of this result holds in a more general setting and is a direct one that does not rely on the decomposition into interaction subspaces. In fact in [7] we show the converse statement that Equation 1.8 is a structure property that characterizes collections of vector spaces that can be decomposed into direct sums of subspaces, similarly to the decomposition into interaction subspaces, in other words satisfying the intersection property implies that this collection has such decomposition.
A direct consequence of Theorem 4.1 is that there is a complete lattice morphism between and factorisation spaces. This remark enable us to prove the Hammersley-Clifford Theorem in a direct and novel manner, pushing properties of the graph of dependencies directly on its graphical model, that we will now sketch and allows us to give a generalization of the Hammersley-Clifford theorem.
1.2 Hammersley-Clifford Theorem
1.2.1 Graphical models: Markov fields and Gibbs fields
A graphical model is a way to express the interactions of random variables through the properties of a graph. For example, let be a finite set and let be a collection of random variables. Let us associate to each random variable a vertex of an undirected graph , where is its set of vertices and its set of edges; one could say that two random variables are in interaction if their vertices are nearest-neighbours in and expect that there is a collection such that,
[TABLE]
This is an example of a Gibbs state with respect to a potential. The adjacent elements of will be denoted as .
Definition 1.4** (Gibbs States).**
Let be a finite set and be a collection of finite sets, let and let be a collection of interactions, which we shall call a potentiel; a Gibbs state with respect to a potential is defined as follows, for any ,
[TABLE]
Remark 1.1*.*
Any probability law on is a Gibbs state; furthermore if there is a potential such that a probability law is a Gibbs state with respect to , then is in the factorisation space over .
There is an other way to specify the interactions of the random variables from the properties of a graph. For example on can imagine that if two vertices are connected only through a third vertex , i.e. any path from to pass by , this would mean that the corresponding random variables, , are dependent only through , i.e. that,
[TABLE]
This is a particular case of spatial Markov property for the probability law of the random variables. There are several, a priori, different way to translate conditional connectedness properties of the graph into conditional independence properties, let us define two of such. Let for , denote .
Definition 1.5** (Markov properties).**
Let be a finite graph, a stricly positive probability on a finite set obeys,
the pairwise Markov property relative to , if for any pair of non-adjacent vertices
[TABLE] 2. 2.
the local Markov property relative to , if for any vectex ,
[TABLE]
And we call the respective sets , .
As we will see the Hammersley-Clifford theorem asserts that the two points of view for reading the interactions from a graph, the Gibbs state and Markov property point of views, are in fact equivalent for a strictly positve probability law. One of the ways to prove the Hammersley-Clifford theorem is to build a decomposition into interaction subspaces of the factorisation spaces [8], we shall therefore give a brief presentation of this decomposition even though we shall not be using it in the rest of this document.
1.2.2 The decomposition into interaction subspaces
Let be a finite set and let be a collection of finite sets. Let us consider the canonical scalar product on , i.e. for any ,
[TABLE]
Let for any ,
[TABLE]
Theorem 1.1** (Decomposition into interaction subspaces).**
There is a collection of vector subspaces of , , such that, for any ,
[TABLE]
and any two , with , are orthogonal to one another.
Several proofs of this result can be found in [8].
1.2.3 A new proof of the Hammersley-Clifford Theorem
The Hammersley-Clifford theorem states that any Markov condition for a stricly positive probability law can be restated as a condition on the locality of the interactions of its potential, in other words Markov conditions correspond to some factorisation spaces.
Let be a graph; a clique of is a subset of such that every two distinct vertices are adjacent. We will note the set of its cliques.
Theorem 1.2** (Hammersley-Clifford).**
Let be a finite graph. For all strictly positive probability law on a finite set ,
[TABLE]
The intesection property for factorisation spaces enables us to bring back the proof of the Hammersley-Clifford theorem to a general property on graphs, let us sketch the proof that will will present in more details in this document.
Let be a pair of elements of , let and let,
[TABLE]
Similarly for all , let and let .
By remarking that,
[TABLE]
and applying the intersection property for factorisation spaces one ends the proof.
1.3 Structure of this document
In Section 2, we will give some general properties on partial coverings and their natural order making it a preorder with join and meet. Proposition 2.3 states that there is an increasing function between the preorder set of partial coverings and the poset of factorisation spaces that preserve the join.
In Section 3 we prove the intersection property (Theorem 3.1) and as a consequence we show that the increasing function also preserves meets. In this section we do not assume the to be finite, however we assume to be finite.
In the next section, Section 4, we extend the intersection property to any sets , Theorem 4.1.
Finaly in Section 5 we give apply the previous theorems giving new proofs for classical results around factorisation spaces that allow us to extend them, in particular we give a generalization of the Hammersley-Clifford theorem.
2 Order on partial coverings and factorisation spaces
In this section is a finite set, let be a product of any sets. For such that let be the projection of onto where by convention is the projection on the set with one element ; for , we shall note as . In particular is the set of stricly positive constant functions and for any we note as .
can be seen as a vector space for the product law and the exponentiation and similarly for the product of these spaces. In this section we keep the, unusual, product convention to stay closer to the spirit of factorisation.
2.1 Order on partial coverings
Any subset can be seen as a partial covering of of support . The order for partial covering that we will now introduce is a direct extension of the usual one for coverings.
Definition 2.1**.**
Let us define an intersection and a relation R on . For all ,
[TABLE]
[TABLE]
Proposition 2.1**.**
R* is pre-order that we will note and for ,*
[TABLE]
[TABLE]
[TABLE]
where is the logic operator ”and”.
Proof.
Let . For all , . Therefore . Assume, and , then,
[TABLE]
For there is and such that so and . Therefore is a pre-order.
[TABLE]
So .
[TABLE]
So .
Let then there is , such that, . So, .
Assume and then for all there is such that , for all there is such that . So for there is such that or such that . However and so . The last is proven the same way noting that , implies .
∎
Definition 2.2**.**
Let us introduce the usual equivalence relation for a pre-order (see E.III.3 [1]), for all ,
[TABLE]
Let , with any poset, be a pre-order morphism, in the sense that for any such that , . is said to preserve the equivalence relation when for all , . In what follows we supporse that preserves the equivalence relation.
If, for any , with a poset, that is a pre-order morphism and that preserves the equivalence relation, there is a unique that is a poset morphism such that , then we will say that verifies the universal property .
Let us note as .
Proposition 2.2**.**
*If two pre-order morphism, , , that preserve the equivalence relation, verify the universal property , then there is a poset isomorphism between and .
Let us define as,
[TABLE]
There is a unique order on such that is a pre-order morphism and verifies . It verifies for all ,
[TABLE]
Furthermore one can define a union on and an intersection such for all ,,
[TABLE]
Equations 2.3,2.4,2.5 stay true on . Let us recall them, ,
[TABLE]
[TABLE]
[TABLE]
Proof.
Let , , that preserve the equivalence relation, verify the universal property . Then there is , , two poset morphisms, such that , . So , in other words the following diagram commutes:
[TABLE]
But , therefore by the unicity statement in , . One also has that , so . Therefore is a poset isomorphism between and .
Le us define the following relation for ,
[TABLE]
is a poset (see E.III.3 [1]).
Let , with a poset, be a pre-order morphism that preserves the equivalence relation. By the universal property for the quotient map,there is a unique such that . For , suppose , then and . and , so . Therefore is a poset morphism.
Suppose that there are two orders and on such that and are pre-order morphism and verify . Then there is , a poset isomorphism, such that . But by the universal property for the quotient map, . Therefore is a poset isomorphism. For all ,
[TABLE]
So .
Let , such that , , then by property Eq 2.4, and , so .
Similarly, by property Eq 2.5 and , so . Therefore the union and intersection given by Eq 2.8 are well defined.
For any ,
[TABLE]
[TABLE]
[TABLE]
Therefore, . And one proceeds similarly for the two other properties.
∎
We will now also note as .
Example 1*.*
Consider . and this is true for any element of .
[TABLE]
[TABLE]
Remark 2.1*.*
By construction, any section of induces a poset isomorphism. For example the application that sends to its lower set induces a section of ; and is a poset isomorphism. On , is equal to the inclusion and .
2.2 Increasing function from to the poset of factorisation spaces
Let us denote the poset of factorization spaces.
Proposition 2.3**.**
Let,
[TABLE]
[TABLE]
* is a poset morphism. For all , .
If for all , then is injective and is a poset isomorphism.
Let us remark that for all such that , and that for all , .
Indeed, so for all , , so . Let us note the constant function equal to . For all , . For , , , so .
Let us now prove Proposition 2.3.
Proof.
Let such that and such that . For all there is such that , so and as is a vector space. So .
Let and then and , then and is well defined and is a poset morphism.
For all , and are subspaces of so . For all , ; also being a vector space, .
If for all , , Corollary 2 in [2] stipulates that if and only if but the proof of this results shows that if then . So is injective therefore so is by remark 2.1. Furthermore implies , so is a poset isomorphism.
∎
Remark 2.2*.*
Proposition 2.3 is a very general property for any increasing function from any poset to the set of vector subspaces of a vector space . Indeed let , , and if , in the same sense than in Definition 2.2, then . We enounced it as a proposition in order to clarify the presentation, as we use it as a know fact in later proofs.
3 Intersection property for factorisations on finite posets
In this section we still assume that is finite. For such that and , the map is a bijection. We will note for , as , in particular for , . Thus we can also write, for any , .
Lemma 3.1**.**
Let , ,
[TABLE]
Proof.
Let and such that for all ,
[TABLE]
There are such that for all , , , .
For all ,
[TABLE]
Let then, and . So,
[TABLE]
Let us pose for all , , then .
∎
Theorem 3.1** (Intersection property).**
*Let be a finite set and let be family of non necessarily finite sets.
For , , and such that, for all ,
[TABLE]
There is such that for all ,
[TABLE]
Equivalently,
[TABLE]
Proof.
For , , . Therefore by Proposition.(2.3) .
Let us prove the other inclusion by induction on .
is the previous Lemma.3.1.
Suppose that for all such that , .
Let , . Take , . Pose . Let , then there is , , such that .
So and . So by Proposition.(2.3), . Then by Lemma.3.1 . But .
So . Furthermore so . But so (it is even equal).
So there is , such that . Therefore . But so .
Therefore by the induction hypothesis, , and so . One remarks that so . Which ends the proof by induction.
∎
Corollary 3.1**.**
For all ,
[TABLE]
Which can be rewritten as, for all ,
[TABLE]
Proof.
and therefore and .
∎
4 Extension for infinite posets
In this section is any set; let us now use the summation convention instead of the product one. We would like to give a similar definition of but for infinite posets. If for any , we defined as then . One needs to consider only lower sets in .
Let us call and the poset constituted of the ; let be such that,
[TABLE]
In particular,
[TABLE]
Corollary 4.1**.**
For all ,
[TABLE]
Proof.
Let . There are by definition, , , that are of finite cardinal, such that and . By Corollary 3.1, . As , . ∎
We will now show that a stronger version of Corollary.(4.1) holds for the intersection on any family of elements of .
Theorem 4.1**.**
For any family of elements of ,
[TABLE]
Before giving a proof of this result, let us first state the following lemma,
Lemma 4.1**.**
Let be two vector subspaces of . If for any finite ,
[TABLE]
Then,
[TABLE]
Proof.
Let , there is a finite collection of finite subsets of , , such that, .
Therefore . But is of finite cardinal. So .
Therefore . ∎
A direct consequence of Lemma.(4.1) is that if for any finite ,
[TABLE]
Then .
Proof of the Theorem.(4.1). Let be a family of elements of . Let of finite cardinal.
.
But, . And is finite, so can be rewritten as a finite intersection and by Corollary (4.1),
[TABLE]
By Lemma.(4.1),
[TABLE]
The other inclusion is always true (Remark (2.1)) as for any , . ∎
Remark 4.1*.*
This proposition can also be stated in terms of the by taking the exponential:
[TABLE]
5 Applications
5.1 Minimal factorisation
In [2] a proof of the existence of a minimum factorisation is given, based on the existence of a decomposition into interaction subspaces, when is finite and finite. Let us recall that in a poset , is said to be a minimum if any is such that . Let us give a proof of this result using Theorem 4.1 so without assuming that nor are finite.
Corollary 5.1**.**
Let be any set and be the product of any collection of sets; for all let us call . admits a minimum and we say that admits a minimum decomposition.
Proof.
Let . From Theorem 4.1, one has that,
[TABLE]
Any contains , therefore is the minimum of .
∎
5.2 Markov properties and Hammersley-Clifford
Let us consider four random variables taking values respectively in , , , finite sets, with strictly positive joint law. Let us recall the law of conditionally to ,
[TABLE]
Conditional independence is usually defined as follows,
[TABLE]
Let us pose we identify with by the following and then to sets in . Let , and ,
[TABLE]
Proposition 5.1** (Bayesian or Graphoid intersection property).**
[TABLE]
Proof.
Let , , , and , , . so . ∎
Corollary 5.2**.**
*(Hammersley-Clifford)
Let be a finite graph. For all strictly positive probability law, , on a finite ,
[TABLE]
For any pair of elements of and for all probability law ,
[TABLE]
Let . Similarly, for all ,
[TABLE]
Let . The following lemma is the version of the Hammersley-Clifford on graphs that we will then translate to graphical models by applying .
Lemma 5.1**.**
[TABLE]
Proof.
Firstly, . Let and assume that is not a clique. So there is such that . But , so or . It is not possible as any of these two sets separate and . So must be a clique. In other words, but and (). So if is not a clique of , .
Suppose is a clique of . Let such that . and separate . So a clique most be in only one of the two sets. To be more formal, for any subset of , there is , such that or or or . As is a clique . So there is , such that or or . Which is equivalent to saying that .
So we proved that,
[TABLE]
For the local case, , one has to remark that is a clique of if and only if for all , (for exemple see slide [4]).
∎
Proof of Corollary 5.2. Let us remark that if and only if and similarly if and only if .
As is stricly positive, by Corollary 3.1,
[TABLE]
[TABLE]
∎
Similarly, when is any graph and is any collection of sets, Lemma 5.1 still holds and one has the following result which extends the Hammersley-Clifford theorem.
Corollary 5.3**.**
[TABLE]
Acknowledgement
This work resulted from research supported by the University of Paris. I am very grateful to Daniel Bennequin for our numerous discussions.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Bourbaki [1939] Nicolas Bourbaki. Théorie des ensembles . Springer, 1939.
- 2Chan and Yeung [2011] T. H. Chan and Raymond W. Yeung. Probabilistic inference using function factorization and divergence minimization. In M. Dehmer et al., editor, Towards an Information Theory of Complex Networks: Statistical Methods and Applications , chapter 3, pages 47–74. Springer, 2011.
- 3Dawid [2001] Alexander Philip Dawid. Separoids: A mathematical framework for conditional independence and irrelevance. Annals of Mathematics and Artificial Intelligence 3 , 2001.
- 4[4] Helge Langseth. The Hammersley-Clifford Theorem and its impact on modern statistics.
- 5Lauritzen [1996] Steffen L. Lauritzen. Graphical Models . Oxford Science Publications, 1996.
- 6Pearl [1988] Judea Pearl. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference . Morgan Kaufmann Publishers, 1988.
- 7Sergeant-Perthuis [2019] Grégoire Sergeant-Perthuis. Intersection property and interaction decomposition. ar Xiv:1904.09017 v 1, 2019.
- 8Speed [1979] Terry P. Speed. A note on nearest-neighbour gibbs and markov probabilities. Sankhyā: The Indian Journal of Statistics, Series A , 1979.
