A note related to the CS decomposition and the BK inequality for discrete determinantal processes
Andr\'e Goldman

TL;DR
This paper establishes that the BK inequality holds for increasing events in discrete determinantal processes and explores their relationship with the CS decomposition, providing new insights into their structure.
Contribution
It introduces a proof of the BK inequality for discrete determinantal processes and links these processes to the CS decomposition, offering novel theoretical connections.
Findings
BK inequality holds for increasing events in discrete determinantal processes
Established a relationship between determinantal processes and the CS decomposition
Provided elementary insights into the structure of these processes
Abstract
We prove that for a discrete determinantal process the BK inequality occurs for increasing events generated by simple points. We give also some elementary, but nonetheless appealing relationship, between a discrete determinantal process and the well-known CS decomposition.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRandom Matrices and Applications
A note related to the CS decomposition and the BK inequality for discrete determinantal processes.
André Goldman
Abstract
We prove that for a discrete determinantal process the BK inequality occurs for increasing events generated by simple points. We give also some elementary but nonetheless appealing relationship between a discrete determinantal process and the well-known CS decomposition.
1 Introduction.
There exists an extensive mathematical literature, in several theoretical and applied areas, related to determinantal point processes. Let us cite, to mention a few recent applied works, [2], [9], [10], [18], [23], [24]. A good overview of the main conceptual basis and properties can be found in [20] and in the bibliography therein.
From the theoretical point of view, determinantal point processes could be defined (in Bourbaki-like-spirit) in the general locally compact Polish spaces setting, as point processes associated to some locally square integrable, Hermitian, positive semidefinite, locally trace-class operators and thereafter specialize them for particular cases namely to discrete determinantal processes. Regarding the latter, the approach of [20] which consists to construct such processes, first in the most elementary discrete context and then gradually extend them to the general situation, provides, in our opinion, many advantages. It turns out also that some results for the most general processes are proved only [11], [20], or more simply [21], indirectly from the corresponding results of the basic processes.
The basic elementary determinantal point process can be described via the exterior product concept, as follows.
Fix and let , , be a set of orthonormal vectors in . Denote
[TABLE]
and
[TABLE]
The associated determinantal process is a point process, view as a random subset of of cardinality , characterized [19], [20] by the formulas
[TABLE]
for all subsets . Note also that (1) implies
[TABLE]
for all .
Let be the vector space spanned by . For all sets of linearly independent vectors , , we have with thus, in particular, if is another orthonormal basis of then
[TABLE]
for every and consequently .
Remark also that if is an orthonormal basis of the orthogonal complement of in then obviously
[TABLE]
A remarquable example of a non-trivial basic determinantal process is given by uniform spanning tree measure on a finite connected graph G. Roughly speaking, if G fixed, is arbitrary edge-oriented and M is the vertex-edge incidence matrix (the columns been indexed by vertexes) then the determinantal process associated to the vector space spanned by column vectors but one, provides a uniform probability on spanning trees. This result due [7] is called the Transfer Current Theorem. For more details, with clever short proofs, see [20] 2.6. p.8. Some extensions of this result are given in [4] with a serie of open questions and conjectures. Among them, Conjecture 4.6. related to BK-type inequality after J. van den Berg and H.Kesten.
Recall that an event , , is called increasing if whenever and , we have also . For a pair , of increasing events the disjoint intersection is then defined [6] by
[TABLE]
A point process on is said to have the van den Berg - Kesten property (in short the BK property) if
[TABLE]
for every pair of increasing events. In [6] J. van den Berg and H.Kesten proved that inequality (4) is satisfied when is related to a product probability on . In the basic determinantal process setting the Conjecture 4.6. which states that the same is true for the spanning tree determinantal point processes is still unsolved. The question of whether general determinantal processes have the BK property was raised in [19].
The purpose of this note is twofold. First, we introduce a new method to investigate discrete determinantal processes using the CS decomposition (CSD) of partitioned unitary matrix which is a useful non-trivial tool in numerical linear algebra: a precise statement of CSD is given in section 2. We show that the CSD gives a pertinent description of conditioning and provides (at least in our opinion…) a suggestive perspective for future investigations: see for example the result given by the proposition 2 below which seems us new and the results of [12]. Furthermore, this should be an appropriate framework for computational needs.
Second, we study the BK inequality. We prove that the BK inequality (4) is satisfied for all discrete determinantal processes when the increasing events and are generated by simple points: Theorem 3 in section 4 and section 5. We conjecture also that:
Conjecture 1
For all we have
[TABLE]
for every choice , , , of disjoint subsets of such that .
If the Conjecture 1 holds then it can be shown that the BK inequality (4) is satisfied for increasing events and generated by disjoint sets: Theorem 2 in section 3. When the sets above are reduced to being simple points then the inequality (5) is a well-known result. For general sets, note that , is the classical correlation inequality [19] and that (5) was obtained in [12] for with the precise values of conditional probabilities.
Remark 1
Note that for the process related to a product probability on the counting random variables (the sets , , being disjoint) are independent and thus the inequality (5) becomes trivial. However, the situation is less obvious if the process is conditioned to have exactly points . In the particular case when the conditioned process assigns equal probability to all subsets , that is if , it was proved in [5] that has the BK property. As regards to the inequality (5) we have, with the choice ,
[TABLE]
and, consequently, the inequality (5) is equivalent, for simple points, to the well-known log-concave inequality and thus is fulfilled. Likewise, for general sets, the correlation inequality , , , with (the non-trivial case) , follows from BK property and is equivalent to the log-concave inequality . For it is easy to see that the validity of the inequality (5) depends on whether or not the functions of the form
[TABLE]
are log-concave, a question which does not seem to me to have been really investigated. Finally, the occurrence of log-concave criteria for negative dependence properties is not quite a surprise, see for example [26] .
2 The CS decomposition and the basic determinantal point process
Following [25] the general CS decomposition (CSD) for a matrix from the unitary group specifies that for any partitionning
[TABLE]
there exist unitary matrices , , , such that (here all unnamed blocks of the matrices below are always zero and the superscript represents the conjugate transpose)
[TABLE]
where the matrices
[TABLE]
are diagonal with
, .
In some cases the matrices of zeros and as well as the unit matrices could be nonexistent. See [25] THEOREM 1 (page and the discussion that follows) for the full statement, and below for a detailled description given from Jordan’s geometrical point of view.
The CS decomposition is a deep result which has a long history going back to the work of Camille Jordan in 1875 on angles between subspaces in [17]. Nowaday it is a popular tool in numerical linear algebra, useful for solving various questions as, for example, constrained least square problems, computing principal angles between subspaces, the generalized singular value decomposition, quantum computing and more else [3], [8], [13], [14], [25].
Now, let be a vector space of dimension , an orthonormal basis of and an orthonormal basis of the orhogonal complement . Fix and consider the CSD of the partitioned unitary matrix
[TABLE]
It follows from (7) that the columns vectors of these two matrices
[TABLE]
are respectively orthonormal basis of and .
Now we will detail different cases given by (9) which need to be distinguished. The description given here is somewhat lengthy but, in our opinion, useful for both theoretical and computational purpose. We note by , , the nul vector of the space . Note also the slight change with regard to angles appearing in CSD (7) which allows values [math] and in order to recover all Jordan’s principal angles.
- I.-
The case and .
There exist:
- (a)
a sequence of orthonormal vectors in , 2. (b)
three sequences of mutually orthonormal vectors in
, and
, 3. (c)
Jordan angles
such that noting
the sequence is an orthonormal basis of and the sequence is an orthonormal basis of the orthogonal complement . 2. II.-
The case and .
There exist:
- (a)
a sequence of orthogonal vectors in , 2. (b)
two sequences of mutually orthogonal vectors in
and , 3. (c)
Jordan angles
such that noting
3.
the set is an orthonormal basis of and the set is an orthonormal basis of . 3. III.-
The case and
There exist:
- (a)
a sequence of orthogonal vectors in , 2. (b)
two sequences of mutually orthogonal vectors in
and , 3. (c)
Jordan angles
such that noting
the set is an orthonormal basis of de and the set is an orthonormal basis of 4. IV.-
The case .
With the notations of points I-III:
- (i)
- (ii)
- (iii)
By reordering the rows of the procedure described above works for every subset , , and gives related basis of the spaces and . Note that in the Euclidean context, that is for and the CSD applied to orthogonal matrices, the angles appearing in the CS decomposition (related to ) are principal Jordan angles between the space E and the basic subspace
[TABLE]
An important statistical application of principal angles is the canonical correlation analysis (CCA) of H.Hotelling [15]. In order to develop a unified algebraic formulation of concepts in multivariate analysis (as for example CCA), S.Afriat has thoroughly studied in [1] (see also [22]) the geometry of subspaces in in terms of orthogonal and oblique projectors and has introduced, among others, the notation of so-called *multiplicative cosine and sine *
[TABLE]
[TABLE]
The basis of given by the CSD is a pertinent tool for the study of the associated determinantal process. For example it gives at once
Proposition 1
For a set , we have:
[TABLE]
and for
[TABLE] 2. 2.
[TABLE]
and for
[TABLE] 3. 3.
If and then the conditioned process is determinantal such that 4. 4.
If and then the conditioned process is determinantal such that 5. 5.
If then for all we have
[TABLE]
if then
[TABLE]
Remark 2
The fact that the conditioned processes and are determinantal, as well as inequalities (14) and (15), are well-known results proved by R.Lyons [19].
Remark 3
Regarding items 1 and 2 of Proposition 1, it was proved more generally in [16] Theorem 5, that for general determinantal processes with trace-class (both discrete and continuous case) kernels, the number of points in the process has the distribution of a sum of independent Bernoulli random variables.
A more elaborate information can be obtained from this point of view. For example:
Proposition 2
Consider the discrete determinantal process associated to a set , , of orthonormal vectors in . Fix points , , such that . With the choice (to simplify the notations) , , we have
[TABLE]
Proof:
The left and rigt sides of (2) do not depend of the choice of the basis of . Choose the basis given by the CS decomposition related to the set , with the reordering ( and (the general situation, case I). The first n-coordinates of these basis have the following form:
[TABLE]
It follows from Proposition 1 that
- (a)
2. (b)
3. (c)
4. (d)
[TABLE]
From (a)-(d) an elementary computation gives the right side of (2) (note that ). Indeed we get
[TABLE]
To compute the left side of (2) denote ,
and . Observe that
[TABLE]
with
[TABLE]
and
[TABLE]
Obviously thus
[TABLE]
and consequently
[TABLE]
In order to compute A introduce the notations
, and .
A little tought provides that for
[TABLE]
Moreover we have
[TABLE]
From orthogonality properties of relevant multivectors we obtain from (24)
[TABLE]
From (21), (23) and (25) an elementary computation gives
[TABLE]
and with (22)
[TABLE]
This and (19) prove Proposition 2. Note that from (27) we get that (20) is identified as a scalar product.
For further results by using the CSD as well as for some extensions of Proposition 2 see [12].
3 The BK inequality for increasing events generated by disjoint sets
Let , , , be a pair of increasing events and suppose (obviously) that . The events being increasing, there exist
two minimal sets and such that
, 2. 2.
The sets , are minimal in the sense that none of (resp. ) is strictly included in (resp. in ).
Consider now a basic determinantal process on . In the particular case when
[TABLE]
we have at once ans thus the BK-inequality (4) becomes
[TABLE]
which is a negative association inequality. R.Lyons proved in [19], [20] that determinantal processes have negative association, meaning that (29) is fulfilled.
In the general situation it is helpful to reformulate BK inequality (4) as follows.
Proposition 3
The inequality (4) is satisfied if and only if
[TABLE]
Proof:
Observe that
[TABLE]
Thus
[TABLE]
if and only if
[TABLE]
Suppose now that . Formula (30) becomes
[TABLE]
If the sets of are disjoint, that is if
for all ,
then
[TABLE]
Therefore
[TABLE]
and formula (31) takes the following form
[TABLE]
Fix now and suppose that the Conjecture 1 is fulfilled for all .
Lemma 4
Under this hypothesis, for all , , disjoint subsets of with , such that we have
[TABLE]
Proof:
For the inequality (33) is the well-known correlation inequality. For applying (5) we get
[TABLE]
and thus Lemma 4 follows by induction.
We will need the following elementary lemma. Its proof being trivial we omit it.
Lemma 5
For all and we have
[TABLE]
Theorem 1
Let be an increasing event generated by disjoint sets . Suppose that the Conjecture 1 holds. We have
[TABLE]
Proof:
We have to prove (32). By Lemma 5 we obtain
[TABLE]
Lemma 4 implies that
[TABLE]
so it remains to apply the geometric-arithmetic mean inequality
[TABLE]
to obtain (32) as desired.
Remark 4
Consider an event of disjoint sets such that . Denote . If the Conjecture 1 holds then it is obvious that the inequality (5) is also satisfied for the conditioned process provided that the sets occuring in (5) are disjoint from those in . Consequently, if is an increasing event generated by disjoint sets such that for all and then we obtain
[TABLE]
Let , and be events such that all sets in are pairwise disjoint.
Theorem 2
Suppose that the Conjecture 1 holds. Then for increasing events , such that and we have
[TABLE]
where and all sets in are pairwise disjoint.
Proof:
The proof proceeds by induction using Theorem 1 and the following lemma.
Lemma 6
Fix , , and and suppose that BK inequality (41) is fulfilled for all conditioned processes subjected to the conditions of Theorem 2. Fix , such that for all . Denote by the increasing event generated by and . Then, the BK inequality
[TABLE]
is satisfied for all conditioned processes such that all sets of are pairwise disjoint.
Proof:
By (30) we may suppose that
[TABLE]
We have
[TABLE]
and
[TABLE]
Formulas (30) and (44) imply that the BK inequality (42) can be written as follows
[TABLE]
or, introducing the process , as
[TABLE]
The stated hypotheses imply that
[TABLE]
It is easy to see that Conjecture 1 implies the inequality
[TABLE]
and thus by (47) and (48) we obtain (46) which finish the proof of Lemma 6.
Starting from (40) and applying step by step the Lemma 6 Theorem 2 follows.
4 The BK inequality for increasing events , generated by simple points
As mentioned in the Introduction the inequality (5) is satisfied when the occuring sets are reduced to being simple points. This follows easily, for example, from Proposition 1. Therefore Theorem 2 implies
Theorem 3
Let , be increasing events generated by simple points. The BK inequality
[TABLE]
is then satisfied for all determinantal discrete processes associated to sets of orthonormal vectors of .
Remark 5
For sets reduced to being simple points the key inequality (33) can be seen from the point of view given by the CSD. Indeed, consider the CSD in the case I applied to and, according, let , , be the vectors such that
[TABLE]
for all . Denote
[TABLE]
where
[TABLE]
and is the )-minor of the unitary matrix . By (12) we obtain:
[TABLE]
Remark 6
It was pointed out to us that for an increasing event generated by simple points the inequality (36) which can be read as
[TABLE]
can be obtained also by a direct computation from formula (11) of Proposition 1 and, moreover, if one consider the product measure on the product space and increasing events
[TABLE]
, then the formulas (11) imply that . From the original J. van den Berg and H.Kesten Theorem (3.3) of [6] we get that
[TABLE]
Furthermore, note that by Remark 3 the inequalities (54) and (55) are still valid for general determinantal processes (both discrete and continuous) taking for S a Borel set.
5 Extensions and concluding remarks
Theorem 3 can be easily extended in the setting of general discrete determinantal processes. From the construction given in paragraphs 2.2. of [20], which start from the basic processes, it follows at once that Theorems 1-2 are valid (the generated sets and being finite or infinite) for determinantal point processes defined on denumerables sets and associated to closed subspaces of . Now, let be a such process on . Fix and consider the proces
[TABLE]
Let , , , be the increasing events generated respectively by and . The BK inequalities for , , and , , , involve only generating sets , . Consequently Theorem 3 is valid for as well. To finish just note that, by paragraph 2.2. of [20], discrete determinantal processes associated to positive contractions (the general case) are of the form (56).
By the transference principle ([20] 3.6.) Theorem 3 could also be extended to the continuous case but this is of little use due the fact that in the continuous setting the intensity measures related to determinantal processes of interest are of diffusive type which implies that for points (however, as mentioned in Remark 6, inequalities (54) and (55) still hold).
Acknowledgement
I thank the anonymous referees for their constructive comments, especially for a pertinent question about the validity of the Conjecture 1 for the non-determinantal point processes, which led to look at the process described in Remark 1.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Afriat, S. (1957). Orthogonal and oblique projectors and the characteristics of pairs of vector spaces. Proceedings of the Cambridge Philosophical Society 53, 800–816.
- 2[2] Baccelli, F. and O’Reilly, E. (2018). Reach of repulsion for determinantal point processes in high dimensions. Journal of Applied Probability 55, 760–788.
- 3[3] Bai, Z. (1992). The CSD, GSVD, their applications and computations. Tech. Report IMA preprint series 958, Institute for Mathematics and its Applications, University of Minnesota.
- 4[4] Benjamini, I., Lyons, R., Peres, Y. and Schramm, O. (2001). Uniform spanning forests. The Annals of Probability 29, 1–65.
- 5[5] van den Berg, J. and Jonasson, J. (2012). A BK inequality for randomly drawn subsets of fixed size. Probab. Theory Relat. Fields 154, 835–844.
- 6[6] van den Berg, J. and Kesten, H. (1985). Inequalities with applications to percolation and reliability. Journal of Applied Probability 22, 556–569.
- 7[7] Burton, R. and Pemantle, R. (1993). Local Characteristics, Entropy and Limit Theorems for Spanning Trees and Domino Tilings via Transfer-Impedances. The Annals of Probability 21, 1329 – 1371.
- 8[8] Gawlik, E.S., Nakatsukasa, Y.B. and Sutton, D. (2018). A Backward Stable Algorithm for Computing the CS Decomposition via the Polar Decomposition. ar Xiv:1804.09002 v 1[math.NA].
