Products of Conditional Expectation Operators: Convergence and Divergence
Guolie Lan, Ze-Chun Hu, Wei Sun

TL;DR
This paper explores the convergence properties of products of conditional expectation operators, demonstrating divergence in non-atomic spaces and convergence in purely atomic spaces, thus resolving a long-standing conjecture.
Contribution
It proves that products of conditional expectation operators can diverge in non-atomic spaces and always converge in purely atomic spaces, settling a major open question.
Findings
Divergence of operator products in non-atomic spaces.
Convergence of operator products in purely atomic spaces.
Resolution of a long-standing conjecture on conditional expectations.
Abstract
In this paper, we investigate the convergence of products of conditional expectation operators. We show that if is a probability space that is not purely atomic, then divergent sequences of products of conditional expectation operators involving 3 or 4 sub--fields of can be constructed for a large class of random variables in . This settles in the negative a long-open conjecture. On the other hand, we show that if is a purely atomic probability space, then products of conditional expectation operators involving any finite set of sub--fields of must converge for all random variables in .
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMathematical Analysis and Transform Methods · Approximation Theory and Sequence Spaces · Probability and Risk Models
Products of Conditional Expectation Operators: Convergence and Divergence
Guolie Lana, Ze-Chun Hub and Wei Sunc
a School of Economics and Statistics, Guangzhou University, China
b College of Mathematics, Sichuan University, China
c Department of Mathematics and Statistics, Concordia University, Canada
Abstract In this paper, we investigate the convergence of products of conditional expectation operators. We show that if is a probability space that is not purely atomic, then divergent sequences of products of conditional expectation operators involving 3 or 4 sub--fields of can be constructed for a large class of random variables in . This settles in the negative a long-open conjecture. On the other hand, we show that if is a purely atomic probability space, then products of conditional expectation operators involving any finite set of sub--fields of must converge for all random variables in .
Keywords product of conditional expectation operators, Amemiya-Ando conjecture, non-atomic -field, purely atomic -field, linear compatibility, deeply uncorrelated.
Mathematics Subject Classification (2010) 60A05, 60F15, 60F25
1 Introduction and main results
Conditional expectation is one of the most important concepts in probability theory. It plays a central role in probability and statistics. Let be a probability space and be sub--fields of , where . For , denote by the conditional expectation operator with respect to , i.e., for . Suppose that . For , define the sequence successively by
[TABLE]
Then for some sequence . In this paper, we will investigate the convergence of .
Note that conditional expectation operators can be regarded as contraction operators on Banach spaces. The study on the convergence of is not only of intrinsic interest, but is also important in various applications including numerical solutions of linear equations and partial differential equations [4, 14], linear inequalities [24], approximation theory [20, 9] and computer tomography [23].
In 1961, Burkholder and Chow [6] initiated the study of convergence of products of conditional expectations. They focused on the case and showed that converges almost everywhere and in -norm for . Further, it follows from Stein [25] and Rota [21] that if for some , then converges almost everywhere. On the other hand, Burkholder [5] and Ornstein [16] showed that for almost everywhere convergence need not hold necessarily.
However, for the convergence of becomes a very challenging problem. This paper is devoted to the following long-open conjecture on convergence of products of conditional expectations:
(CPCE) If and all the come from a finite set of sub--fields of , then must converge in -norm.
Conjecture (CPCE) is closely related to the convergence of products of orthogonal projections in Hilbert spaces. Before stating the main results of this paper, let us recall the important results obtained so far for the convergence of products of orthogonal projections in Hilbert spaces.
Let be a Hilbert space and be closed subspaces of , where . Denote by the orthogonal projection of onto . Let and , we define the sequence by
[TABLE]
If , the convergence of in follows from a classical result of von Neumann [15, Lemma 22]. If and is finite dimensional, the convergence of was proved by Práger [18]. If is infinite dimensional and is periodic, the convergence of was obtained by Halperin [10]. Halperin’s result was then generalized to the quasi-periodic case by Sakai [22]. Based on the results on the convergence of products of orthogonal projections, Zaharopol [26], Delyon and Delyon [8], and Cohen [7] proved that if is a periodic sequence with all the coming from a finite set of -fields, then for any with , the sequence of the form (1.1) converges in -norm and almost everywhere.
In 1965, Amemiya and Ando [2] considered the more general convergence problem when and is non-periodic. They showed that for arbitrary sequence , the sequence of the form (1.2) converges weakly in , and they posed the question if converges also in the norm of . In 2012, Paszkiewicz [17] constructed an ingenious example of 5 subspaces of and a sequence of the form (1.2) which does not converge in . Kopecká and Müller resolved in [12] fully the question of Amemiya and Ando. They refined Paszkiewicz’s construction to get an example of 3 subspaces of and a sequence which does not converge in . In [13], Kopecká and Paszkiewicz considerably simplified the construction of [12] and obtained improved results on the divergence of products of orthogonal projections in Hilbert spaces.
Note that a projection on can not necessary be represented as a conditional expectation operator. Thus, counterexamples for the convergence of products of orthogonal projections in Hilbert spaces do not necessarily yield divergent sequences of products of conditional expectation on probability spaces. Let be the Borel -field of and be the Lebesgue measure. In 2017, Komisarski [11] showed that there exist and coming from 5 sub--fields of such that the sequence of the form (1.1) diverges in . Note that is only a -finite measure space and the conditional expectation considered in [11] is understood in an extended sense. Conjecture (CPCE) still remains open for probability spaces. We would like to point out that Akcoglu and King [1] constructed an example of divergent sequences involving infinitely many sub--fields on the interval .
In this paper, we will show that Conjecture (CPCE) falls if is not a purely atomic probability space; however, it holds if is a purely atomic probability space. More precisely, we will prove the following results.
Theorem 1.1
There exists a sequence with the following property:
Suppose that is a Gaussian random variable on and there exists a non-atomic -field which is independent of . Then there exist three -fields , such that the sequence defined by
[TABLE]
does not converge in probability.
Denote by the space of all probability measures on . Then becomes a complete metric space if it is equipped with the Lévy-Prokhorov metric.
Theorem 1.2
There exists a sequence with the following property:
(1) Suppose that is a non-atomic probability space. Then there exists a dense subset of such that for any we can find a random variable with distribution and four -fields , such that the sequence defined by
[TABLE]
does not converge in probability.
(2) Let be a probability space that is not purely atomic. Then there exist a random variable and three -fields , such that the sequence defined by
[TABLE]
does not converge in probability.
Denote by the collection of all null sets of . For a sub--filed of , we define to be the -field generated by and .
Theorem 1.3
Suppose that is a purely atomic probability space, are sub--fields of , and . Let with and be defined by (1.1). Then converges to some in -norm and almost everywhere. If each repeats infinitely in the sequence , then
[TABLE]
The rest of this paper is organized as follows. In Section 2, we discuss the linear compatibility under conditional expectations, which is essential for our construction of divergent sequences of products of conditional expectation operators. In Section 3, we consider divergent sequences of products of conditional expectation operators on probability spaces that are not purely atomic and prove Theorems 1.1 and 1.2. In Section 4, we investigate the convergence of products of conditional expectation operators on purely atomic probability spaces and prove Theorem 1.3.
2 Linear compatibility and deep uncorrelatedness
Let be a probability space. We consider the linear compatibility defined by conditional linear equations, which is closely related to linear regression and optimal estimation (cf. Rao [19]).
Definition 2.1
Two integrable random variables on are said to be linearly compatible under conditional expectations, or linearly compatible in short, if there exist such that almost surely,
[TABLE]
Obviously, if and are independent or perfectly collinear, i.e., , then they are linearly compatible. For non-trivial examples, note that if have a 2-dimensional Gaussian distribution then they are linearly compatible, and if both and follow two-point distributions, then they must be linearly compatible.
Lemma 2.2
Let and be two random variables on with and . Suppose that (2.1) holds. Denote by the correlation coefficient of and , and denote by and the -fields generated by and , respectively. Then,
(i)
* and ;*
(ii)
* implies that a.s.;*
(iii)
* implies that and a.s.;*
(iv)
* implies that .*
Proof. We assume without loss of generality that . Then, in (2.1).
(i) It follows from (2.1) that and . Taking expectations, we get
[TABLE]
Then , which implies that .
(ii) is proved by Rao ([19, Proposition 2.1]), where only the finiteness of expectations is assumed.
(iii) We define the operators and on by and for . By Burkholder and Chow [6, Theorem 3], we have
[TABLE]
On the other hand, since (2.1) holds with ,
[TABLE]
[TABLE]
Similarly, we can show that
[TABLE]
(iv) is a direct consequence of (2.2) since and are non-zero.
Motivated by Lemma 2.2 (iv), we introduce the definition of deep uncorrelatedness for two random variables.
Definition 2.3
Two integrable random variables on are said to be deeply uncorrelated if
[TABLE]
Remark 2.4
It is clear that if and are integrable and independent then they are deeply uncorrelated, and if and have finite variances and are deeply uncorrelated then they are uncorrelated, i.e., . The following examples show that deeply uncorrelated is equivalent to neither independent nor uncorrelated.
(i) Let be a pair of random variables with the uniform distribution on the unit disc . It can be checked that and are deeply uncorrelated but not independent.
(ii) Let and be two measurable sets satisfying and . Define and . Note that . Then , which implies that are uncorrelated. However, we have that
[TABLE]
and
[TABLE]
Hence , which implies that and are not deeply uncorrelated.
We now define linear compatibility and deep uncorrelatedness for a family of random variables.
Definition 2.5
(1)* A family of integrable random variables on is said to be linearly compatible under conditional expectations, or linearly compatible in short, if for any finite sequence in , there exist such that almost surely,*
[TABLE]
(2)* is called a deeply uncorrelated family if for any finite sequence ,*
[TABLE]
Remark 2.6
(i) Let and . It is well-known that provides the -optimal estimation of given . Thus (2.5) implies that the -optimal estimation of via is consistent with the optimal linear estimation via .
(ii) An important class of linearly compatible family is Gaussian processes. For a Gaussian process , every finite collection of random variables has a multivariate normal distribution. Thus (2.5) holds and therefore is linearly compatible.
(iii) Let be a deeply uncorrelated family with , . Define . Then is a martingale.
Lemma 2.7
Let be a linearly compatible family with , . Then for any infinite sequence , there exists such that almost surely,
[TABLE]
In particular, if is a deeply uncorrelated family then for any ,
[TABLE]
Proof. Since for , it follows from (2.5) that there exists such that for each ,
[TABLE]
By the martingale convergence theorem, we have
[TABLE]
which implies that converges in .
Denote by the limit of in . Then there exists such that
[TABLE]
Thus we obtain (2.6). The proof of (2.7) is similar and we omit the details.
Note that for and , the conditional expectation can be regarded as the orthogonal projection of onto the closed subspace . However, in general, the orthogonal projection of onto a closed linear subspace can not necessary be represented as a conditional expectation operator. The following lemma shows that the linear compatibility ensures the one-to-one correspondence between conditional expectation operator and orthogonal projection.
Lemma 2.8
Let be a closed linear subspace of . Suppose that is a linearly compatible family with for any . Then for each closed linear subspace with countable basis, there exists a sub--field of such that for any ,
[TABLE]
Proof. Let be an orthonormal basis of and define . By Lemma 2.7, for any , there exists such that
[TABLE]
Denote Then for any , almost surely
[TABLE]
Hence , which implies that and are orthogonal in . Since is arbitrary, the right hand side of (2.9) equals the orthogonal projection of onto . Therefore, (2.8) holds.
3 Divergent sequences on probability spaces that are not purely atomic
Definition 3.1
Let be a probability space and be a sub--field of .
(1) A measurable set is called -atomic if and for any -measurable set , it holds that either or .
(2) is called non-atomic if it contains no -atomic set, i.e., for each with , there exists a -measurable set such that .
(3) is called purely atomic if it contains a countable number of -atomic sets such that
[TABLE]
(4)* is said to be non-atomic if is non-atomic. is said to be purely atomic if is purely atomic.*
Remark 3.2
(i) Note that is non-atomic if it is generated by a random variable whose cumulative distribution function (cdf) is continuous on , for example, a continuous random variable. Moreover, is non-atomic if and only if there exists a random variable which has a uniform distribution on (cf. [3, §2]).
(ii) is purely atomic if it is generated by a discrete random variable. Conversely, if is purely atomic, then each -measurable random variable has a discrete distribution on .
We now prove Theorems 1.1 and 1.2, which are stated in §1. Our proofs are based on the following remarkable result.
Theorem 3.3
(Kopecká and Paszkiewicz [13, Theorem 2.6])) There exists a sequence with the following property:
If is an infinite-dimensional Hilbert Space and , then there exist three closed subspaces , such that the sequence of iterates defined by does not converges in .
Proof of Theorem 1.1 Denote by the standard Gaussian measure on and denote by the standard Gaussian measure on . For , we consider its binary representation:
[TABLE]
where . For , define
[TABLE]
Let be the map
[TABLE]
Denote by the Lebesgue measure on . Then it can be checked that the image measure of under equals the infinite product measure on . Let be the cdf of . Define
[TABLE]
Let be the map
[TABLE]
Then the image measure of under equals the standard Gaussian measure on .
Since is a non-atomic sub--field which is independent of , there exists a random variable which has a uniform distribution on and is independent of . Define
[TABLE]
Set
[TABLE]
Then
[TABLE]
is a sequence of independent standard Gaussian random variables. Let
[TABLE]
be the closed linear span of . Then is an infinite-dimension Gaussian Hilbert space, i.e., a Gaussian process which is also a Hilbert subspace of .
We now show that is a linearly compatible family. Take . Let
[TABLE]
be the linear span of . Then the orthogonal projection of onto can be written as
[TABLE]
for some . Define
[TABLE]
Then is orthogonal to and hence is independent of , since have a joint Gaussian distribution. Therefore,
[TABLE]
which implies that is linearly compatible.
Applying Theorem 3.3 to the infinite-dimensional Hilbert space , we find that there exists a sequence with the following property:
For , there exist three closed subspaces , such that the sequence defined by
[TABLE]
does not converges in -norm.
Hence there exist three closed subspaces , such that the sequence defined by
[TABLE]
does not converges in -norm.
On the other hand, by Lemma 2.8, there exist three sub--fields such that
[TABLE]
Therefore,
[TABLE]
Finally, we show that does not converge in probability. Suppose that converges to some in probability. Note that
[TABLE]
which implies that is uniformly integrable. Therefore in -norm. We have arrived at a contradiction.
Proof of Theorem 1.2 (1). Since is non-atomic, there exists a random variable which has a uniform distribution on . Following the first part of the proof of Theorem 1.1, we can construct three independent standard Gaussian random variables on .
For , let be the Gaussian measure on with mean 0 and variance . For , define to be the convolution of and , i.e.,
[TABLE]
Then all the moments of are finite and weakly as . Define
[TABLE]
is a dense subset of with respect to the Lévy-Prokhorov metric.
For with . Define
[TABLE]
Then has the probability distribution , where is the cdf of a standard Gaussian random variable. Define
[TABLE]
Then has the probability distribution . Write
[TABLE]
Then
[TABLE]
is a Gaussian random variable which is independent of . Therefore, by Theorem 1.1 we can find three sub--fields , such that the sequence defined by
[TABLE]
does not converge in probability.
Proof of Theorem 1.2 (2). Suppose that is neither purely atomic nor non-atomic. Then there exist an and a sequence of atomic sets such that is a non-atomic set. Denote
[TABLE]
Define
[TABLE]
and
[TABLE]
Then is a non-atomic probability space.
For a sub--field of , define
[TABLE]
Then is a sub--field of . For a random variable on , we can extend it to a random variable on by defining for . We claim that
[TABLE]
where denotes the conditional expectation on . In fact, the right hand side of (3.2) is obviously -measurable. Thus it is sufficient to show that
[TABLE]
for any .
Note that by (3.1) we have that or for some .
Case 1: Suppose that for . Then the left hand side of (3.3) is
[TABLE]
Thus (3.3) holds.
Case 2: Suppose that for some . The proof is similar to that of Case 1 and we omit the details.
Note that is a non-atomic probability space. Then there exists on a random variable which has a uniform distribution on . Following the first part of the proof of Theorem 1.1, we can construct on two independent standard Gaussian random variables and . Let
[TABLE]
Then is a non-atomic sub--field of and is independent of the Gaussian random variable . Thus, by Theorem 1.1, we can find three sub--fields of such that the sequence of iterates on defined by
[TABLE]
does not converge in probability. Hence must diverge in the -norm of , i.e.,
[TABLE]
where is the -norm of for .
Now we can construct on three sub--fields by
[TABLE]
and a sequence of random variables by
[TABLE]
Note that by (3.2), (3.4), (3.6) and (3.7) we have
[TABLE]
which implies that
[TABLE]
To show that does not converge in probability, note that
[TABLE]
Then we obtain by (3.5) and (3.8) that
[TABLE]
On the other hand, we can check that
[TABLE]
which implies that is uniformly integrable. Suppose that converges in probability. Then converges in the -norm, which contradicts with (3.9). Therefore, does not converge in probability.
4 Convergence on purely atomic probability spaces
In this section we will prove Theorem 1.3, which is stated in §1. First, we give a lemma that holds for any probability space.
Lemma 4.1
Let be a probability space, be a family of sub--fields of and be defined by (1.1). Then the following statements are equivalent.
(1)* converges in -norm for any with .*
(2)* converges in probability for any .*
Proof. It is sufficient to show that .
For , we define the operators recursively by and
[TABLE]
Then is a family of linear contraction operators on and for . Let . Then for .
Suppose that (2) holds, i.e., the sequence converges in probability for each . Then we obtain by the bounded convergence theorem that
[TABLE]
Hence is a Cauchy sequence in and therefore converges in -norm.
Proof of Theorem 1.3. Let be atomic sets (cf. Definition 3.1) such that
[TABLE]
Let . Then for each , there exists a sequence such that
[TABLE]
We define the orthogonal projections , , on by
[TABLE]
Then . By Amemiya-Ando [2, Theorem], converges weakly in . Thus converges as for each , which implies that converges almost everywhere. Since is arbitrary, we obtain by Lemma 4.1 that converges to some in -norm for any with . Further, converges to almost everywhere since is a purely atomic probability space.
We now show that . For each , we can find an infinite subsequence such that . It follows that
[TABLE]
By the almost sure convergence of , we have that
[TABLE]
which implies that for each . Hence
[TABLE]
Let . By (1.1), we get
[TABLE]
Then,
[TABLE]
Thus
[TABLE]
Since is arbitrary, the proof is complete.
Acknowledgments This work was supported by the China Scholarship Council (No. 201809945013), National Natural Science Foundation of China (No. 11771309) and Natural Sciences and Engineering Research Council of Canada.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. Akcoglu, J. L. King: An example of pointwise non-convergence of iterated conditional expectation operators, Israel J. Math., 94, 179-188 (1996).
- 2[2] I. Amemiya, T. Ando: Convergence of random pruducts of contractions in Hilberet space, Acta Scientiarum Mathematicarum (Szegeed), 26, 239-244 (1965).
- 3[3] P. Berti, L. Pratelli, P. Rigo: Atomic intersection of σ 𝜎 \sigma -fields and some of its consequences, Probab. Theory Relat. Fields, 148, 269-283 (2010).
- 4[4] F. E. Browder, On some approximation methods for solutions of the Dirichlet problem for linear elliptic equations of arbitrary order, J. Math. Mech., 7, 69-80 (1958).
- 5[5] D. L. Burkholder: Successive conditional expectations of an integrable function, Ann. Math. Stat., 33, 887-893 (1962).
- 6[6] D. L. Burkholder, Y. S. Chow: Iterates of conditional expectation operators, Proc. Amer. Math. Soc., 12, 490-495 (1961).
- 7[7] G. Cohen: Iterates of a product of conditional expectation operators, J. Func. Anal., 242, 658-668 (2007).
- 8[8] B. Delyon, F. Delyon: Generalization of von Neumann’s spctral sets and integral representation of operators, Bull. Soc. Math. France, 127, 25-41 (1999).
