A Karhunen-Lo\`{e}ve Theorem for Random Flows in Hilbert spaces
Leonardo V. Santoro, Kartik G. Waghmare, Victor M. Panaretos

TL;DR
This paper generalizes Mercer's theorem to infinite-dimensional Hilbert spaces and establishes a Karhunen-Loève expansion for mean-square continuous Hilbertian flows, enabling series representations with uncorrelated coefficients.
Contribution
It introduces a novel generalization of Mercer's theorem and derives a Karhunen-Loève theorem for functional data in Hilbert spaces, extending classical results to infinite dimensions.
Findings
Series expansion with uncorrelated coefficients for Hilbertian flows
Uniform convergence of the expansion over time
Extension of Mercer's theorem to operator-valued kernels in Hilbert spaces
Abstract
We develop a generalisation of Mercer's theorem to operator-valued kernels in infinite dimensional Hilbert spaces. We then apply our result to deduce a Karhunen-Lo\`eve theorem, valid for mean-square continuous Hilbertian functional data, i.e. flows in Hilbert spaces. That is, we prove a series expansion with uncorrelated coefficients for square-integrable random flows in a Hilbert space, that holds uniformly over time.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Groundwater flow and contamination studies · Markov Chains and Monte Carlo Methods
A Karhunen–Loève Theorem for Random Flows in Hilbert spaces
Leonardo V. Santoro Kartik G. Waghmare Victor M. Panaretos
[email protected] [email protected] [email protected]
Institut de Mathématiques
École Polytechnique Fédérale de Lausanne
Abstract
We develop a generalisation of Mercer’s theorem to operator-valued kernels in infinite dimensional Hilbert spaces. We then apply our result to deduce a Karhunen-Loève theorem, valid for mean-square continuous Hilbertian functional data, i.e. flows in Hilbert spaces. That is, we prove a series expansion with uncorrelated coefficients for square-integrable random flows in a Hilbert space, that holds uniformly over time.
**MSC2020 classes: 60G12, 62R10
Key words: Mercer’s theorem, Functional Principal Components, Hilbertian functional data, random series expansion**
1 Introduction
The Karhunen-Loève theorem is a fundamental result on stochastic processes, playing a central role in their probabilistic construction, numerical analysis and statistical inference (Adler, 2010; Le Maître and Knio, 2010; Hsing and Eubank, 2015). In its simplest form, it provides a countable decomposition of a time-indexed real-valued random function into a “bi-orthogonal” Fourier series that separates its stochastic from its functional components: the basis functions are deterministic orthogonal functions, and their coefficients are uncorrelated random variables. When the basis functions are ordered by decreasing coefficient variance, the -truncated expansion provides the best -dimensional approximation of the process in mean square. Importantly, for random functions that are mean-square continuous, the series can be interpreted pointwise. These properties explain the catalytic role the expansion has played (and continues to play) in the field of Functional Data Analysis (Wang et al., 2016) – in fact, the field’s very origin traces to Grenander’s use of the expansion as a coordinate system for inference on random functions (Grenander, 1950).
Traditionally, functional data analysis focused on real-valued functions defined on a compact interval. Increasingly, though, the complexity of functional data escapes this context. Modern functional data can come in the form of functional flows (random maps from an interval into a function space) or more generally functional random fields (random maps from a Euclidean set into a function space). With the aim of making such data amenable to the tools of functional data analysis, we establish a generalisation of the Karhunen-Loève expansion to random flows (and more generally random fields) valued in an abstract separable Hilbert space, possibly infinite dimensional (Theorem 2). To do so, we first establish a version of Mercer’s theorem for non-negative definite kernels valued in separable Hilbert spaces (Theorem 1).
2 Background and Problem Statement
Let be a separable Hilbert space with inner product and induced norm , with We denote by , and the space of bounded, trace-class (TC) and Hilbert-Schmidt (HS) linear operators on , respectively, with corresponding norms:
[TABLE]
where the adjoint of a linear operator is defined via for all . Note that For , the operator defined by is bounded and linear.
We say that a bounded operator is compact if for any bounded sequence in , contains a convergent subsequence. Let be compact and self adjoint. Denote by its eigenvectors, with corresponding eigenvalues , ordered so that . Then comprises a Complete Orthonormal System (CONS) for and we may write
Let be any compact subset of a Euclidean space. We denote by the Hilbert space of square integrable -valued functions , i.e.,
[TABLE]
The associated inner product is defined by with corresponding norm by .
Let be a (mean zero) random flow in with finite second moment, . We will refer to as Hilbertian flow, as in Kim et al. (2020), though it could also be a termed a Hilbertian field when . Denote by its covariance operator:
[TABLE]
or, equivalently:
[TABLE]
Note that is nonnegative-definite and trace-class. In particular, is compact and self-adjoint; therefore, it admits the following spectral decomposition in terms of its eigenvalue-eigenfuction pairs (e.g. Hsing and Eubank (2015, Theorem 7.2.6)):
[TABLE]
where are the eigenvalues and the corresponding eigenfunctions for , forming a complete orthonormal system.
Consequently, admits the following decomposition
[TABLE]
where the convergence is understood in the mean norm sense
[TABLE]
and where it may furthermore be shown that are uncorrelated random variables with zero mean. In particular, by Hsing and Eubank (2015, Theorem 7.2.8), the above decomposition is optimal: for any and any CONS of ,
[TABLE]
We ask the following questions:
Is expansion (2) interpretable pointwise in ?
Is the convergence (3) valid uniformly over ?
When , and assuming mean-square continuity of , these two questions have been long known to admit a positive answer in the form of the celebrated Karhunen-Loève theorem (Karhunen (1946), Loeve (1948); see also Kac and Siegert (1947)), whose proof fundamentally relies on Mercer’s theorem on the decomposition of real-valued kernels (Mercer, 1909). Extensions to random fields valued in -dimensional Euclidean space have also been tackled (Withers, 1974), but the general, infinite-dimensional case has not been addressed, and does not straightforwardly follow from the case .
3 Mercer’s Theorem for Operator-Valued Kernels
Consider a function . We refer to such a function as an operator-valued kernel. We say that is continuous if:
[TABLE]
for every . We say that is symmetric if . Finally, we say that is non-negative definite if for every and sequences , :
[TABLE]
Given an operator-valued kernel, we may define an integral operator by:
[TABLE]
where the integral in (5) is to be understood as a Bochner integral.
Lemma 1**.**
Let be a continuous kernel.
- (i)
* is compact*
- (ii)
if is symmetric, then is self-adjoint.
- (iii)
* is non-negative definite if and only if is non-negative definite.*
Proof.
(i) Let be a CONS for . Then converges strongly to the identity. For , let . Note that is compact, being of finite rank. To prove that is compact it thus suffices to show that strongly, as . Now:
[TABLE]
By continuity, to prove compactness of it thus suffices to show that:
[TABLE]
Notice that strongly. Because is Hilbert-Schmidt, the convergence also holds in operator norm. Indeed for we can write
[TABLE]
and the conclusion follows.
(ii) If is symmetric, for any :
[TABLE]
thus proving that is self-adjoint.
(iii) We follow Hsing and Eubank (Theorem 4.6.4 2015). Given let be chosen so that whenever . As is a compact metric space, there exists a finite partition of such that each has diameter less than . Let be an arbitrary point of and, for all , define to be . The (uniform) continuity of now has the consequence that
[TABLE]
Now let be the integral operator with kernel . With this choice, we find that, for any ,
[TABLE]
and
[TABLE]
which, by (4), proves the positiveness of .
Conversely, suppose that
[TABLE]
for some , . As is uniformly continuous, there exist measurable disjoint sets with for all such that:
[TABLE]
This implies that:
[TABLE]
due to the mean-value theorem. Upon observing that the last expression is simply for , we conclude that is also not non-negative definite. ∎
In particular, if is a continuous, symmetric and non-negative definite kernel, admits a spectral decomposition in terms of its eigenvalue-eigenfunction pairs. That is, the eigenfunctions for , , form a CONS for , and if denote the corresponding eigenvalues, one may write:
[TABLE]
Furthermore, the following lemma establishes that the eigenfunctions of are uniformly continuous.
Lemma 2**.**
Let be a continous kernel. For each , is uniformly continuous.
Proof.
By compactness of and uniform continuity of , for any given there exists such that for all with . Then:
[TABLE]
∎
The following lemma will be instrumental in the proof of our generalisation to Mercer’s theorem.
Lemma 3**.**
- (a)
For any ,
[TABLE]
- (b)
Let be a CONS for . For any ,
[TABLE]
- (c)
the operator is well-defined and uniformly continuous wrt to . Furthermore, the sum converges uniformly.
Proof.
(a) Let
[TABLE]
and take to be the integral operator with kernel . Note that is continuous, by continuity of and by Lemma 2. For any :
[TABLE]
and must be non-negative definite. This implies, by Lemma 1 (iii), that is non-negative definite and hence , thereby proving (7).
(b) Let be a CONS for . First, note that:
[TABLE]
Then, by Cauchy-Schwarz, we obtain that:
[TABLE]
(c) Fix . By (b) we may conclude that there exists such that:
[TABLE]
Furthermore, for any , uniform continuity of the entails the existence of such that:
[TABLE]
whenever . By the last two displays, it is then clear that there exists such that:
[TABLE]
whenever . ∎
Theorem 1** (Mercer’s Theorem for Operator-Valued Kernels).**
Let be a continuous, symmetric and non-negative definite kernel, and denote by the corresponding integral operator. Let be the eigenvalue and eigenfunction pairs of , where and . Then:
[TABLE]
for all , with the series converging absolutely and uniformly.
Proof.
For different continuous kernels and , it is straightforward to construct a function such that and differ. Thus, is the unique operator kernel that defines . Now, the integral operator with the continuous kernel has the same eigen-decomposition as and is therefore the same operator. Thus, for all with the right-hand side converging absolutely and uniformly as a consequence of Lemma 3.
∎
Finally, we can now see that the integral operator in (5) is trace class.
Proposition 1**.**
Let the continuous kernel be symmetric and non-negative definite and the corresponding integral operator. Then:
[TABLE]
Proof.
We see that, for any CONS of :
[TABLE]
where we have employed Parseval’s equality, and have exchanged of the order of summation and integration by Fubini’s Theorem. ∎
4 Karhunen–Loève Theorem for Hilbertian Flows
Let be a stochastic process on a probability space , taking values in . We may define its mean by:
[TABLE]
and its covariance kernel by:
[TABLE]
provided the above expectations are well defined as Bochner integrals (see Hsing and Eubank (2015, Definition 7.2.1).
We say that a process is a second-order process if (8) and (9) are well defined for every . Note that the covariance kernel is a symmetric operator-valued kernel, i.e.:
[TABLE]
We say that is mean-square continuous if:
[TABLE]
for any and any sequence converging to .
Lemma 4**.**
Let be a second-order process. Then is mean-square continuous if and only if its mean and covariance functions are continuous wrt and , respectively.
Proof.
Assume without loss of generality that the process is centered, i.e. that .
[TABLE]
Note that the kernel is non-negative definite, in the sense of (4). Indeed, for and any sequences , :
[TABLE]
∎
We may finally state:
Theorem 2** (Karhunen-Loève Expansion for Hilbertian Flows/Fields).**
Let be a mean-squared continuous second-order process. Define:
[TABLE]
Then:
[TABLE]
Proof.
First, note that:
[TABLE]
Furthermore:
[TABLE]
where we have employed Fubini’s theorem to interchange integral and expectation, and our extension to Mercer’s decomposition, Theorem 1 to express as .
Putting things together, we obtain:
[TABLE]
and observing that, by Parseval’s identity:
[TABLE]
we finally see that (10) converges to zero uniformly by our extension to Mercer’s decomposition, Theorem 1, thus proving the theorem’s statement. ∎
5 Remarks on Computation
We now briefly remark on the issue of computing the expansion components for a collection of realised flows.
Let be independent realizations of . Let be a discretization of . Finally, let be the -dimensional subspace of spanned by the first vectors of some CONS of . Assume that at each of the nodes , we can observe linear measurements for each flow and each basis element .
Under this measurement scheme, the realization , , is reduced to the column
[TABLE]
The complete collection of observations can be concisely summarized by the matrix . A naive approach to computing the eigendecomposition associated with would proceed by evaluating a discretized version of the empirical covariance and then computing the eigendecomposition of the resulting matrix. In our case, this corresponds to calculating the eigendecompostion of . The computational complexity of this operation is ( for the eigendecomposition and the rest for evaluating the covariance), which severely limits the resolution ( and ).
Fortunately, by a classical trick (Chambers, 1977; Jolliffe, 1986), we can circumvent this computational cost by simply calculating instead the singular value decomposition of . In practice, and this allows us to write the SVD of where and are orthogonal matrices of dimensions and and is an diagonal matrix. For , we have the approximations
[TABLE]
Compared to the complexity of the naive approach, this approximation’s complexity is .
6 Discussion
Kim et al. (2020) have also considered Hilbertian functional data, and we conclude by discussing the differences between the two contexts. Kim et al. (2020) obtain an optimal decomposition by optimizing over representations of the form
[TABLE]
where is the CONS to be optimized over and are random “coefficients” in the ambient Hilbert space. In this respect, it is quite different from the original Karhunen-Loève expansion, where the role of coefficients and CONS is the “reverse”: traditionally, the CONS lives in the same ambient space as the process and the coefficients are real-valued. The expansion of Kim et al. (2020) is arrived at via the eigendecomposition of the real-valued autocovariance kernel , for . Consequently, their analysis produces eigenfunctions residing in , regardless of the nature of .
In contrast, our decomposition is obtained by optimizing over representations of the form
[TABLE]
where is the CONS to be optimized over and are the random real-valued coefficients. This stems from the spectral analysis of the operator-valued kernel , and consequently produces principal components living in the ambient space . These are arguably more natural, but in any sense compatible with the traditional Karhunen-Loève theorem.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Adler (2010) Adler, R. J. (2010). The geometry of random fields . SIAM.
- 2Chambers (1977) Chambers, J. M. (1977). Computational methods for data analysis. A Wiley Publication in Applied Statistics .
- 3Grenander (1950) Grenander, U. (1950). Stochastic processes and statistical inference. Arkiv för matematik 1 (3), 195–277.
- 4Hsing and Eubank (2015) Hsing, T. and R. Eubank (2015). Theoretical foundations of functional data analysis, with an introduction to linear operators , Volume 997. John Wiley & Sons.
- 5Jolliffe (1986) Jolliffe, I. T. (1986). Principal component analysis . Springer Series in Statistics. Springer-Verlag, New York.
- 6Kac and Siegert (1947) Kac, M. and A. Siegert (1947). An explicit representation of a stationary gaussian process. The Annals of Mathematical Statistics 18 (3), 438–442.
- 7Karhunen (1946) Karhunen, K. (1946). Zur spektraltheorie stochastischer prozesse. Ann. Acad. Sci. Fennicae, AI 34 .
- 8Kim et al. (2020) Kim, D., Y. K. Lee, and B. U. Park (2020). Principal component analysis for hilbertian functional data. Communications for Statistical Applications and Methods 27 (1), 149–161.
