Displacement Convexity in Spatially Coupled Scalar Recursions
Rafah El-Khatib, Nicolas Macris, Tom Richardson, Ruediger Urbanke

TL;DR
This paper introduces a displacement convexity framework for analyzing spatially coupled scalar recursions, revealing conditions for unique fixed points and broad applicability across coding, sensing, and statistical models.
Contribution
It establishes displacement convexity of the potential functional for spatially coupled systems, linking fixed points to minimizers and providing conditions for their uniqueness.
Findings
Displacement convexity holds for a wide class of spatially coupled recursions.
Fixed points correspond to potential minimizers under mild conditions.
The framework applies to coding, compressive sensing, and statistical mechanics models.
Abstract
We introduce a technique for the analysis of general spatially coupled systems that are governed by scalar recursions. Such systems can be expressed in variational form in terms of a potential functional. We show, under mild conditions, that the potential functional is \emph{displacement convex} and that the minimizers are given by the fixed points of the recursions. Furthermore, we give the conditions on the system such that the minimizing fixed point is unique up to translation along the spatial direction. The condition matches those in \cite{KRU12} for the existence of spatial fixed points. \emph{Displacement convexity} applies to a wide range of spatially coupled recursions appearing in coding theory, compressive sensing, random constraint satisfaction problems, as well as statistical mechanical models. We illustrate it with applications to Low-Density Parity-Check and generalized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Displacement Convexity in Spatially
Coupled Scalar Recursions
Rafah El-Khatib, Nicolas Macris, Tom Richardson, Ruediger Urbanke
EPFL Switzerland, and Qualcomm USA
Emails: {rafah.el-khatib,nicolas.macris,ruediger.urbanke}@epfl.ch, [email protected]
Abstract
We introduce a technique for the analysis of general spatially coupled systems that are governed by scalar recursions. Such systems can be expressed in variational form in terms of a potential functional. We show, under mild conditions, that the potential functional is displacement convex and that the minimizers are given by the fixed points of the recursions. Furthermore, we give the conditions on the system such that the minimizing fixed point is unique up to translation along the spatial direction. The condition matches those in [1] for the existence of spatial fixed points. Displacement convexity applies to a wide range of spatially coupled recursions appearing in coding theory, compressive sensing, random constraint satisfaction problems, as well as statistical mechanical models. We illustrate it with applications to Low-Density Parity-Check and generalized LDPC codes used for transmission on the binary erasure channel, or general binary memoryless symmetric channels within the Gaussian reciprocal channel approximation, as well as compressive sensing.
I Introduction
Spatially coupled systems have been used recently in various frameworks such as coding [2], [3], [4], [5] (for a review of applications in the context of communications see [5] and references therein), compressive sensing [6], [7], statistical physics [8], [9], and random constraint satisfaction problems [10], [11]. These systems exhibit excellent performance, often optimal, under low complexity message passing algorithms, due to the threshold saturation phenomenon [5], [12], [13]. For example, spatially coupled high-degree regular LDPC codes achieve the Shannon capacity under belief propagation [5], [13]. Another line of research has used spatially coupled constructions to prove results about the original uncoupled underlying model. For example, this idea was used to obtain proofs of replica-symmetric formulas for the mutual information in coding [14], in rank-one matrix factorization [15], and to improve provable algorithmic lower bounds on phase transition thresholds of random constraint satisfaction problems [11].
Given the success of spatial coupling in a wide variety of problems, it should hardly come as a surprise that there are fundamental mathematical structures behind spatially coupling. This paper is concerned with a somewhat hidden convexity structure called displacement convexity. Some of our preliminary work on this matter appeared in [16], [17], [18].
The large system asymptotic performance of spatially coupled systems is assessed by the solutions of coupled density evolution (DE) type update equations. In general, the fixed points of these equations can be viewed as the stationary point equations of a functional that is typically called the “potential functional” and is an “average form” of the Bethe free energy [19] of the underlying graphical model.111 In the context of statistical mechanics, the potential functional is the “replica free energy functional” [20]. The precise connection between the Bethe free energy and the potential functional in the case of coding can be found in [13]. It has already been recognized that this variational formulation is a powerful tool to analyze DE updates under suitable initial conditions [1], [8], [12], [13]. There are various possible formulations of this potential functional; in this paper, we will use the representation from [1] for scalar systems.
In a previous contribution [16], we showed that the potential, in the form given in [12], associated to a spatially coupled low-density parity-check (LDPC) code whose single system is the -regular Gallager ensemble, with transmission over the binary erasure channel with parameter , or the BEC(), has a convex structure called displacement convexity. This structure is well-known in the theory of optimal transport [21]. In fact, the potential we consider in [16] is not convex in the usual sense but it is in the sense of displacement convexity. This, in itself, is an interesting property. Although the formalism in [16] can be extended to more general scalar recursions, for example, those pertaining to irregular LDPC codes, it does not appear to extend to a very wide class of general scalar recursions. The main purpose of the present paper is to prove that a rather general class of scalar systems also exhibits the property of displacement convexity, and even strict displacement convexity under rather mild assumptions. Although the analysis of the present paper is similar in spirit to [16] it is also significantly different and more far reaching in its range of applications. We use the potential in the representation of [1] which allows to obtain much more general proofs that hold under quite mild conditions. The results are applicable to recursions appearing not only in coding, but also in compressive sensing and random constraint satisfaction problems.
The main propositions of this paper are: Proposition V.1 that states that the potential functional has the displacement convexity property; Proposition VI.1 that asserts that monotonic minimizers of the potential functional are fixed point solutions of the spatially coupled DE equations (in a generalized sense); Proposition VII.4 that gives the condition for the unicity of the minimizers up to translations along the spatial axis. It is also of interest that the potential functional satisfies a rearrangement inequality, namely Proposition III.4 that ensures that one can find minimizers among monotonic spatial fixed points. The conditions for our results to hold are rather mild and essentially match those in [1] for the existence of spatial fixed points.
This manuscript is organized as follows. Section II introduces spatially coupled recursions and the variational formulation. In Section III, we prove rearrangement inequalities that allow us to reduce the search for minima of the potential to a space of monotonic functions, and, in Section IV, we discuss the existence question using the direct method from functional analysis. The potential is shown to be displacement convex in Section V. In Section VI, we generalize the notion of fixed point solutions to the DE equations and show that such generalized solutions are minimizers of the potential. Unicity of the minimizer is addressed in Section VII. In Section VIII, we illustrate displacement convexity with applications to coding and compressive sensing.
II Set Up and Variational Formulation
In this section, we explain the set-up for general spatially coupled scalar recursions and give a variational formulation of these recursions. The fixed point equations of the scalar recursions will be generically called “density evolution” (DE) equations. The case of regular -LDPC code ensembles with transmission over the BEC will serve as a concrete running example for the setting.
Consider the pair of DE fixed point equations
[TABLE]
where . The update functions , are assumed to be non-decreasing from to , and normalized such that and . We will think of them as EXIT-like curves of DE and for (see Fig. 1). It is always possible to adopt this normalization in specific applications.
- Example:
Take an -regular Gallager ensemble, with transmission over the BEC(). Let (resp. ) be the erasure probability emitted by the check (resp. variable) nodes. The DE fixed point equations are and . In this paper, we are interested in the specific value which is the MAP threshold of the ensemble. Let , be the non-trivial stable fixed point when . To achieve the normalization of (1) we make the change of variables and , so that the DE equations become and . Note that we must have and . We then set
[TABLE]
which satisfy the required normalizations and . The corresponding EXIT curves have three intersections. The one at corresponds to the trivial fixed point of DE, the one at corresponds to the stable non-trivial fixed point of DE, and the third one at a middle point corresponds to the unstable fixed point.
The natural setting for displacement convexity, at least in the context of spatial coupling, is the continuum setting, which can be thought of as an approximation of the corresponding discrete system in the regime of large spatial length and coupling window size. The continuum limit has already been introduced in the literature as a convenient means to analyze the behavior of an originally discrete model [1], [6], [8].
Consider a spatially coupled system with an averaging window which is always assumed to be bounded, non-negative, even, integrable, and normalized such that The averaging window is the means for the “coupling” in “spatial coupling”. Let us define the constant
[TABLE]
We assume throughout the paper that is finite. As we shall see, this is directly related to finiteness of the potential. Let be two functions and denote by and their usual convolutions with , i.e., and . The pair of fixed point DE equations of a spatially coupled scalar continuous system are
[TABLE]
where is the spatial position. We will often refer to the functions , as profiles and to , as update functions. A pair of profiles that solves the above equations almost everywhere will be called a fixed point, FP for short. Note that (4) are non-local equations because of the coupling through
In this paper, we are interested in profiles ( denotes a generic profile like and ) that satisfy the limit conditions
[TABLE]
We note that these two limit values are the extreme fixed points of (1) We will refer to such profiles as interpolating profiles. A pair of interpolating profiles that solves (4) is called an interpolating FP.
- Definition:
A function satisfying (5) is called an interpolating profile. A pair of interpolating profiles that solves (4) almost everywhere, i.e., up to a set of measure zero, is called an interpolating fixed point (FP).
In Section III, we show that when minimizing the potential functional over the space of interpolating profiles we can focus on monotonic (non-decreasing) profiles.
II-A Potential function associated to (1)
In [1] the following potential function is introduced,
[TABLE]
Often, when they are clear from context or irrelevant, we will drop the update functions and as arguments from the notation and denote this potential function by . Since and are non-decreasing the potential is convex in for fixed and convex in for fixed It is minimized over by setting and over by setting .
Substituting in (6), we obtain the integral of the signed area between the two EXIT curves (see fig. 1) as
[TABLE]
Note that this is the signed area bounded by the two curves and the region between the vertical axis at the origin and a vertical axis at .
In [1], the following key result was shown. It states that for an interpolating FP to exist the potential must be minimal at both limit points.
Lemma II.1
If there exists an interpolating FP solution to (4), then for all and
The result applies not only to interpolating FPs but also to a relaxed definition of interpolating “consistent” FPs (CFPs) that we define in Section VI. In [1], when the assumption is made, the condition for all is termed the positive gap condition (PGC). In this paper we will additionally assume throughout so the term positive gap condition will be used to imply both this equality and the inequality in Lemma II.1.
When the inequality in Lemma II.1 is strict, i.e., for then the condition is termed the strictly positive gap condition (SPGC) in [1]. In this case, it was shown that an interpolating fixed point profile exists provided is strictly positive on the interior of some interval and zero off of the interval. This support condition on can be relaxed under various other conditions (see [1]).
- Definition:
We say that the positive gap condition (PGC) is satisfied when and for all . The strictly positive gap condition is satisfied when and for .
- Example:
For the -regular Gallager ensemble, with transmission over the BEC() with we have the potential function
[TABLE]
and the signed area
[TABLE]
Moreover, we have . In fact, this last constraint together with the two fixed point equations and completely determine , and . The SPGC holds for this example (see Section VIII for further illustration).
II-B Potential functional of the spatially coupled system (4)
The solutions of spatially coupled DE equations (4) are given by the stationary point of a potential functional of and defined below. This can be checked by setting the functional derivatives of this potential functional with respect to each of and to zero. We set
[TABLE]
where we have introduced the notation
[TABLE]
- Example:
For the -regular LDPC code and transmission over the BEC(), the potential (8) is
[TABLE]
Note that the limit of the integrand in (8) (and the example) vanishes when because of the condition (5) on the profiles. It also vanishes when because of (5) and . However, this does not suffice for the existence of the integral, essentially due to the fact that may not be Lebesgue integrable (for monotonic profiles this difficulty does not arise). So it is possible that fails to be well-defined as a Lebesgue integral for some choices of the interpolating profiles.
Once we consider interpolating profiles and assume the PGC and that , we can circumvent this technical issue by defining the potential functional as
[TABLE]
We show below that the limit always exists (it is possibly ).
Lemma II.2
Assuming the PGC, we have for any interpolating profile pair that
[TABLE]
and given a sequence of interpolating pairs converging pointwise almost everywhere to an interpolating pair we have
[TABLE]
Proof:
Define and . Note that
[TABLE]
Now, if we define
[TABLE]
then
[TABLE]
Taking limits by definition (10) and Lemma .1, we obtain
[TABLE]
We will shortly see that the PGC implies is non-negative so that is well defined (it is possibly ). This also means that it is possible to adopt
[TABLE]
as an alternative expression for .
Now, note that and are convex functions because and are non-decreasing. Indeed
[TABLE]
By Jensen’s inequality we have
[TABLE]
and we therefore obtain
[TABLE]
which proves the non-negativity of since is non-negative by the PGC.
Integrating (14) and using (13), we obtain the first claim (11) of the lemma. Furthermore, we get the second claim (12) directly by applying Fatou’s lemma to (13) (we can apply Fatou’s lemma since by (14) is a non-negative sequence, and it converges to ). ∎
Let us remark that in the process of proving this lemma, we have seen can be defined as (10) or equivalently as (13), as long as we assume the PGC, interpolating profiles and .
II-C Discussion
In Section VI, we show that among all interpolating profiles, monotonic interpolating CFPs yield minimizers of . To do that, we use rearrangement properties that are summarized in Section III. For a fixed , we always have . This is because is convex in for fixed and setting minimizes over for fixed .
One of the main results of this paper is to show the displacement convexity of in its two arguments. More precisely, we can think of interpolating between two pairs and of monotonic profiles by interpolating their inverse functions. Hence, we consider
[TABLE]
and show that is a convex function of Note that for a monotonic interpolating profile the inverse function is uniquely defined for almost all and right and left limits and , respectively, are uniquely determined. Displacement convexity is explained in more detail in Section V.
Displacement convexity applies only to monotonic profiles. In the next section, we address the conditions under which one can conclude that minimizers of satisfying (5) can taken to be monotonic.
The following quantities will play a crucial role in the remainder of this work,
[TABLE]
Here, is called the kernel for reasons that will become clear. As will be seen, displacement convexity arises from the convexity of .
Lemma II.3
Assume that . Then, is well defined and convex.
Proof:
Using integration by parts, we can write
[TABLE]
For , we have
[TABLE]
so taking shows . Using (16) we conclude that
[TABLE]
Thus, is finite and well-defined. Convexity follows because . ∎
Much of the analysis in this paper proceeds relatively simply under the assumption that
[TABLE]
Most of our results will first be established under this assumption. In general, however, this assumption is not needed and it is sufficient only that We typically generalize our results to this case by taking limits. Let us discuss this issue.
- Definition:
We say that a function is saturated off of the finite interval if for and for
Given a profile let us define by
[TABLE]
By definition, is saturated off of (see Fig. 2).
Lemma II.4
Let be interpolating profiles and assume the PGC and that then
[TABLE]
Proof:
See Appendix -B. ∎
We end this section with another useful definition.
- Definition:
Assuming it exists, we define
[TABLE]
As we will see, the functional captures the “simple” (uncoupled) part of It is invariant under increasing rearrangements and linear under displacement interpolation.
III Rearrangements
Displacement convexity is usually defined on a space of probability measures. For measures on the real line, it is most convenient to view displacement convexity on a space of cumulative distribution functions (cdf’s). It is therefore fortunate that the search for the global minimum of the potential functional (8) can be reduced to the space of profiles and that are non-decreasing. In this section, we use the tool of increasing rearrangements to show that such rearrangements of and can only decrease the potential.
Symmetric decreasing rearrangements are a classical tool in analysis, see [22]. Here we will use a closely related cousin namely increasing rearrangements (see [23]). Our presentation is self-contained and no previous exposure to rearrangements is needed. Consider a profile that satisfies (5). The increasing rearrangement222Note that an increasing rearrangement is not necessarily strictly increasing. of is the increasing function that has the same limits, and where the mass of each level set is in some sense preserved (here the mass of a level set is infinite). More formally, let us represent in layer cake form as
[TABLE]
where is the indicator function of the level set . For each value , the level set can be written as the disjoint union of a bounded set and a half line . We define the rearranged set and then
[TABLE]
A simple example capturing the notion of increasing rearrangement is shown in Fig. 3.
Lemma III.1
Let and be two profiles satisfying (5). Then, assuming the left integral exists, we have
[TABLE]
Proof:
For each there exists a minimal such that Define and We also define the same quantities for the rearranged profiles and , namely , and . We show below that
[TABLE]
Equation (21) gives the result since, using the layer cake representation, it follows that
[TABLE]
Let us give an explicit argument for (21). We note that the infinite part of a level set can only increase under an increasing rearrangement, thus . So is common to and and subtracting it leaves two finite sets with the same finite measure since rearrangements are measure preserving, i.e.,
[TABLE]
(with the same on both sides). Thus,
[TABLE]
Similarly, , and (21) follows from these two identities. ∎
Lemma III.2
For any interpolating and , we have
[TABLE]
Proof:
If the left-hand side is infinite, then the result is immediate, so we assume that it is finite.
This result is very similar to the Hardy-Littlewood inequality for symmetric rearrangements. We will, however, give a self-contained elementary proof. The key inequality is the following which holds for all
[TABLE]
This gives the result since
[TABLE]
To see (LABEL:eqn:keyineq), observe that, for we have some maximal and minimal such that and where the unions are disjoint and If (see case a) in Fig. 4) then the right-hand side of (LABEL:eqn:keyineq) is [math] and (LABEL:eqn:keyineq) is immediate, so we assume otherwise. If (see case b) in Fig. 4) then we trivially have equality in (LABEL:eqn:keyineq) so we also assume We now have case c) in Fig. 4 and we obtain
[TABLE]
Note that the last line is non-negative because we are not in the case . Now, and can intersect only in the interval so we have
[TABLE]
and the lemma follows. ∎
Lemma III.3
For any interpolating and , we have
[TABLE]
Proof:
If the left-hand side is infinite, the inequality holds. Hence, we suppose it is finite. We have
[TABLE]
Since the integrand is non-negative and the integral is finite, we can apply the Fubini theorem to rewrite
[TABLE]
Now, we apply Lemma III.2 to the functions and , where . Note that is simply a translated version of , so its rearrangement is just obtained by the same translation of , i.e., . Thus,
[TABLE]
Multiplying by , integrating over , and using (24), we obtain (23).
∎
We are now ready to prove a rearrangement inequality for
Proposition III.4** (Monotonicity of Minimizers)**
Let and be profiles satisfying (5) and let and be their respective increasing rearrangements. Assume the PGC and that then we have
[TABLE]
Proof:
If the left-hand side of (25) is infinite, then the result is immediate, so we assume that is finite. Let us first assume that (in fact, we can assume the saturated case). It then follows that in Equ. (18) is finite. Note that if is monotone, then . Thus the increasing rearrangement of is equal to and similarly for the term . We can now apply Lemma III.1 (with suitable scaling) to conclude that For this case, the proposition now follows from Lemma III.3.
Now, we consider the general case, where possibly . Due to Lemma II.4, we have
[TABLE]
We remark that due to Lemma .2 Equ. (48). Therefore, using the saturated case, we have already established above, we have
[TABLE]
Finally, it is easy to see that for any interpolating profile , we have pointwise. By Lemma II.2 Equ. (12), we obtain
[TABLE]
Combining (26), (27), and (28) concludes the proof. ∎
Proposition III.4 shows that minimizers , of the functional can be found in the spaces of non-decreasing profiles. From now on, we therefore restrict the functional to those spaces.
IV Existence of Minimizers
The existence of a monotonic FP, which we will show is a minimizer of is proved in [1]. In this section, we give an alternate proof, under similar conditions, using the direct method of the calculus of variations [24], as was done in [17].
In the direct method of the calculus of variations, one constructs a minimizer as a limit point of a minimizing sequence. Since is invariant under a common translation of and , it is necessary to center the sequence in order to carry out the method. We can do this by translating and so that We call such a profile pair centered.
Proposition IV.1
Assume and assume the SPGC is satisfied. Then, there exists a monotonic non-decreasing profile pair that minimizes under the condition that has limit at and limit at
Proof:
We already remarked that we can adopt the alternative expression (13) for the potential functional, namely
[TABLE]
where . Therefore is bounded from below so, by Proposition III.4, there exists a minimizing sequence of monotonic profiles satisfying the limit condition, i.e.,
[TABLE]
Let us center the sequence so that for each Interpreting and as cumulative probability distributions, our aim is to show the tightness of the sequence, i.e., that the transition of and from to must occur in a bounded region for all
Let be an arbitrary finite constant. Then, we claim that that for any there exists such that implies that for and for (assuming is a centered monotonic profile pair satisfying the limit conditions).
This claim completes the proof. Indeed we can then extract from a subsequence converging to a limit point which necessarily satisfies the limit conditions, and by Fatou’s lemma
[TABLE]
so and is a monotone minimizing pair for the potential functional.
Now we prove the claim. Since, by Lemma .3, we have
[TABLE]
we see that implies that
[TABLE]
By the strictly positive gap condition, there exists such that unless we have either or Let be the least such that . Then,
[TABLE]
and . Thus for each , we have for . Similarly, we have for each we have for ∎
V Displacement Convexity
A generic functional on a space (of profiles, say) is said to be convex in the usual sense if, for any pair , and for all , and for the linear interpolation of the profiles, the inequality holds. Displacement convexity, on the other hand, is defined as convexity under an alternative interpolation called displacement interpolation. The usual setting for displacement convexity is a space of probability measures. For measures over the real line, one can conveniently define the displacement interpolation in terms of the cdf’s associated to the measures. This is the simplest setting and the one that we adopt here.
We think of the increasing profiles as right-continuous cdf’s of some underlying measures over the real line. As already stated, the inverse defined almost everywhere and with left and right limits and , respectively, are uniquely defined. However, at this point, it is useful to settle on the right-continuous inverse which is defined for all , namely .
Consider two profiles and , and assume is continuous. We can define a map as
[TABLE]
The map can be seen as a pushforward map for measures from to . This is expressed as which means
[TABLE]
for any function such that the integral is well-defined. Then, denoting by the identity map, the interpolant is the cdf of the measure defined by
[TABLE]
We have
[TABLE]
whenever the integral is defined. In particular, if is convex, then this shows convexity in of the integral due to the following,
[TABLE]
The graphical construction of the interpolant is illustrated in Fig. 5. Graphically, finds the position on the -axis so that for some given . Consider the linear interpolation between points on , The displacement interpolant is defined so that the following equality holds for all
In the case where is discontinuous, we have to be more careful in the definition. At points of discontinuity of , the map should not be single-valued. Since we work in one dimension, this issue is easily circumvented and we can in general define via its inverse as
[TABLE]
and (which is right continuous). Correspondingly, if is an interpolating increasing profile then, under appropriate regularity of we can write
[TABLE]
and we have
[TABLE]
With this in mind, we will continue to use the notation when the above interpretation should be understood.
In the remainder of this work, we consider two pairs of interpolating profiles and and consider the corresponding interpolants and .
We now state one of the main results of this paper.
Proposition V.1
Assume the PGC and Then, the potential is displacement convex; that is, for all ,
[TABLE]
We first show that it is sufficient to prove the proposition under the assumption that and are saturated. We recall that by Lemma II.4, for any monotonic interpolating pair we have
[TABLE]
Given any monotonic interpolating pairs , let denote the displacement interpolant of and It is easy to see that converges pointwise to when . By Lemma II.2 Equ. (12) we therefore have
[TABLE]
In view of (32) and (33), we see that (31) follows from
[TABLE]
which is the saturated case of (31). For the remainder of the section, we therefore assume the saturated case, and prove (34).
If and are saturated then we have
[TABLE]
Indeed,
[TABLE]
and the first term is integrable for saturated profiles , and the second term is also integrable because of Lemma .3 (note that is not necessarily saturated). This is the critical requirement since, by integrating by parts, we obtain
[TABLE]
The full derivation of this identity reads
[TABLE]
where we have used the fact the is well-defined.
The identity (35) leads to the following key result:
Lemma V.2
Let and be saturated, then
[TABLE]
is a convex function of
Proof:
Since and are saturated we have by (35) that
[TABLE]
This is convex in because the kernel is convex (see II.3). ∎
Lemma V.3
For any saturated pairs the functional is affine in
Proof:
We will show that is linear in . We start by the considering the first term of this difference. Using the layer cake representation and the monotonicity of the functions, we have
[TABLE]
Using (30) we can write write this as
[TABLE]
which is evidently linear in . Similarly for the second term in the difference , we obtain
[TABLE]
∎
We are now ready to prove the main result of this section.
Proof:
If or , then the result is immediate, so we assume both are finite. As argued above, we can assume that all functions are saturated. We rewrite the potential in (8) as follows
[TABLE]
By Lemma V.3, the functional is affine and hence convex in The second term was shown to be convex in Lemma V.2. ∎
VI Fixed Points and Minimizers
The main goal of this section is to prove Proposition VI.1, which states that a pair of monotonic profiles minimizes if and only if it is a “consistent” fixed point (CFP). It will be helpful to start with a preliminary discussion motivating the definition of CFP.
We already remarked that is convex in for fixed and minimized (over ) by setting , and similarly for and interchanged. From (9), a similar argument shows that and so that
[TABLE]
Under some conditions, we can have even though it is not the case that almost everywhere. This can happen, in particular, if is discontinuous and the pair does not satisfy the strictly positive gap condition.
One of the main analytical tools used in [1] was the construction of and given and so that form a “consistent” interpolating fixed point. Note that, from an interpolating fixed point, we can recover the graph of as the parametric curve as Given interpolating and , we denote the so obtained as (see [1] for more detail.) The update function is uniquely determined at points of continuity but may not be uniquely determined at points of discontinuity. In particular, if is constant over some open interval where is increasing then has a discontinuity at that value of and we see that we cannot have almost everywhere. Nevertheless, it is the case that for all and in this sense it satisfies the DE equation. In [1], the notation
[TABLE]
was used to capture this case.333More precisely, if plays the role of , we denote by when at points of continuity of and at points of discontinuity of . This motivates the following definition:
- Definition:
We say that an interpolating pair of profiles is a consistent fixed point (CFP) if and . Recall that is a fixed point (FP) if and almost everywhere, i.e., up to a set of measure zero.
Proposition VI.1
Let be monotonic and interpolating. Then is minimal - in the sense for any monotonic interpolating - if and only if is a CFP.
Proof:
If is not a CFP then either in which case the pair cannot be minimal, or we have either or , which shows that is not minimal.
To prove the converse, assume is a CFP. The proof proceeds by contradiction. Hence, we suppose there exists interpolating with and we shall deduce a contradiction. By Lemma II.4, we can assume that and are saturated.
We will show that we may also take to be saturated. Define
[TABLE]
so is a CFP for Since are saturated, it follows easily that
[TABLE]
and by Lemma .4 we have
[TABLE]
and we see that we can assume are saturated.
Since is a CFP it follows that and for all interpolating and Hence, we now have
[TABLE]
where is some positive constant. The last step follows from and for some positive constants and which follows from the saturation of and By Proposition V.1, we have
[TABLE]
Because of the assumption on the right-hand side of (38) is strictly negative. Thus (37) and (38) contradict each other for sufficiently small. We conclude that no such can exist. ∎
We conclude this section in Lemma VI.3 with a pleasing expression for when is a monotonic minimizer, equivalently a CFP. To obtain the expression, and for further application, we require a result concerning the following functional from [1],
[TABLE]
where
[TABLE]
Note that is non-negative; this is closely related to the positive gap condition. One of the main results in [1] (Lemma 9) is the following (this result is used in Section VII).
Lemma VI.2
Let be a CFP for (4), then
[TABLE]
It turns out for our application that we only require the case and in this case the right-hand side of (39) simplifies, at least at points of continuity of and to
[TABLE]
Lemma VI.3
If is a CFP then
[TABLE]
where
[TABLE]
Note that is a non-negative even function that tends to [math] at (recall is an odd function).
Proof:
By Lemma .4, it is enough to prove this for the saturated case. For the saturated case, we can integrate by parts to obtain
[TABLE]
From Lemma VI.2 and (39) (or (41)) , we have
[TABLE]
Combining these two equations we obtain
[TABLE]
Adding this to (35) yields the result by the definition of given in (18). ∎
VII Unicity of Minimizer
The existence of increasing interpolation solutions to (4) was established in [1] under the assumption of the strictly positive gap condition and assuming that is strictly positive on an interval and [math] off of (We shall refer to this as the interval support condition.) It was also shown in [1] that existence of such a fixed point implies the positive gap condition and, by example, it was shown that if for some , then there may be an infinite family of fixed point solutions that are not equivalent under translation. In this section we use displacement convexity to show that the solution whose existence was proved in [1] under the strictly positive gap condition is unique up to displacement.
It follows from Proposition VI.1 that all interpolating minimizers have the same potential and that they are all CFPs. By Proposition V.1, we see that if and are both monotonic interpolating CFPS then is a CFP for all Displacement convexity can therefore not be strict in this case. The aim of the proof is to show that the strictly positive gap condition then leads to the conclusion that all CFPs are equal up to translation.
Given and , we define
[TABLE]
Lemma VII.1
Let and be CFPs and assume the interval support condition. Then, for all , we have
[TABLE]
where denotes 2-d Lebesgue measure.
Proof:
We assume throughout that and are CFPs. Formally, we have
[TABLE]
The formula is derived in Appendix -E for saturated profiles. Note that the integrand is always non-negative so the integral is well-defined, although it may take the value We claim that
[TABLE]
Assume that the claim is false. Then there exists a set on which are all bounded such that
[TABLE]
In the saturated case, it is easy to see that is absolutely continuous and so is It now follows that for all large enough, we have
[TABLE]
and therefore, using the convexity of with respect to , we deduce that there is a positive constant such that, for all large enough, we have
[TABLE]
Applying Lemma II.4 and Lemma II.2(12), and noting that converges pointwise to yields
[TABLE]
which contradicts Proposition VI.1, thereby establishing the claim. Note that the claim gives the desired result except perhaps on a set of of measure
Now assume that for some we have
[TABLE]
By the continuity ( and are continuous off of at most a countable set) and inner-regularity of Lebesgue measure, there exists a closed set of positive measure and a constant such that for all we have and for all satisfying
For all , define
[TABLE]
Note that for and that is non-decreasing.
Let us find such that and Then, for any we have
[TABLE]
By the Fubini theorem, this contradicts our above established claim (43). ∎
Let us define and let denote the open interval centered at of length
Lemma VII.2
Let be a CFP and assume the SPGC and the interval support condition. For all , there exists and such that
Proof:
Let and define We must have since, otherwise, we obtain which, by Lemma VI.2, contradicts the SPGC.
To be more precise, for , let us define
[TABLE]
By (39) (see also (41)), the SPGC implies that the measure of at least one of and is strictly positive. We shall assume that the measure of is positive, and the other case can be handled similarly. It follows from monotonicity of and that there exists and sufficiently small such that We then have and, for small enough, , which gives ∎
By Lemma VII.1, we have that, if and are CFPs and the SPGC and interval support condition holds, then
[TABLE]
We claim that this implies that is essentially constant. Similarly, we have is essentially constant. Moreover, these two constants are equal.
Lemma VII.3
Assume the SPGC and the interval support condition and that and are CFPs. Then, is essentially constant on
Proof:
Let us assume that is not essentially constant, i.e., there exists a real value so that and . Then, there exists a value that is in the support of both sets, i.e., for any we have and
By Lemma VII.2, there exists a and such that By definition of , there is a positive constant such that for all , which now contradicts (44). This completes the proof. ∎
Proposition VII.4
Assume the SPGC and the interval support condition and that and are interpolating monotonic CFPs. Then, there exists such that, for almost all , we have and
Proof:
By Lemma VII.3, there exists such that for almost all Similarly, there exists such that for almost all It follows that for almost all It now follows from Lemma VII.2 and (44) that ∎
Even though we have stated and proved the results for CFPs, under the assumptions of this section CFPs are actually FPs.
Lemma VII.5
If is a CFP and satisfies the strictly positive gap condition and the interval support condition holds, then is a FP.
Proof:
If the SPGC and the interval support condition hold then and are strictly increasing wherever they take values in This implies that must be a FP (see [1] for further detail). ∎
VIII Illustrations
In this work, we have shown (Proposition IV.1 and Propositions V.1, VI.1, and VII.4) that under some conditions, the potential functional is displacement convex and that its minimizer exists and is unique up to translation. These conditions are the strictly positive gap condition, , and the interval support condition. In this section, we apply these results on different scalar systems when these conditions hold. In particular, for the applications we consider, we use the even uniform window with which implies the two latter conditions. We illustrate for each application that the strictly positive gap condition holds.
To check the SPGC one can directly look at , but there is also a simpler way to check the condition. Indeed, we already remarked that for fixed the potential is minimized by setting . Therefore,
[TABLE]
So the SPGC is valid as long as the signed area for . Similarly, for fixed , the potential is minimized by setting . Thus,
[TABLE]
where
[TABLE]
is the alternative signed area bounded between the two EXIT curves and the horizontal axis at the origin and at height . The SPGC is valid as long as for .
Clearly, when as assumed in this paper, we also have .
VIII-A LDPC Code Ensembles on the BEC
We demonstrate our results on the -regular spatially coupled LDPC code ensemble when transmission takes place over the BEC(). For this ensemble, we have the (unscaled) uncoupled DE equations and . We already showed how to perform the right scaling and ; asking that is a fixed point and we find , and . Replacing these numbers in the expression of the potential function (see Section II-A) we find . Fig. 6 and 7 illustrate the corresponding EXIT curves and the potential that is seen to satisfy the SPGC.
VIII-B Generalized LDPC Codes
We consider a generalized LDPC (GLDPC) code, where the check node constraints are given by a primitive BCH code with minimum distance (see [25] for more information). We consider the code with degree-2 variable nodes and degree- check nodes, with transmission over the BEC(). The (unscaled) uncoupled DE equations are [12]
[TABLE]
Set and , . We then get the scaled equations (1), namely ,
[TABLE]
The normalization condition and the condition completely determine , , and . The potential function and (alternative) signed area are given by
[TABLE]
The EXIT curves and signed area are illustrated in Fig. 8. and Fig. 9 for the GLDPC code with and . This corresponds to , , . Clearly, the SPGC condition is satisfied.
VIII-C The Gaussian Approximation
There are various forms of the Gaussian approximation [26], [27], [28] used to simplify the analysis of coding systems with transmission over binary memoryless symmetric (BMS) channels. Here, we consider a variant developed in [27], [28]. This method approximates the densities of the log-likelihood ratio (LLR) messages exchanged in the decoding graph with symmetric Gaussian densities; that is, densities of the form with the property . We also approximate the BMS channel with a binary-input Gaussian additive white noise (BIGAWN) channel with parameter and with the same entropy as the original channel . This makes the analysis one-dimensional and has been shown to serve as a good approximation.
The Gaussian approximation allows us to track the evolution of decoding by tracking the entropies of the LLR messages. Let denote the entropy of a symmetric Gaussian density of mean [29]. In particular, it can be expressed as
[TABLE]
Note that and . We consider the -regular LDPC code ensemble with transmission over the BMS.
The (unscaled) uncoupled DE equations are
[TABLE]
We define as the value of at the MAP threshold and set and , . We then get the scaled equations (1), namely ,
[TABLE]
The normalization condition and the condition completely determine , , and . The potential function is given by
[TABLE]
A plot of the EXIT curves and potential function yields curves that are very similar to the case of the BEC (see e.g. Figs 6, 7).
VIII-D Compressive Sensing
Consider a signal vector of length where the components are i.i.d. copies of a random variable . We assume that and that each component of is corrupted with Gaussian noise . We take measurements of the signal and assume that the measurement matrix has i.i.d. Gaussian components . The measurement ratio is defined by . Here we are interested in state evolution [6], which tracks the mean square error of the approximate message-passing (AMP) estimator (for the signal) . Given an that is large enough, the parameter is kept fixed as gets large.
The state evolution fixed point equations read
[TABLE]
where the minimum mean square error function is defined as follows. Let where is a scalar output and and let . Then . In the equations above, when we initialize with , is the average mean square error of the AMP estimator at iteration .
We now put this system of equations in the form (1). Here, there is no trivial fixed point ; however, the picture is very similar to LDPC coding-like systems considered above. The role of the “trivial” fixed point is played by a fixed point obtained by initializing state evolution with . Given the , for below the algorithmic threshold, this is the only fixed point, and for above this threshold, one finds three solutions (besides which is stable, there are an unstable and a stable fixed point). Set and . Equations (45) become
[TABLE]
Note that is a fixed point. We now scale , where and are chosen later on. Then (46) takes the form (1) with the EXIT curves defined as
[TABLE]
From these, one can compute the potential and the signed areas. Here, we illustrate the signed area. We have
[TABLE]
from which it follows that
[TABLE]
Finally, we set the signal-to-noise ratio to the value defined such that and . These conditions also determine and (note also that these values are a “non-trivial” stable fixed point). A plot of at yields a curve similar to Fig. 9 that satisfies the SPGC.
IX Conclusion
There are some questions that remain open. We have seen in Section III that we restrict our search of minimizing profiles to the space of increasing profiles. It is not clear in our settings when the inequality (25) is strict and so we cannot exclude the existence of a minimizing pair outside the spaces of increasing profiles. Another more fundamental open problem comes back to our formulation of the potential. In applications, it is inherently discrete whereas in our analysis, it is convenient to consider the continuum limit approximation of the potential. It would be interesting to see whether this analysis can be adapted to the discrete formulation.
Acknowledgment
We thank Vahid Aref and Marc Vuffray for interesting discussions at the early stages of this work.
The appendix contains proofs of the various limit results that allow the generalization of arguments from the saturated case to the non-saturated case, as well as some elementary technical results.
-A Integrability
Lemma .1
Let be an interpolating profile and assume that Then,
[TABLE]
Proof:
Assume that . By the evenness of we have
[TABLE]
Applying the Fubini theorem, we have
[TABLE]
where we introduce the notation
[TABLE]
From these two expressions we obtain the two bounds
[TABLE]
Letting be arbitrary, we have
[TABLE]
Since we see, by choosing , that we have
[TABLE]
Similarly, we have
[TABLE]
which, by choosing , gives
[TABLE]
∎
-B Basic Bounds
We begin with some approximation limits.
Lemma .2
Let be an interpolating profile (i.e., one satisfying (5)) and assume that Then
[TABLE]
Proof:
Define
[TABLE]
and note that We have
[TABLE]
from which we obtain (using changes of variables)
[TABLE]
and (48) now follows. The inequality (49) can be shown similarly by first noting that
[TABLE]
and writing
[TABLE]
Using again changes of variables and the upper bound on , we find that , which proves (49).
Now we show (50). We have
[TABLE]
from which we obtain
[TABLE]
and (50) follows. ∎
We can now prove Lemma II.4. For convenience we restate the lemma.
(Lemma II.4): Let and be interpolating profiles and assume the PGC and . Then
[TABLE]
Proof:
If then the result follows from Lemma II.2 Equ. (12). We assume now that and note that it is then sufficient to show that
[TABLE]
The expression between parentheses can be written as
[TABLE]
where the last term follows from the fact that . The result now follows from Lemma (.2). ∎
-C Rearrangement
Now, we focus on monotonic profiles. In particular, we prove the following lemma which is used throughout the paper.
Lemma .3
For any non-decreasing function , we have
[TABLE]
Proof:
First, we note that
[TABLE]
and we obtain
[TABLE]
where the next-to-last step follows by the layer-cake representation and the monotonicity of . ∎
-D Minimizers
In this section, we focus on limit results specific to CFPs.
Lemma .4
Assume and let be an interpolating CFP. Let us define
[TABLE]
Then
[TABLE]
Proof:
From (40) and (41), for any CFP, we have
[TABLE]
and, since we clearly have
[TABLE]
Thus, it only remains to show that
[TABLE]
By Lemma .3, we have
[TABLE]
and by Lemma .2 Equ. (48), we have
[TABLE]
The result now follows from Lemma .2 Equ. (50). ∎
-E Second derivative
We recall from the proof of Proposition V.1 that, for saturated profiles,
[TABLE]
The representation used in Lemma V.2 for the second term is equivalent to
[TABLE]
Moreover, we saw in Lemma V.3 that is affine in . So, using , we immediately get
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] S. Kudekar, T. Richardson, and R. L. Urbanke, “Wave-like solutions of general one-dimensional spatially coupled systems,” IEEE Transcations on Information Theory , vol. 61, no. 8, pp. 4117–4157, 2015.
- 2[2] A. J. Felstrom and K. S. Zigangirov, “Time-Varying Periodic Convolutional Codes With Low-Density Parity-Check Matrix,” IEEE Transactions on Information Theory , pp. 2181–2190, 1999.
- 3[3] M. Lentmaier, A. Sridharan, K. S. Zigangirov, and D. J. Costello Jr, “Terminated LDPC convolutional codes with thresholds close to capacity,” in IEEE International Symposium on Information Theory Proceedings (ISIT) . IEEE, 2005, pp. 1372–1376.
- 4[4] M. Lentmaier, A. Sridharan, D. J. Costello Jr, and K. Zigangirov, “Iterative decoding threshold analysis for LDPC convolutional codes,” IEEE Transactions on Information Theory , vol. 56, no. 10, pp. 5274–5289, 2010.
- 5[5] S. Kudekar, T. Richardson, and R. L. Urbanke, “Spatially coupled ensembles universally achieve capacity under belief propagation,” IEEE Transactions on Information Theory , vol. 59, no. 12, pp. 7761–7813, 2013.
- 6[6] D. L. Donoho, A. Javanmard, and A. Montanari, “Information-theoretically optimal compressed sensing via spatial coupling and approximate message passing,” in IEEE International Symposium on Information Theory Proceedings (ISIT) . IEEE, 2012, pp. 1231–1235.
- 7[7] F. Krzakala, M. Mézard, F. Sausset, Y. Sun, and L. Zdeborová, “Probabilistic reconstruction in compressed sensing: algorithms, phase diagrams, and threshold achieving matrices,” Journal of Statistical Mechanics: Theory and Experiment , vol. 2012, no. 08, p. P 08009, 2012.
- 8[8] S. H. Hassani, N. Macris, and R. Urbanke, “Coupled Graphical Models and Their Thresholds,” in Information Theory Workshop (ITW) , 2010, pp. 1–5.
