Necessary conditions for linear convergence of iterated expansive, set-valued mappings with application to alternating projections
D. Russell Luke, Marc Teboulle, Nguyen H. Thao

TL;DR
This paper establishes necessary conditions, including metric subregularity and subtransversality, for the linear convergence of fixed point iterations like alternating projections, especially when mappings are expansive or set-valued.
Contribution
It introduces necessary conditions for linear convergence of expansive, set-valued mappings, extending the understanding of convergence behavior beyond nonexpansive cases, with applications to alternating projections.
Findings
Metric subregularity is necessary for linear convergence.
Subtransversality of sets is essential for the convergence of alternating projections.
Various regularity conditions influence the necessity of these geometric properties.
Abstract
We present necessary conditions for monotonicity, in one form or another, of fixed point iterations of mappings that violate the usual nonexpansive property. We show that most reasonable notions of linear-type monotonicity of fixed point sequences imply {\em metric subregularity}. This is specialized to the alternating projections iteration where the metric subregularity property takes on a distinct geometric characterization of sets at points of intersection called {\em subtransversality}. Our more general results for fixed point iterations are specialized to establish the necessity of subtransversality for consistent feasibility with a number of reasonable types of sequential monotonicity, under varying degrees of assumptions on the regularity of the sets.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Necessary conditions for linear convergence of iterated expansive, set-valued mappings with application to alternating projections
D. Russell Luke , Marc Teboulle and Nguyen H. Thao Institut für Numerische und Angewandte Mathematik, Universität Göttingen, 37083 Göttingen, Germany. DRL was supported in part by German Israeli Foundation Grant G-1253-304.6 and Deutsche Forschungsgemeinschaft Research Training Grant 2088 TP-B5. E-mail: [email protected] of Mathematical Sciences, Tel Aviv University, Tel Aviv 69978, Israel. MT was supported by German Israeli Foundation Grant G-1253-304.6 and Israel Science Foundation ISF Grant 1844-16. E-mail: [email protected] Center for Systems and Control, Delft University of Technology, 2628CD Delft, The Netherlands. Department of Mathematics, Teacher College, Cantho University, Cantho, Vietnam. NHT was supported by German Israeli Foundation Grant G-1253-304.6. E-mail: [email protected], [email protected], [email protected]
(April 28, 2017)
Abstract
We present necessary conditions for monotonicity, in one form or another, of fixed point iterations of mappings that violate the usual nonexpansive property. We show that most reasonable notions of linear-type monotonicity of fixed point sequences imply metric subregularity. This is specialized to the alternating projections iteration where the metric subregularity property takes on a distinct geometric characterization of sets at points of intersection called subtransversality. Our more general results for fixed point iterations are specialized to establish the necessity of subtransversality for consistent feasibility with a number of reasonable types of sequential monotonicity, under varying degrees of assumptions on the regularity of the sets.
2010 Mathematics Subject Classification: Primary 49J53, 65K10 Secondary 49K40, 49M05, 49M27, 65K05, 90C26.
Keywords: Almost averaged mappings, averaged operators, calmness, cyclic projections, elemental regularity, feasibility, fixed points, fixed point iteration, metric regularity, metric subregularity, nonconvex, nonexpansive, subtransversality, transversality
1 Introduction
In recent years there has been a lot of progress in determining ever weaker conditions to guarantee local linear convergence of elementary fixed point algorithms, with particular attention given to the method of alternating projections and the Douglas-Rachford iteration [24, 23, 11, 13, 6, 29, 32, 26, 12]. These works beg the question: what are necessary conditions for linear convergence? We shed some light on this question for iterations generated by not necessarily nonexpansive fixed point mappings and show how our theory specializes for the alternating projections iteration in nonconvex and convex settings.
Our work builds upon the terminology and theoretical framework established in [26]. As much as possible, we have tried to make the present analysis self-contained, but it is not possible to reproduce all the results taken from [26]. After introducing basic notation and definitions in Section 2, we clarify first what we mean by linear convergence, since there are many ways in which a sequence can behave linearly with respect to the set of fixed points. We introduce a generalization of Fejér monotonicity, namely linear monotonicity (Definition 2.2) which is central to our development. We also introduce another generalization, linearly extendible sequences in Definition 2.5, that concerns sequences which can be viewed as the subsequence of some monotone sequence. This is key to the application to alternating projections studied in Sections 4 and 5. In Section 3 we lay the groundwork for the first main result on necessary conditions for linearly monotone fixed point iterations with respect to (Theorem 3.12). The result states that metric subregularity (Definition 3.9) is necessary for linearly monotone fixed point iterations. If in addition the fixed point operator is almost averaged at points in (Definition 3.2), then metric subregularity is necessary for linear convergence of the iterates to a point in (Corollary 3.13). Sections 4 and 5 are specializations to the case of alternating projections for consistent feasibility. In this setting metric subregularity takes on the more directly geometric interpretation as subtransversality of the sets at common points (Definition 4.4). Theorem 4.12 establishes the necessity of subtransversality for alternating projections iterations to be linearly monotone with respect to common points. Corollary 4.13 then shows that for sets with a certain elemental subregularity (Definition 4.1) subtransversality is necessary and sufficient for linear monotonicity of the sequence. For sequences that are R-linearly convergent to a fixed point and satisfy a subsequential linear monotonicity property (condition (28)), Theorem 4.15 shows that subtransversality is also necessary. Subtransversality is also shown to be necessary for sequences to have linearly extendible subsequences (Theorem 4.16). These results correspond to our observation in Proposition 4.18 that subtransversality has appeared in one form or another in all sufficient conditions for linear monotonicity or convergence of alternating projections for consistent feasibility that have appeared previously in the literature. In Section 5 these results are further specialized to the case of convex feasibility. We show in Theorems 5.4 and 5.5 that metric subregularity of some form is necessary and sufficient for local and global linear convergence of alternating projections. Moreover, we show in Proposition 5.9 that R-linear convergence of the sequences in this setting is equivalent to linear monotonicity of the sequence with respect to points of intersection. For Q-linear convergence, we show that linear extendability is necessary (Proposition 5.10).
Based on the results obtained here we conjecture that, for alternating projections applied to inconsistent feasibility, subtransversality as extended in [20, Definition 3.2] is also necessary for R-linear convergence of the iterates to fixed points.
2 Notation and basic definitions
Throughout our discussion is a Euclidean space. Given a subset , stands for the distance from a point to : . The projector onto the set , , is central to algorithms for feasibility and is defined by
[TABLE]
A projection is a selection from the projector. This exists for any closed set , as can be deduced by the continuity and coercivity of the norm. Note that the projector is not, in general, single-valued, and indeed uniqueness of the projector defines a type of regularity of the set : local uniqueness characterizes prox-regularity [33] while in finite dimensional settings global uniqueness characterizes convexity [9].
Given a subset and a point , the Fréchet, proximal and limiting normal cones to at are defined, respectively, as follows:
[TABLE]
In the above, means that with .
Our other basic notation is standard; cf. [27, 34, 10]. The open unit ball in a Euclidean space is denoted . stands for the open ball with radius and center ; is the open ball of radius centered at the origin.
To quantify convergence of sequences and fixed point iterations, we focus primarily on linear convergence, though sublinear convergence can also be handled in this framework. Linear convergence, however, can come in many forms. We list the more common notions next.
Definition 2.1** (R- and Q-linear convergence to points, Chapter 9 of [30]).**
Let be a sequence in .
- (i)
* is said to converge R-linearly to with rate if there is a constant such that*
[TABLE] 2. (ii)
* is said to converge Q-linearly to with rate if*
[TABLE]
By definition, Q-linear convergence implies R-linear convergence with the same rate. Elementary examples show that the inverse implication does not hold in general.
One of the central concepts in the convergence of sequences is Fejér monotonicity: a sequence is Fejér monotone with respect to a nonempty convex set if
[TABLE]
In the context of convergence analysis of fixed point iterations, the following generalization of Fejér monotonicity of sequences is central.
Definition 2.2** (-monotonicity).**
Let be a sequence on , be nonempty and satisfy and
[TABLE]
- (i)
* is said to be -monotone with respect to if*
[TABLE] 2. (ii)
* is said to be linearly monotone with respect to if (2) is satisfied for for all and some constant .*
The focus of our study is linear convergence, so only linear monotonicity will come into play in what follows. A study of other kinds of convergence, particularly sublinear, would employ the full generality of -monotonicity.
The next result is clear.
Proposition 2.3** (Fejér monotonicity implies -monotonicity).**
If the sequence is Fejér monotone with respect to then it is -monotone with respect to with .
The converse is not true, as the next example shows.
Example 2.4** (-monotonicity is not Fejér monotonicity).**
Let and consider the sequence for all . This sequence is linearly monotone with respect to with constant , but not Fejér monotone since for all .
The next definition will come into play in Sections 4 and 5. It provides a way to analyze fixed point iterations which, like our main example of alternating projections, are compositions of mappings.
The subset appearing in Definition 2.5 and throughout this work is always assumed to be closed and nonempty. We use this set to isolate specific elements of the fixed point set (most often restricted to affine subspaces). This is more than just a formal generalization since in some concrete situations the required assumptions do not hold on but they do hold on relevant subsets.
Definition 2.5** (linearly extendible sequences).**
A sequence on is said to be linearly extendible on with frequency ( is fixed) and rate if there is a sequence on such that for all and the following conditions are satisfied for all :
[TABLE]
When , the quantifier “on ” is dropped.
The requirement on the linear extension sequence means that the sequence of the distances between its two consecutive iterates is uniformly non-increasing and possesses a subsequence of type that converges Q-linearly with a global rate to zero.
The extension of sequences of fixed point iterations will most often be to the intermediate points generated by the composite mappings. In the case of alternating projections this is , and . This strategy of analyzing alternating projections by keeping track of the intermediate projections has been exploited to great effect in [24, 23, 6, 29, 11, 26]. From the Cauchy property of , one can deduce R-linear convergence from linear extendability.
Proposition 2.6** (linear extendability implies R-linear convergence).**
If the sequence on is linearly extendible on with some frequency and rate , then converges R-linearly to a point with rate .
Proof. Let be a linear extension of on with frequency and rate . Conditions (3) and (4) then imply by induction that
[TABLE]
where . This means that is a Cauchy sequence and hence converges to a limit , which is in by the closedness of this set. Conditions (3) and (4) also yield that for every ,
[TABLE]
where
[TABLE]
This means that converges R-linearly to with rate .
3 Linearly monotone fixed point iterations
Quantifying the convergence of fixed point iterations is key to providing error bounds for algorithms. For a multi-valued self-mapping , the operative inequality leading to linear convergence of the fixed point iteration is
[TABLE]
for and . When this holds, the sequence is linearly monotone with respect to and constant .
For multi-valued mappings, however, we need to clarify what is meant in the first place by the fixed point set. We take the least restrictive definition as any point contained in its image via the mapping.
Definition 3.1** (fixed points of set-valued mappings).**
The set of fixed points of a possibly set-valued mapping is defined by
[TABLE]
As noted in [26, Example 2.1], for , it is not the case that . This can happen, in particular, when the mapping is multi-valued on its set of fixed points. Almost averaged mappings detailed next are a generalization of averaged mappings and rule out so-called inhomogeneous fixed point sets.
3.1 Almost averaged mappings
Definition 3.2** (almost nonexpansive/averaged mappings, Definition 2.2 of [26]).**
Let be a nonempty subset of and let .
- (i)
* is said to be pointwise almost nonexpansive on at if there exists a (called the violation) such that*
[TABLE]
If (7) holds with then is called pointwise nonexpansive at on .
If is pointwise (almost) nonexpansive on at every point on a neighborhood of in (with the same violation ), then is said to be (almost) nonexpansive on at (with violation ).
If is pointwise (almost) nonexpansive at every point (with the same violation ) on , then is said to be (almost) nonexpansive on (with violation ). 2. (ii)
* is called pointwise almost averaged on at with violation if there is an averaging constant such that the mapping defined by*
[TABLE]
is pointwise almost nonexpansive on at with violation .
Likewise is said to be (pointwise) (almost) averaged on (at ) (with violation ) if is (pointwise) (almost) nonexpansive on (at ) (with violation ).
Remark 3.3**.**
The following remarks help to place this property in context.
- (a)
A mapping is averaged with violation and averaging constant at all points on if and only if it is firmly nonexpansive on . 2. (b)
As noted in **[26]**, pointwise almost nonexpansiveness of at with violation is related to, but stronger than calmness **[34, Chapter 8.F]** with constant : for pointwise almost nonexpansiveness the inequality (7) must hold for all pairs and , while for calmness the same inequality would hold only for points and their projections onto . 3. (c)
See **[26, Example 2.2]** for concrete examples.
Proposition 3.4** (characterizations of pointwise averaged mappings).**
[26, Proposition 2.1]** Let , and let . The following are equivalent.
- (i)
* is pointwise almost averaged on at with averaging constant and violation .* 2. (ii)
* is pointwise almost nonexpansive on at with violation .* 3. (iii)
[TABLE]
As a consequence, if is pointwise almost averaged at with any averaging constant and violation on , then is pointwise almost nonexpansive at with violation at most on .
Remark 3.5**.**
Pointwise almost averaged mappings are single-valued on the set of fixed points [26, Proposition 2.2]. If the mapping is actually nonexpansive (that is, almost nonexpansive with violation zero) on , then it must be single-valued on . When this happens, we simply write instead of .
It was proved in [5, Theorem 5.12] that if is Fejér monotone with respect to a nonempty closed convex subset and inequality (6) holds true with in place of , then converges R-linearly to a point in with rate at most . The following statement aligns with this fact.
Proposition 3.6** **(linear monotonicity and almost
averagedness imply R-linear convergence).
Let and for all . Suppose that is closed and nonempty and that is pointwise almost averaged at all points on , where , that is, is almost averaged on . If the sequence is linearly monotone with respect to with constant , then converges R-linearly to some point with rate .
Proof. We use the characterization formulated in Proposition 3.4iii of the almost averagedness of with averaging constant and violation . Combining (6) with in place of and (8) (with averaging constant and violation ) implies by induction that for every ,
[TABLE]
where is any point in . Hence, for any natural numbers and with , we have
[TABLE]
This implies that is a Cauchy sequence and therefore convergent to some point .
We claim that . Indeed, let us define
[TABLE]
The sequence is contained in the bounded set since
[TABLE]
Hence it has a subsequence converging to some as . Since the corresponding subsequence converges to and
[TABLE]
as , we deduce that .
Letting in (9) yields (1) with , which completes the proof.
The converse implication of Proposition 3.6 is not true in general because condition (1) in principle does not require the distance to strictly decrease after every iterate while condition (6) does.
Almost nonexpansivity of and linear extendability of the iteration are sufficient to guarantee that the sequence converges R-linearly to a point in . Compare this to Proposition 2.6 which, without the additional assumption of almost nonexpansivity of , only guarantees convergence to a point in .
Proposition 3.7** (linear extendability and almost nonexpansivity imply R-linear convergence).**
Let and be a sequence generated by for all . Suppose that is linearly extendible on with some frequency and rate and that is almost nonexpansive on , where is given by (5). Then converges R-linearly to a point with rate .
Proof. By Proposition 2.6 the sequence converges R-linearly to a point with rate . It remains to check that . Suppose to the contrary that there is with . Since is almost nonexpansive on , there is a violation such that
[TABLE]
This leads to a contradiction since while as . As a result, and the proof is complete.
When the fixed point set restricted to is an isolated point, then linear monotonicity of the sequence is equivalent to Q-linear convergence.
Proposition 3.8**.**
Let be an isolated point of , that is for small enough. Let be almost nonexpansive on a neighborhood of relative to . Let be a sequence generated by for all with sufficiently close to . Then is linearly monotone with respect to with rate smaller than 1 if and only if it is Q-linearly convergent to .
Proof. Since is an isolated piont of and is almost nonexpansive on a neighborhood of relative to , there is a small enough that and is almost nonexpansive on with violation , where . Let . Then by almost nonexpansivity of on we have that
[TABLE]
This implies that
[TABLE]
Hence for any sequence as described in Proposition 3.8 with , the equivalence of linear monotonicity of relative to with rate smaller than 1 and Q-linear convergence to follows from equality (10) and the definitions because each of these properties of alternatively combined with (10) ensures inductively that the whole sequence lies in .
It is mainly due to the above proposition that we include the extra technical overhead of making the above statements always relative to some subset . It is not uncommon to have a singleton relative to , but not on the whole space . For an example of this, see the analysis of the Douglas-Rachford fixed point iteration in [2].
3.2 Metric subregularity and linear convergence
The following concept of metric regularity on a set characterizes the stability of mappings at points in their image and has played a central role, implicitly and explicitly, in our convergence analysis of fixed point iterations [13, 2, 26]. We show in this section that, indeed, metric subregularity is necessary to achieve linear convergence. The following definition is a specialized (linear) variant of [26, Definition 2.5] which is a combination with slight modification of those formulated in [16, Definition 2.1 (b)] and [17, Definition 1 (b)] so that the property is relative to relevant sets for iterative methods. Our terminology also follows [10].
Definition 3.9** (metric regularity on a set).**
- Let be a set-valued mapping between Euclidean spaces and let and . The mapping is called metrically regular relative to on for with constant if*
[TABLE]
holds for all and .
When consists of a single point one says that is metrically subregular with constant on for relative to .
When , the quantifier “relative to” is dropped.
Remark 3.10**.**
The conventional concept of metric regularity at a point for corresponds to setting , and taking and to be neighborhoods of and (as opposed simply to subsets including these points) respectively. Similarly, the conventional metric subregularity [10] and metric hemi/semiregularity [22, 1, 19] at for correspond to setting , and respectively either taking to be a neighborhood of and , or taking and taking to be a neighborhood of . This notion can and has been generalized even more. The more general notion of metric subregularity studied by Ioffe [16, 17] for instance, together with -monotonicity, would be needed for the study of nonlinear convergence. These more general notions of metric subregularity are the most suitable vehicles to parallel properties like the Kurdyka-Łojasiewicz (KL) property for functions. In fact, for differentiable functions, metric regularity of the gradient is equivalent to the KL property [8, Corollary 4 and Remark 5], though from our point of view, metric subregularity is the more general property.
The following convergence criterion is fundamental.
Theorem 3.11** (linear convergence with metric subregularity).**
Let , let with closed and nonempty. Suppose that, for some fixed , is pointwise almost averaged at all points with averaging constant and violation on , and that the mapping is metrically subregular on for [math] relative to with constant . Then it holds
[TABLE]
where
[TABLE]
In particular, when , every sequence generated by with initial point in converges R-linearly to some point in with rate . If is a singleton, then the convergence is Q-linear.
Proof. The inequality (12) is the content of [26, Corollary 2.3]. Since this is easy to obtain, we reproduce the proof here for convenience. Choose any and select any . Metric subregularity of on for [math] relative to with constant means that
[TABLE]
Since , this then implies that
[TABLE]
Note that is a single-valued mapping on since is almost averaged – hence almost nonexpansive – on [26, Proposition 2.2], so we can unambiguously write for and rewrite the inequality as
[TABLE]
This inequality, together with the almost averaging property and its characterization Proposition 3.4iii yield
[TABLE]
Note in particular that . Since is any point in this proves (12) with given by (13).
For , it follows by definition that such a sequence is linearly monotone with respect to with rate . A combination of Propositions 3.6 and 3.8 then shows that the sequence is linearly convergent to a point in , R-linearly in general, and Q-linearly if is a singleton.
When , Theorem 3.11 provides a criterion for global linear convergence of abstract fixed point iterations. The next result shows that metric subregularity is in fact necessary for linearly monotone iterations, without any assumptions about the averaging properties of , almost or otherwise.
Theorem 3.12** (necessity of metric subregularity).**
Let with nonempty, fix so that is closed and nonempty, and let . If for each , every sequence generated by is linearly monotone with respect to with constant , then the mapping is metrically subregular on for [math] relative to with constant .
Proof. Since every sequence generated by starting in is linearly monotone with respect to with rate , the inequality (6) with in place of holds true. This together with the triangle inequality implies that for every ,
[TABLE]
On the other hand, we have from definition of that
[TABLE]
Combining (15) and (16) yields
[TABLE]
Consequently, is a constant of metric subregularity of on for [math] (not necessarily the smallest such constant) as claimed.
Corollary 3.13** (necessary conditions for linear convergence).**
Let with nonempty. For some , let be almost averaged on . If, for each , every sequence generated by is linearly monotone with respect to with constant , then all such sequences converge R-linearly with rate to some point in and is metrically subregular on for [math] relative to with constant .
Proof. This is an immediate consequence of Proposition 3.6 and Theorem 3.12.
4 Nonconvex alternating projections
For given, the alternating projections (AP) iteration for two closed subsets is given by
[TABLE]
For convenience, we associate with the sequence on such that and for all . In the sequel, we frequently use the joining sequence given by
[TABLE]
We will always assume, without loss of generality, that .
It is well known that every alternating projections iteration for two convex intersecting sets globally converges R-linearly to a feasibility solution if the collection of sets is what we call subtransversal [4]. The later property and its at-point version is a specialization of metric subregularity to the context of set feasibility.
4.1 Elemental regularity and subtransversality
Convexity of the underlying sets has long been the standing assumption in analysis of projection algorithms. The next definition characterizing regularity of nonconvex sets first appeared in [21, Definition 5] and encapsulates many of the regularity notions appearing elsewhere in the literature including Hölder regularity [29, Definition 2], relative ()-subregularity [13, Definition 2.9], restricted -regularity [6, Definition 8.1], Clarke regularity [34, Definition 6.4], super-regularity [23, Definition 4.3], prox-regularity [33, Definition 1.1], and of course convexity. The connection of elemental regularity of sets to the pointwise almost averaging property of their projectors is discussed later.
Definition 4.1** (elemental regularity of sets).**
Let be nonempty and let .
- (i)
The set is said to be elementally subregular relative to at for with constant if there exists a neighborhood of such that
[TABLE] 2. (ii)
The set is said to be uniformly elementally subregular relative to at for if, for any , there exists a neighborhood (depending on ) of such that (19) holds. 3. (iii)
The set is said to be elementally regular at for with constant if there exists a neighborhood of such that, for each , the set is elementally subregular relative to at for with constant . 4. (iv)
The set is said to be uniformly elementally regular at for if, for any , the set is elementally regular at for with constant .
The reference points and in Definition 4.1, need not be in and , respectively, although these are the main cases of interest for us. The properties are trivial for any constant , so the only case of interest is elemental (sub)regularity with constant .
Proposition 4.2** (Proposition 4(vii) of [21]).**
Let be closed and nonempty. If is convex, then it is elementally regular at all points for all with constant and the neighborhood for both and .
The next result shows the implications of elemental regularity of sets for regularity of the corresponding projectors.
Theorem 4.3** (projectors onto elementally subregular sets, Theorem 3.1 of [26]).**
Let be nonempty closed, and let be a neighborhood of . Let and . If is elementally subregular at relative to for each
[TABLE]
with constant on the neighborhood , then the following hold.
- (i)
The projector is pointwise almost nonexpansive at each on with violation . That is
[TABLE] 2. (ii)
The projector is pointwise almost firmly nonexpansive at each on with violation . That is
[TABLE]
In addition to the pointwise almost averaging property, metric subregularity plays a central role in the general theory of Section 3. In the context of set feasibility, this is translated to what we call subtransversality below. What we present as the definition of subtransversality is the simplified version of [26, Definition 3.2(i)].
Definition 4.4** (subtransversality).**
Let and be closed subsets of , let be endowed with some norm and let . The collection of sets is said to be subtransversal at for relative to if there exist numbers and such that
[TABLE]
holds true for all
When , the quantifier “relative to” is dropped.
The reference point in Definition 4.4 need not be in , although this is the only case of interest for us. The following characterization of subtransversality at common points will play a fundamental role in our subsequent analysis.
Proposition 4.5** (subtransversality at common points, Proposition 3.3 of [26]).**
Let and be closed subsets of . Let be endowed with -norm
[TABLE]
The collection of sets is subtransversal relative to
[TABLE]
at with for if and only if there exist numbers and such that
[TABLE]
The relative set given by (20) which makes the notion of subtransversality consistent in the product space can clearly be identified with the set . We will therefore more often than not use the terms “relative to ” instead of “relative to ” and “at ” instead of “at for ” when discussing subtransversality at common points where the product-space structure is no longer needed.
Remark 4.6**.**
It follows from Proposition 4.5 that the exact lower bound of all numbers such that condition (21) is satisfied, denoted , characterizes the subtransversality of the collection of sets at common points. More specifically, the collection of sets is subtransversal at if and only if .
The property (21) with has been around for decades under the names of (local) linear regularity, metric regularity, linear coherence, metric inequality, and subtransversality; cf. [3, 4, 14, 15, 13, 28, 31, 35, 11, 18]. We refer the reader to the recent articles [21, 20] in which a number of necessary and/or sufficient characterizations of subtransversality are discussed. The next characterization of subtransversality, which is the relativized version of [21, Theorem 1(iii)], will play a key role in proving the necessary condition results in Sections 4.2 and 5. This characterization is actually implied in the proof of [11, Theorem 6.2] where the property called intrinsic transversality [11, Definition 3.1] was shown to imply subtransversality.
Proposition 4.7** (characterization of subtransversality at common points).**
*The collection of sets is subtransversal at relative to if and only if there exist numbers and such that *
[TABLE]
Moreover,
[TABLE]
where is the exact lower bound of all numbers such that condition (22) is satisfied.
Remark 4.8**.**
In light of Remark 4.6 and the two inequalities in (23), a collection of sets is subtransversal at if and only if . For the simplicity in terms of presentation, in the sequel, we will frequently use this fact without repeating the argument.
Both inequalities in (23) can be strict as shown in the following example.
Example 4.9**.**
Let and be two lines in forming an angle at the intersection point . One can easily check that
[TABLE]
The connection of subtransversality to metric subregularity was presented for more general cyclic projections in [26, Proposition 3.4]. We present here the simplified version for two sets with possibly empty intersection.
Proposition 4.10** **(metric subregularity
for alternating projections).
Let and be closed nonempty sets. Let and such that and let be the affine subspace
[TABLE]
Define . Suppose the following hold:
- (a)
the collection of sets is subtransversal at for relative to with constant and neighborhood of ; 2. (b)
there exists a positive constant such that
[TABLE]
Then the mapping is metrically subregular on for [math] relative to with constant .
4.2 Necessary and sufficient conditions for local linear convergence
It was established in [13, Corollary 13(a)] that local linear regularity of the collection of sets (with a reasonably good quantitative constant as always for convergence analysis of nonconvex alternating projections) is sufficient for linear monotonicity of the method for -subregular sets. This result is updated here in light of more recent results.
Proposition 4.11** (convergence of alternating projections with nonempty intersection).**
Let be a nonempty subset of . Let be a neighborhood of such that
[TABLE]
Let be an affine subspace containing such that . Define . Let the sets and be elementally subregular at all relative to respectively for each
[TABLE]
with respective constants on the neighborhood . Suppose that the following hold:
- (a)
for each , the collection of sets is subtransversal at relative to with constant on the neighborhood ; 2. (b)
there exists a positive constant such that condition (24) holds true; 3. (c)
* for all ;* 4. (d)
, where and .
Then every sequence generated by seeded by any point is linearly monotone with respect to with constant
[TABLE]
Consequently, at least Q-linearly with rate .
Proof. In light of Proposition 4.10 and the definition of linear monotonicity, Proposition 4.11 is a specialization of [26, Theorem 3.2] to the case of two sets with nonempty intersection.
If in Proposition 4.11, then assumption c can obviously be omitted.
The next theorem shows that the converse to Proposition 4.11 holds more generally without any assumption on the elemental regularity of the individual sets. The proof of the next theorem uses the idea in the proof of [11, Theorem 6.2].
Theorem 4.12** (subtransversality is necessary for linear monotonicity of subsequences).**
Let , , and be closed subsets of , let , and let and be fixed. Suppose that for any sequence of alternating projections starting in and sufficiently close to , there exists a subsequence of the form for some that remains in and is linearly monotone with respect to with constant . Then the collection of sets is subtransversal at relative to with constant .
Proof. Let be so small that any alternating projections sequence starting in has a subsequence which is linearly monotone with respect to with constant . Take any . Let us consider any alternating projections sequence starting at and such a subsequence . On one hand,
[TABLE]
On the other hand,
[TABLE]
A combination of (26) and (27) yields
[TABLE]
Hence
[TABLE]
This yields subtransversality of at relative to and as claimed.
The next statement is an immediate consequence of Proposition 4.11 and Theorem 4.12.
Corollary 4.13** (subtransversality is necessary and sufficient for linear monotonicity).**
Let be an affine subspace and let and be closed subsets of that are elementally subregular relative to at with constant and neighborhood for all with .
Suppose that every sequence of alternating projections with the starting point sufficiently close to is contained in . All such sequences of alternating projections are linearly monotone with respect to with constant if and only if the collection of sets is subtransversal at relative to (with an adequate balance of quantitative constants).
The next technical lemma allows us formally avoid the restriction “monotone” in Theorem 4.12.
Lemma 4.14**.**
Let be a sequence generated by that converges R-linearly to with rate . Then there exists a subsequence that is linearly monotone with respect to any set with .
Proof. By definition of R-linear convergence, there is such that for all . Let be any set such that . If , i.e., , then there exists an iterate of (we choose the first one) relabeled such that
[TABLE]
Repeating this argument for in place of and so on, we extract a subsequence satisfying
[TABLE]
The proof is complete.
The above observation allows us to obtain the statement about necessary conditions for linear convergence of the alternating projections algorithm which extends Theorem 4.12. Here, the index number depending on the sequence will come into play in determining the constant of linear regularity.
Theorem 4.15** (subtransversality is necessary for linear convergence).**
Let be fixed and . Let , and be closed subsets of and let . Suppose that any alternating projections sequence starting in and sufficiently close to is contained in , converges R-linearly to a point in with rate , and the index where satisfies (28). Then the collection of sets is subtransversal at relative to with constant .
Proof. Let be so small that any alternating projections sequence starting in converges R-linearly to a point in with rate . Take any and generate an alternating projections sequence . By Lemma 4.14, there is a subsequence linearly monotone with respect to at rate . Then
[TABLE]
Let be such that and note that . By the definition of the projection and it follows that
[TABLE]
Hence
[TABLE]
This yields subtransversality of at relative to and as claimed.
The joining alternating projections sequence given by (18) often plays a role as an intermediate step in the analysis of alternating projections. As we shall see, property of linear extendability itself can also be of interest when dealing with the alternating projections algorithm, especially for nonconvex setting. This observation can be seen for example in [24, 23, 7, 29, 11].
Theorem 4.16** (subtransversality is necessary for linear extendability of subsequences).**
Let , , and be closed subsets of , let , and let and be fixed. Suppose that every alternating projections sequence starting in and sufficiently close to has a subsequence of the form for some such that the joining sequence given by (18) is a linear extension of on with frequency and rate . Then the collection of sets is subtransversal at relative to with constant .
Proof. Let be so small that any alternating projections sequence starting in has a subsequence of the described form which admits the joining sequence as a linear extension on with frequency and rate . Take any . Let us consider any alternating projections sequence starting at , the corresponding joining sequence and the subsequence . Let be the limit of as verified in Proposition 2.6.
On one hand,
[TABLE]
where the last estimate follows from the nature of alternating projections.
On the other hand,
[TABLE]
where the last estimate holds true since
[TABLE]
A combination of (29) and (30) then implies
[TABLE]
which yields subtransversality of at relative to and as claimed.
In general, subtransversality is not a sufficient condition for an alternating projections sequence to converge to a point in the intersection of the sets. For example, let us define the function by and on each interval of form ,
[TABLE]
and consider the sets: and and the point in . Then it can be verified that the collection of sets is subtransversal at while the alternating projections method gets stuck at points .
To conclude this section, we show that the property of subtransversality of the collection of sets has been imposed either explicitly or implicitly in all existing linear convergence criteria for the method of alternating projections that we are aware of. The next proposition catalogs existing linear convergence criteria for alternating projections which complement Proposition 4.11.
Proposition 4.17** (R-linear convergence of nonconvex alternating projections).**
Let and be closed and . The collection of sets is denoted . All alternating projections iterations starting sufficiently close to converge R-linearly to some point in if one of the following conditions holds.
- (i)
[24, Theorem 4.3]** and are smooth manifolds around and is transversal at . 2. (ii)
[11, Theorem 6.1]** is intrinsically transversal at . 3. (iii)
[23, Theorem 5.16]** is super-regular at and is transversal at . 4. (iv)
[6, Theorem 3.17]** is -regular at and the -qualification condition holds at . 5. (v)
[29, Theorem 2]** is [math]-Hölder regular relative to at and intersects separably at .
It can be recognized without much effort that under any item of Proposition 4.17, the sequences generated by alternating projections starting sufficiently close to are actually linearly extendible.
Proposition 4.18** (ubiquity of subtransversality in linear convergence criteria).**
Suppose than one of the conditions i–v of Proposition 4.17 is satisfied. Then for any alternating projections sequence starting sufficiently close to , the corresponding joining sequence given by (18) is a linear extension of with frequency and rate .
Proof. The statement can be observed directly from the key estimates that were used in proving the corresponding convergence criterion. In fact, all the criteria listed in Proposition 4.17 essentially were obtained from the same fundamental estimate which we named linear extendability in this paper.
Taking Theorem 4.16 into account we conclude that subtransversality of the collection of sets at is a consequence of each item listed in Proposition 4.17. This observation gives some insights about relationships between various regularity notions of collections of sets and has been formulated partly in [11, Theorem 6.2] and [21, Theorem 4]. Hence, the subtransversality property lies at the foundation of all linear convergence criteria for the method of alternating projections for both convex and nonconvex sets appearing in the literature to this point.
5 Application: alternating projections with convexity
In the convex setting, statements with sharper convergence rate estimates are possible. This is the main goal of the present section. Note that a convex set is elementally regular at all points in the set for all normal vectors with constant and neighborhood [20, Proposition 4(vii)]. We can thus, without loss of generality, remove the restriction to the subset that is omnipresent in the nonconvex setting. We also write and for the projections since the projectors are single-valued.
The next technical lemma is fundamental for the subsequent analysis.
Lemma 5.1** (non-decreasing rate).**
Let and be two closed convex sets in . We have
[TABLE]
Proof. Using the basic facts of the projection operators on a closed and convex sets, we obtain
[TABLE]
The last estimate holds true since the second term on the previous line is non-positive.
Lemma 5.1 implies that for any sequence of alternating projections for convex sets, the rate is nondecreasing when increases. This allows us to deduce the following fact about the algorithm.
Theorem 5.2** (lower bound of complexity).**
Consider the alternating projections algorithm for two closed convex sets and with a nonempty intersection. Then one of the following statements holds true.
- (i)
The alternating projections method finds a solution after one iteration. 2. (ii)
Alternating projections will not reach a solution after any finite number of iterations.
Proof. If the starting point is actually in , the proof becomes trivial. Let us consider the case that . Suppose that the alternating projections method does not find a solution after one iterate, that is, . In other words, we suppose that scenario (i) does not occur and prove the validity of scenario (ii).
In this case, it holds that as . As a result, . We also claim that . Indeed, suppose otherwise that (note that by definition of projection). Then , which implies that is a fixed point of , . This contradicts the fact that and since any alternating projections sequence for convex sets with nonempty intersection will converge to a point in the intersection [4]. Hence, we have checked that
[TABLE]
Then the following constant is well defined:
[TABLE]
Using Lemma 5.1 we get
[TABLE]
Hence
[TABLE]
Applying Lemma 5.1 consecutively, we obtain
[TABLE]
This particularly implies that for any natural number , and the proof is complete.
Remark 5.3**.**
In contrast to Theorem 5.2 for convex sets, there are simple examples of nonconvex sets such that for any given number , the alternating projections method will find a solution after exactly iterates. For instance, let us consider a geometric sequence where . For any number , one can construct the two finite sets by and . Then the alternating projections method starting at will find the unique solution after exactly iterates.
Theorem 5.4** (necessary and sufficient condition: local version).**
Let and be closed convex sets and . If the collection of sets is subtransversal at with constant , then for any number , all alternating projections sequences starting sufficiently close to are linearly monotone with respect to with rate not greater than .
Conversely, if there exists a number such that every alternating projections iteration starting sufficiently close to converges R-linearly to some point in with rate not greater than , then the collection of sets is subtransversal at with constant .
Proof. The first implication is an adaption of [13, Corollary 3.13(c)] to the terminology of this paper.
We now prove the converse implication. Let be so small that every alternating projections iteration starting in converges R-linearly to a point in with rate not greater than . Take any . Let us consider the alternating projections sequence starting at and converging R-linearly to with rate not greater than . By definition of R-linear convergence, there is a number such that
[TABLE]
Taking Theorem 5.2 into account, we consider the two possible cases as follows.
Case 1. The alternating projections method finds a solution after one iterate, . Lemma 5.1 yields
[TABLE]
This implies that and as a result,
[TABLE]
Case 2. The alternating projections do not reach a solution after any finite number of iterates. We will make use of the joining sequence given by (18). Since and are firmly nonexpansive, the sequence is Fejér monotone with respect to . Then it follows that
[TABLE]
We claim that
[TABLE]
Suppose to the contrary that there exists a natural number such that . Choose a number such that . Then applying Lemma 5.1, we get
[TABLE]
This together with (35) implies that for all natural number ,
[TABLE]
Letting , the last inequality leads to a contradiction since . Hence, (36) has been proved.
Now, using (36) and the firm nonexpansiveness of and , we obtain that
[TABLE]
A combination of (34) and (37), which respectively correspond to the two cases, implies that
[TABLE]
Hence is subtransversal at and the constant as claimed.
The next theorem provides a global version of Theorem 5.4.
Theorem 5.5** (necessary and sufficient condition: global version).**
Let and be closed convex sets with nonempty intersection. If the collection of sets is subtransversal at every point of (the boundary of) with constants bounded from above by , then for any number , every alternating projections iteration converges R-linearly to a point in with rate not greater than .
Conversely, if there exists a number such that every alternating projections sequence eventually converges R-linearly to a point in with rate not greater than , then the collection of sets is globally subtransversal with constant , that is,
[TABLE]
Proof. We prove the first implication. Let us take any point and consider the alternating projections sequence starting at . It suffices to consider only the case that the alternating projections method does not find a solution after one iterate. It is well known that converges to some point [4]. Hence, after a finite number, say , of iterates, the iterate must be sufficiently close to . Using the assumption that is subtransversal at with constant and applying Theorem 5.4, we deduce that the alternating projections sequence starting from converges R-linearly to with rate not greater than . On one hand, using (36) for the alternating projections sequence starting from and the corresponding joining sequence, we get
[TABLE]
On the other hand, applying Lemma 5.1, we get
[TABLE]
A combination of (39) and (40) yields the estimate (36). Proposition 3.6 then ensures that the joining sequence converges R-linearly to with rate not greater than . This implies that the sequence converges R-linearly to with rate not greater than as claimed.
We now prove the converse implication. Suppose that every sequence of alternating projections eventually converges R-linearly to a point in with rate not greater than . We need to verify (38). Note that the estimate (38) is trivial for . Let us take an arbitrary and consider the alternating projections sequence starting at . We consider the two possible cases as stated in Theorem 5.2.
Case 1. The alternating projections method finds a solution after one iterate. The argument for Case 1 of the proof of Theorem 5.4 yields (38).
Case 2. The alternating projections method does not find a solution after any finite number of iterates. Since eventually converges R-linearly to a point with rate not greater than , there exists a natural number and a constant such that
[TABLE]
Let us define the number
[TABLE]
Combining (41) and (42) yields
[TABLE]
The argument for Case 2 in the proof of Theorem 5.4 implies that the sequence defined at (18) satisfies
[TABLE]
From this condition, the estimate (38) is obtained by using the estimates at (37).
The proof is complete.
It is clear that Theorem 5.4 does not cover Theorem 5.5. The following example also rules out the inverse inclusion.
Example 5.6** (Theorem 5.5 does not cover Theorem 5.4).**
Consider the convex function given by
[TABLE]
In , we define two closed convex sets and and a point . Then the two equivalent properties (namely, transversality of at and local linear convergence of around ) involved in Theorem 5.4 hold true while the two global ones involved in Theorem 5.5 do not.
To establish global convergence of a fixed point iteration, one normally needs some kind of global regularity behavior of the fixed point set. In Theorem 5.5, we formally impose only subtransversality in order to deduce global R-linear convergence and vice versa. Beside the global behavior of convexity, the hidden reason behind this seemingly contradictory phenomenon is a well known fact about subtransversality of collections of convex sets. We next deduce this result from the convergence analysis above. The proof is given for completeness.
Corollary 5.7**.**
[25, Theorem 8]** Let and be closed and convex subsets of with nonempty intersection. The collection of sets is globally subtransversal, that is, there is a constant such that
[TABLE]
if and only if is subtransversal at every point in with constants bounded from above by some .
Proof. This implication is trivial with .
Note that the estimate (43) is trivial for . Let us take an arbitrary and consider the alternating projections sequence starting at . Take any number The argument in the first part of Theorem 5.5 implies that converges R-linearly to some with rate and
[TABLE]
By letting in the above inequality, we obtain (43) with .
The proof is complete.
The convergence counterpart of Corollary 5.7 can also be of interest.
Corollary 5.8**.**
Let be an alternating projections sequence for two closed convex subsets of with nonempty intersection and . If there exists a natural number such that for all , then for all .
We emphasize that the two statements in Corollary 5.8 are always equivalent (by the argument for the second part of Theorem 5.5) if the constant is not required to be the same. However, this requirement becomes important when one wants to estimate global rate of convergence via the local rate of convergence. The next statement can easily be observed as a by-product via the proof of Theorem 5.4.
Proposition 5.9** (equivalence of linear monotonicity and R-linear convergence).**
For sequences of alternating projections between convex sets, R-linear convergence and linear monotonicity of the sequence of iterates are equivalent.
The next statement can serve as a motivation for Definition 2.5.
Proposition 5.10** (Q-linear convergence implies linear extendability).**
Let be a sequence of alternating projections for two closed convex sets with nonempty intersection. If converges Q-linearly to a point with rate , then is linearly extendible with frequency and rate , and the corresponding joining sequence is such a linear extension sequence.
Before proving this, we first establish the following technical fact.
Lemma 5.11**.**
Let and be two closed convex sets in with nonempty intersection. We have
[TABLE]
Proof. [of Lemma 5.11.] Denote and . It suffices to consider the two cases as follows.
Case 1. or . This implies that , which in turn implies that . Hence, inequality (44) is satisfied.
Case 2. Both sides of (44) are strictly positive. Let be the projection of on the line (segment, equivalently since ) joining and . The elementary geometry for triangles and , respectively, yields
[TABLE]
This implies
[TABLE]
Inequality (44) now follows since, by convexity of , , and by definition of the projector,
[TABLE]
This proves Lemma 5.11.
We conclude with the proof of Proposition 5.10.
Proof. [of Proposition 5.10.] It suffices to prove that the sequence given in (18) satisfies
[TABLE]
We will prove this by way of contradiction. Suppose otherwise that there exists some such that
[TABLE]
We can assume without loss of generality. By Lemma 5.1 we get
[TABLE]
Lemma 5.11 then implies
[TABLE]
This contradicts Q-linear convergence of to with rate , and the proof is complete.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] F. J. Aragón Artacho and B. S. Mordukhovich. Enhanced metric regularity and Lipschitzian properties of variational systems. J. Global Optim. , 50(1):145–167, 2011.
- 2[2] T. Aspelmeier, C. Charitha, and D. R. Luke. Local linear convergence of the ADMM/Douglas–Rachford algorithms without strong convexity and application to statistical imaging. SIAM J. Imaging Sci. , 9(2):842–868, 2016.
- 3[3] H. H. Bauschke and J. M. Borwein. On the convergence of von Neumann’s alternating projection algorithm for two sets. Set-Valued Anal. , 1(2):185–212, 1993.
- 4[4] H. H. Bauschke and J. M. Borwein. On projection algorithms for solving convex feasibility problems. SIAM Rev. , 38(3):367–426, 1996.
- 5[5] H. H. Bauschke and P. L. Combettes. Convex Analysis and Monotone Operator Theory in Hilbert Spaces . CMS Books Math./Ouvrages Math. SMC. Springer, New York, 2011.
- 6[6] H. H. Bauschke, D. R. Luke, H. M. Phan, and X. Wang. Restricted Normal Cones and the Method of Alternating Projections: Applications. Set-Valued and Var. Anal. , 21:475–501, 2013.
- 7[7] H. H. Bauschke, D. R. Luke, H. M. Phan, and X. Wang. Restricted Normal Cones and the Method of Alternating Projections: Theory. Set-Valued and Var. Anal. , 21:431–473, 2013.
- 8[8] J. Bolte, A. Daniilidis, O. Ley, and L. Mazet. Characterizations of Łojasiewicz inequalities: subgradient flows, talweg, convexity. Trans. Am. Math. Soc. 362(6):3319–3363 (2010).
