Distributed Algorithms for Computing a Common Fixed Point of a Group of Nonexpansive Operators
Xiuxian Li, Gang Feng

TL;DR
This paper introduces distributed algorithms for multi-agent networks to find common fixed points of nonexpansive operators, ensuring convergence under various regularity conditions and reducing computational load with block-coordinate updates.
Contribution
It proposes the D-IKM and D-IBKM algorithms for distributed fixed point computation, extending Krasnosel'skid1-Mann iteration to networked, multi-agent settings with convergence guarantees.
Findings
D-IKM converges weakly to a common fixed point.
D-IBKM reduces computational complexity while maintaining convergence.
Convergence rates are established under linear regularity assumptions.
Abstract
This paper addresses the problem of seeking a common fixed point for a collection of nonexpansive operators over time-varying multi-agent networks in real Hilbert spaces, where each operator is only privately and approximately known to each individual agent, and all agents need to cooperate to solve this problem by propagating their own information to their neighbors through local communications over time-varying networks. To handle this problem, inspired by the centralized inexact Krasnosel'ski\u{\i}-Mann (IKM) iteration, we propose a distributed algorithm, called distributed inexact Krasnosel'ski\u{\i}-Mann (D-IKM) iteration. It is shown that the D-IKM iteration can converge weakly to a common fixed point of the family of nonexpansive operators. Moreover, under the assumption that all operators and their own fixed point sets are (boundedly) linearly regular, it is proved that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed Control Multi-Agent Systems · Optimization and Variational Analysis · Advanced Optimization Algorithms Research
Distributed Algorithms for Computing a Common Fixed Point of a Group of Nonexpansive Operators
††thanks:
Xiuxian Li and Gang Feng X. Li and G. Feng are with Department of Biomedical Engineering, City University of Hong Kong, Kowloon, Hong Kong (e-mail: [email protected]; [email protected]).
Abstract
This paper addresses the problem of seeking a common fixed point for a collection of nonexpansive operators over time-varying multi-agent networks in real Hilbert spaces, where each operator is only privately and approximately known to each individual agent, and all agents need to cooperate to solve this problem by propagating their own information to their neighbors through local communications over time-varying networks. To handle this problem, inspired by the centralized inexact Krasnosel’skiĭ-Mann (IKM) iteration, we propose a distributed algorithm, called distributed inexact Krasnosel’skiĭ-Mann (D-IKM) iteration. It is shown that the D-IKM iteration can converge weakly to a common fixed point of the family of nonexpansive operators. Moreover, under the assumption that all operators and their own fixed point sets are (boundedly) linearly regular, it is proved that the D-IKM iteration converges with a rate for some constant , where is the iteration number. To reduce computational complexity and burden of storage and transmission, a scenario, where only a random part of coordinates for each agent is updated at each iteration, is further considered, and a corresponding algorithm, named distributed inexact block-coordinate Krasnosel’skiĭ-Mann (D-IBKM) iteration, is developed. The algorithm is proved to be weakly convergent to a common fixed point of the group of considered operators, and, with the extra assumption of (bounded) linear regularity, it is convergent with a rate . Furthermore, it is shown that the convergence rate can still be guaranteed under a more relaxed (bounded) power regularity condition.
Index Terms:
Distributed algorithms, multi-agent networks, Krasnosel’skiĭ-Mann iteration, nonexpansive operators, fixed point, optimization.
I Introduction
Fixed point theory in Hilbert spaces finds numerous applications in nonlinear numerical analysis and optimization [1, 2], which, roughly speaking, provides a unified mathematical framework for such kinds of problems. As such, a large volume of literature on the topic has emerged, including the investigation of fixed point theory itself and its applications [3, 4, 5, 6, 7, 8, 9, 10].
Although fruitful results have been reported on fixed point theory [1], most of them are on centralized problems, that is, there is a global computing unit or coordinator who is aware of all the operators’ information. Compared with centralized problems, distributed ones enjoy overwhelming advantages, such as, lower cost, higher robust to failures, and less storage, and so on [11]. Along this line, recently, a distributed problem for finding a common fixed point of a group of paracontraction operators was studied in [12, 13], which is motivated by a typical problem, that is, solving a linear algebraic equation in the Euclidean space in a distributed manner, where a multiple of agents hold private partial information on the linear equation and thus all agents need to cooperatively solve the problem through local communications [14, 15, 16, 17, 18]. Meanwhile, the case with strongly quasi-nonexpansive operators was reported in [19]. It is worthwhile to note that the aforesaid works have focused on the Euclidean space with exact knowledge of operators.
This paper aims to develop distributed algorithms for a collection of autonomous agents to seek a common fixed point of nonexpansive operators or mappings, which are privately held by individual agents, in real Hilbert spaces. Note that nonexpansive operators are more general than the operators considered in [12, 13, 19], and in fact they include the paracontraction operators and strongly quasi-nonexpansive operators as special cases. It is also noted that the nonexpansive operators include some celebrated operators, such as, projections, the proximal map, and the gradient descent map , where is a differentiable and convex function, is the gradient of , being Lipschitz with constant , and the constant satisfies .
On the other hand, it is well known that the classical Krasnosel’skiĭ-Mann (KM) iteration is a quintessential algorithm to find a fixed point for a nonexpansive operator [20, 21, 22, 23, 24, 25, 26]. Note that Picard iteration does not converge in general for a nonexpansive operator. The KM iteration is firstly proposed in [27, 28], which have so far received tremendous attention [20, 21, 22, 23, 24, 25, 26]. Moreover, the KM iteration provides a unified framework for analysis of various algorithms, such as Proximal point algorithms (PPA) [29], forward-backward splitting method (FBS) [30], Peaceman-Rachford splitting (PRS) [31], Douglas-Rachford splitting (DRS) [32, 33], alternating direction method of multipliers (ADMM) [34], and a three-operator splitting [35]. It is shown that the KM iteration converges weakly to a fixed point of a nonexpansive operator under mild conditions [36].
With the above observations, this paper aims at developing distributed algorithms, by extending the KM iteration to the distributed scenario, for a family of autonomous agents to seek a common fixed point of a group of nonexpansive operators in real Hilbert spaces, where each operator is privately and approximately known by individual agent. In summary, the contributions of this paper can be summarized as follows.
An algorithm, called distributed inexact Krasnosel’skiĭ-Mann (D-IKM) iteration, is proposed, which, under some mild conditions, is shown to be weakly convergent to a common fixed point of the concerned nonexpansive operators. Moreover, a preliminary result on the convergence rate is provided, that is, there exists a subsequence of the sequence generated by the D-IKM iteration such that the subsequence converges at a rate , where is the iteration number. Compared with those most related works [12, 13, 19], all of which focus on Euclidean spaces with exact knowledge of operators and do not analyze the convergence rate, this paper considers more general spaces, i.e., real Hilbert spaces, with only approximate knowledge of operators, and also presents a result on the convergence speed. 2. 2.
To reduce computational complexity and burden of storage and transmission, another algorithm, named distributed inexact block-coordinate Krasnosel’skiĭ-Mann (D-IBKM) iteration, is developed, where only a part of coordinate is updated at each iteration for each agent. Under mild conditions, it is proved that the D-IBKM iteration converges weakly to a common fixed point of the considered operators and the similar convergence rate as in the case 1) can also be established. 3. 3.
Under an assumption of the (bounded) linear regularity for all operators and their fixed point sets, a convergence of for the two proposed algorithms can be established, where is a constant and is the iteration index. Furthermore, it is shown that the same convergence rate can be maintained with a more relaxed assumption of (bounded) power regularity for the considered operators.
The remainder of this paper is organized as follows. Section II provides some preliminary knowledge and the problem formulation, and the D-IKM iteration is developed in Section III along with its convergence rate. Subsequently, in Section IV, the D-IBKM iteration is presented along with its convergence results. The proofs of main results in last two sections are provided in Section V. Finally, Section VI concludes this paper and discusses the direction of future research.
II Preliminaries and Problem Statement
This section provides some notations, preliminary concepts, and the problem formation.
Notations: Let be a real Hilbert space with inner product and associated norm . For an integer , let , , , and represent the sets of real numbers, -dimensional real vectors, real matrices, and nonnegative integers, respectively. Let be the index set with an integer , and be the stacked column vector of . Denote by the projection of a point onto a closed and convex set , i.e., . Moreover, denote by the identity matrix of compatible dimension, the identity operator or mapping, and the Kronecker product. Let be the distance from a point to the set , i.e., . Let and be, respectively, the largest integer less than or equal to and the smallest integer greater than or equal to real number . For an operator or mapping , denote by the set of fixed points of , i.e., . Let and denote weak and strong convergence, respectively. The closed ball with center and radius is denoted by .
To proceed, let us review some fundamental concepts in operator theory [1].
Let be a nonempty subset of , and let be an operator or mapping. Then is called nonexpansive if for all
[TABLE]
called -averaged for if it can be written as
[TABLE]
for some nonexpansive operator , called firmly nonexpansive if for all
[TABLE]
called quasi-nonexpansive (QNE) if for any and any
[TABLE]
and called -strongly quasi-nonexpansive (-SQNE) for if for all and all
[TABLE]
It is known that the set is closed and convex if is QNE [37].
We are now ready to formulate the problem considered in this paper. Specifically, the goal is for a group of autonomous agents to find a common point in real Hilbert space such that
[TABLE]
where is a nonexpansive operator for all . In this problem, no global coordinator, which can access all the information of ’s, is assumed to exist. Instead, is assumed to be approximately and locally accessible to agent in the sense that agent can receive the approximate information for any point , where is an error. This is more reasonable since the precise value of is usually hard or expensive to obtain, for instance, the exact gradient of a function. The objective of this paper is to develop a distributed algorithm to solve the problem (6) under the aforementioned scenario. One possible way to solve the problem is to generalize the classical centralized KM iteration to the distributed case. In doing so, it is helpful to briefly introduce the KM iteration.
For a nonexpansive operator , a well-known method for finding a fixed point of is the so-called inexact KM iteration [21, 25], that is,
[TABLE]
where is the error of approximating , and is a sequence of relaxation parameters. When for all , (7) reduces to the classical KM iteration [20, 22]. It has been shown that the (inexact) KM iteration converges weakly to a fixed point of under mild conditions [36, 21, 25], for example, when for the KM iteration [36].
Now, let us introduce the graph theory for describing the communication pattern among all agents [11]. Specifically, the communication mode among agents can be modeled by a digraph , where is the node or vertex set, and is the edge set. An edge means that agent is capable of transmitting its information to agent , in which case agent is called a neighbor of agent . A directed path from to is a sequence of edges of the form . A graph is called strongly connected if there exists at least a directed path from any node to any other node in this graph. In this paper, the communication graph for all agents is assumed to be time-varying, that is, any two agents can have different communication status at different time steps. In this case, the graph is denoted as , where indicates the time index. The union of graphs is defined as . At each time , there exists an adjacency matrix such that if , and otherwise. Assume that for all and all . For communication graphs, we have the following standard assumptions.
Assumption 1** (Graph Connectivity and Weights Rule).**
The time-varying graphs are uniformly jointly strongly connected, that is, there exists an integer such that the graph union is strongly connected for all . 2. 2.
For all , is row-stochastic, i.e., for all , and there exists a constant such that whenever .
To end this section, it is convenient for us to list a useful lemma.
Lemma 1** ([38]).**
Let Assumption 1 hold and define for with the convention . Then, for any , there exists a vector such that and the following statements hold.
* for all and , where and are some constants, and is the -th entry of .* 2. 2.
There exists a constant such that for all and all . 3. 3.
.
III The D-IKM Iteration
This section aims to solve problem (6) by developing a distributed algorithm, called distributed inexact KM (D-IKM) iteration.
Motivated by the inexact KM iteration given in (7), the D-IKM iteration is proposed as follows
[TABLE]
where
[TABLE]
represents the aggregate information received from its neighbors at time step , is an estimate of a common fixed point of ’s by agent at time instant , is an error of approximating by agent , and is a sequence of relaxation parameters for agent , which is assumed to satisfy
[TABLE]
for some constant and for all .
For the ease of exposition, let us denote by the set of summable sequences in , the set of fixed points of , the set of common fixed points of all ’s which is assumed to be nonempty, and
[TABLE]
We are now ready to present the first main result as follows.
Theorem 1**.**
For the D-IKM iteration (8) with for all , under Assumption 1, the following two statements hold:
All ’s are bounded and converge weakly to a common point in ; and 2. 2.
There exists a subsequence , such that
[TABLE]
Proof.
The proof is given in Section V-A. ∎
Remark 1**.**
It is worth pointing out that it is in general standard to leverage as a measure of the convergence speed for the centralized (inexact) KM iteration, since amounts to , see [24, 25, 20, 23, 21, 22]. This is why is employed for measuring the convergence rate of the D-IKM iteration, as shown in (12). However, it is noted that the result in (12) is described by a subsequence instead of , since the D-IKM iteration involves communications over a multi-agent network unlike the case of the centralized KM iteration. It is still open whether one can obtain the result as in the case of the centralized KM iteration [22].
In what follows, the convergence rate of the D-IKM iteration is further discussed under some extra assumptions. It was shown in [39, 9] that the centralized KM iteration is linearly convergent under the (bounded) linear regularity assumption, which is referred to as a sufficient condition for the linear convergence of averaged nonexpansive operators. It is thus natural for us to ask if the linear convergence can still be maintained for the distributed case, i.e., the D-IKM iteration, under the same assumption. To proceed, let us first review the concept of (bounded) linear regularity.
Definition 1** ([39]).**
Let be a nonempty subset of , be an operator with , and be a finite collection of closed convex subsets of with , where is a finite index set. It is said that:
* is linearly regular with constant if for all *
[TABLE] 2. 2.
* is boundedly linearly regular if for any bounded set , there exists such that for all *
[TABLE] 3. 3.
* is linearly regular with constant if for all .* 4. 4.
* is boundedly linearly regular if for any bounded set , there exists such that for all .*
One example for linearly regular operators is the projection operator on a closed convex set , as it is easy to verify that . The above notions have been thoroughly investigated in [39, 9]. For instance, suppose that , then is boundedly linearly regular if , where denotes the set of interior points of set . Please refer to [39, 9] for more details and [40] for another relevant notion, i.e., metric (sub-)regularity for set-valued mappings.
To proceed, the assumption of the bounded linear regularity is explicitly given below.
Assumption 2**.**
* is boundedly linearly regular for each , and the sets are boundedly linearly regular.*
In view of Theorem 1, it is known that all ’s are bounded, say for a constant and for all , which leads to that there exist constants and such that for all
[TABLE]
if Assumption 2 holds.
With the above preparations, we are now in a position to give the result on the D-IKM iteration’s stronger convergence.
Theorem 2**.**
Under Assumptions 1 and 2, all ’s in the D-IKM iteration (8) converge strongly to a common point in , if there holds
[TABLE]
where
[TABLE]
, and . Moreover, in the absence of the approximate errors (called D-KM iteration for (8) in this case), i.e., for all and , all ’s converge to a common point in at a rate of under condition (17), where is given in Lemma 1.
Proof.
The proof is given in Section V-A. ∎
Remark 2**.**
From the above theorem, it can be obviously seen that the D-KM iteration enjoys the convergence rate , i.e., , which is slower than the linear convergence rate, i.e., . The slower convergence rate for the D-KM iteration can be attributed to local communications among agents, since all agents need to exchange their own information to their neighbors for achieving the synchronization of ’s for all . In this regard, it is unknown whether or not the linear convergence rate can be achieved for the D-KM iteration under the same assumptions or the bounded power regularity introduced later in Definition 2, which is left as our future work.
As a matter of fact, the convergence rate for the D-KM iteration can still be ensured under another relaxed assumption. Specifically, we introduce a novel concept of bounded power regularity for a family of operators.
Definition 2**.**
Let be a nonempty subset of , and let be an operator for each , along with . It is said that:
* are power regular with constant if for all *
[TABLE] 2. 2.
* are boundedly power regular if for any bounded set , there exists such that for all *
[TABLE]
In the sequel, it is shown that (bounded) power regularity for a set of operators can be implied by (bounded) linear regularities of each operator and their fixed point sets.
Proposition 1**.**
For a finite family of operators , , along with , if is (boundedly) linearly regular for each and meanwhile the sets are (boundedly) linearly regular, then are (boundedly) power regular.
Proof.
Let us first focus on linear regularity. With reference to the conditions in this proposition, there exist constants and such that for all
[TABLE]
which implies that
[TABLE]
where . As a result, one can obtain that , which thereby implies the power regularity for the set of ’s according to Definition 2. Furthermore, the case with bounded power regularity can be similarly proved. ∎
It can be seen from Proposition 1 that (bounded) power regularity in Definition 2 is more relaxed than the notion of (bounded) linear regularity in Definition 1. In fact, (bounded) power regularity in Definition 2 is strictly looser than (bounded) linear regularity in Definition 1, which can be illustrated by the following example.
Example 1**.**
Let and in Definitions 1 and 2, and consider two operators as and for , where . Then, it is easy to see that , , and hence . It is also straightforward to obtain that , for , thus leading to that is not linearly regular since as . But one can easily check that there holds for all , which indicates that are power regular with constant .
It is also noteworthy that notions in Definition 2 can be regarded as a generalization of (bounded) linear regularity for a single operator to multiple operators. Then, instead of Assumption 2, the following less restrictive assumption can be made.
Assumption 3**.**
* is boundedly power regular.*
With this assumption, one can obtain the following result.
Theorem 3**.**
Let Assumptions 1 and 3 hold. Then all ’s in the D-KM iteration, i.e., in (8) for all and , converge to a common point in with a rate , if condition (17) holds, where is given in Lemma 1.
Proof.
The proof is given in Section V-A. ∎
As seen from Theorem 3, the convergence rate is proportional to , like a power function of , which is the reason for calling the “power” regularity in Definition 2.
IV The D-IBKM Iteration
The focus of this section is on randomly updating a part of the coordinate for each agent, instead of the entire coordinate, in order to reduce the computational complexity and the burden of storage and transmission, especially for the case with large-scale coordinates and large-scale network, as investigated for centralized algorithms [41, 42, 43].
To begin with, it is convenient to introduce some notations employed in this section.
Notations: is the direct Hilbert sum with Borel -algebra , where is a separable real Hilbert space, along with the same inner product and associated norm . A -valued random variable is a measurable map with the standing probability space , endowed with the expectation , where a measurable (or -measurable) map means that there holds for every set . Let denote a generic vector in , and let denote the -algebra generated by the collection of random variables. Denote by a filtration, i.e., each is a sub-sigma algebra of such that for all . Let be the set of -valued random variable sequence adapted to , i.e., is -measurable for all , and define . Throughout this section, all inequalities and equalities are understood to hold -almost surely whenever in the presence of random variables, even though “-almost surely” is not explicitly expressed. For brevity, we abbreviate “-almost surely” as “a.s.” subsequently.
Consider now problem (6). In this case, is nonexpansive with being measurable for all and . To solve this problem, a block-coordinate based distributed algorithm, called distributed inexact block-coordinate KM (D-IBKM) iteration, is proposed as follows,
[TABLE]
for and , where serves as an estimate of a solution to problem (6) for agent at time , for all , is an aggregate information of agent received from its neighbors at time slot , is a sequence of identically distributed -valued random variables with , , is a -valued random variable, viewed as the error of approximating , and is a sequence of relaxation parameters, satisfying for a constant . Wherein, let be a -valued random variable for all .
To proceed, set with , and let for and , for which it is assumed that is independent of and for . Also, define . In the meantime, assume that for all and , meaning that every block-coordinate has a chance to update.
Regarding iteration (21), it can be equivalently written as
[TABLE]
where , and
[TABLE]
After setting and , (22) can be compactly written as
[TABLE]
Similarly to Section III, denote by the set of fixed points of , and the set of common fixed points of all ’s which is assumed to be nonempty. It is also necessary to define a new norm with associated inner product on as in [43]
[TABLE]
It is noted that , meaning that the two norms are equivalent, where .
Equipped with the above preparations, we are now ready to present the main result of this section.
Theorem 4**.**
For the D-IBKM iteration (21) under Assumption 1 and the assumption that for all , the following two statements hold:
All ’s are bounded and converge weakly, in the space , to a common point in a.s.; 2. 2.
There exists a subsequence , such that for all
[TABLE]
Proof.
The proof is given in Section V-B. ∎
Remark 3**.**
It should be noted that when there is only one agent in a multi-agent network, the results in Theorem 4 reduces to the centralized case [43]. However, the analysis for the distributed iteration here is more complicated than that for the centralized scenario, rendering the argument in [43] not directly applicable here. In addition, the convergence rate is not investigated in [43], while the convergence speed is provided here, see also Theorem 5 below.
To further investigate the convergence rate of D-IBKM in (21), let us recall Definition 2 for the bounded power regularity of a family of operators. It is known from Theorem 4 that all ’s are bounded, connoting that there is a constant such that for all . As a consequence, under Assumption 3, there must exist a constant such that for all
[TABLE]
Now, the stronger convergence of D-IBKM in (21) can be given as follows.
Theorem 5**.**
Under Assumptions 1 and 3 for the D-IBKM iteration (21), for all a.s., if there holds
[TABLE]
where , , , , and is given in Lemma 1. Moreover, in the absence of errors (called D-BKM iteration for (21) in this case), i.e., for all and , converges to zero with a rate a.s. under condition (28).
Proof.
The proof is given in Section V-B. ∎
V Convergence Analysis: Proofs of Theorems 1-5
This section aims to provide detailed convergence analysis for the main results in the last two sections, that is, the proofs of Theorems 1-5.
V-A Proofs of Theorems 1-3
Let us first introduce several lemmas for the subsequent use.
Lemma 2** ([44]).**
Let be a sequence of nonnegative scalars such that for all
[TABLE]
where , and for all with and . Then, the sequence converges to some and .
Lemma 3**.**
Consider and let be a linear operator in real Hilbert space , then , where is the largest entry of the matrix in the modulus sense.
Proof.
For arbitrary with and , it can be concluded that
[TABLE]
where the last inequality has used the fact that . Consequently, one can obtain that , as claimed. ∎
Lemma 4**.**
Let be a nonexpansive operator with . Then, there holds for all and .
Proof.
For any and , it is easy to deduce that
[TABLE]
where the inequality has exploited the nonexpansive property of . ∎
The following result is a fundamental result which relates to via for each agent .
Lemma 5**.**
Consider the D-IKM iteration (8). For all , there holds
[TABLE]
Proof.
In view of (8), it can be asserted that for
[TABLE]
where the second inequality has made use of the nonexpansive property of , and the last inequality has utilized the iteration (8). ∎
With the above lemmas at hand, we are now ready to prove Theorems 1-3.
Proof of Theorem 1: Invoking (11), the iteration (8) can be rewritten as
[TABLE]
Then, for any , which must satisfy for all and , it can be obtained from (29) that for all
[TABLE]
where the second inequality follows from the nonexpansive property of because is nonexpansive, and the last inequality is due to the convexity of and for all , see Assumption 1.
Multiplying on both sides of (30) and summing over yield that
[TABLE]
where we have employed in Lemma 1. Note that , , and . Applying Lemma 2 results in that is bounded and thus so is for all and because of by Lemma 1.
Subsequently, let us denote
[TABLE]
Then, in view of (8), one can obtain that
[TABLE]
which, together with Lemma 4 and the convexity of the norm , implies that
[TABLE]
By multiplying on both sides of (34) and summing over , it can be concluded that
[TABLE]
where we have resorted to the facts that , , and for all and .
Now, summing (35) over gives rise to
[TABLE]
which, together with , yields that
[TABLE]
further leading to
[TABLE]
With the above preparations, we are ready to prove that ’s will reach agreement for all agents . To see this, the iteration (8) can be written in a compact form
[TABLE]
where , , and .
Invoking (38) and because of , one readily obtains that . With reference to (39), by applying the same arguments as that of Lemmas 3 and 4 in [38] and using Lemma 3, one has that
[TABLE]
where is viewed as a weighted average of ’s.
We next show the weak convergence of (8). Bearing in mind that , it can be obtained by (35) and Lemma 2 that converges. In the meantime, one has that
[TABLE]
which yields that
[TABLE]
since converges, (see (40)), and by Cauchy-Schwarz inequality.
On the other hand, by resorting to Lemma 5 and (38) along with for all , one has that
[TABLE]
which, in tandem with (40), gives rise to that for all
[TABLE]
where the nonexpansiveness of is employed in the second inequality.
Now, for arbitrary sequential cluster point of , i.e., , in view of (43), invoking Corollary 4.28 in [1] yields that for all , i.e., , Then, in light of Lemma 2.47 in [1] and (41), it can be asserted that converges weakly to a point in , say .
Consequently, the weak convergence of ’s to a common point in can be ensured once noting the fact that for all and all
[TABLE]
where the inequality has employed Cauchy-Schwarz inequality.
It remains to show the convergence rate (12). Let us prove it by contradiction. If there are no subsequences such that (12) holds, then there must exist , , and , such that for all
[TABLE]
On the other hand, in view of Lemma 5, it follows that
[TABLE]
by using for scalars , which in combination with (45) results in that
[TABLE]
where the last inequality has made use of (45). It is apparent that (47) contradicts (37). Therefore, one can claim that (12) holds. This ends the proof of Theorem 1.
Proof of Theorem 2: Define , and let
[TABLE]
Invoking (8), it can be concluded that
[TABLE]
where the second inequality has adopted the same reasoning as in (33) and (34). Substituting (46) in (50), one has that
[TABLE]
where we have utilized the fact that for .
By multiplying on both sides of (51) and summing over , one has that
[TABLE]
where the facts that and in Lemma 1 have been utilized.
To proceed, it is helpful to establish a relationship between and . Specifically, it can be deduced that
[TABLE]
where the last inequality has used the fact that
[TABLE]
for two scalars . Moreover, it is easy to get that
[TABLE]
where the last inequality has leveraged the nonexpansive property of . Now, inserting (55) into (53) yields that
[TABLE]
At this point, turning our attention back to (52), invoking (56) leads to that
[TABLE]
Consider the term in (57), in light of (15), one can obtain that
[TABLE]
where the last inequality is due to (16). Consider further the term in (58), one has that
[TABLE]
where the last inequality has leveraged (54). Summing over for (59) yields that
[TABLE]
Substituting (58) and (60) into (57) gives rise to that
[TABLE]
where
[TABLE]
For notation simplicity, let
[TABLE]
Then, (61) can be written as
[TABLE]
Consider now the term . Recalling , one can conclude that
[TABLE]
where the third inequality has employed the fact that and is nonexpansive, and the last inequality has used the convexity of . Subsequently, by multiplying on both sides of (66) and summing over , it follows that
[TABLE]
where in Lemma 1 has been applied in the inequality. Combining (67) with the fact that implies that
[TABLE]
Bearing in mind the definition of in (39), it follows from (68) that
[TABLE]
where . In view of (69), following the same arguments as that of Lemmas 3 and 4 in [38] for (39), one can conclude that
[TABLE]
which further gives rise to
[TABLE]
where (69) has been utilized in the second inequality.
Inserting (71) into (65), it can be then obtained that
[TABLE]
where is defined in (18) and
[TABLE]
Because of , it follows from (72) that
[TABLE]
further implying that
[TABLE]
where
[TABLE]
It is easy to verify that under condition (17). Note that there exists such that
[TABLE]
Then, by iteratively applying (75), one can conclude that
[TABLE]
where
[TABLE]
with and .
Meanwhile, it can be obtained that
[TABLE]
which, together with (78), yields that
[TABLE]
It is easy to see that as since so is due to for all , and thus as . Moreover, it can be obtained from (77) that
[TABLE]
which further implies that
[TABLE]
On the other hand, invoking (79) yields that
[TABLE]
which, together with (83), leads to
[TABLE]
By combining (81)-(84), one can conclude that and thus for all (see (71)) converge strongly to the origin, and converge at a rate of , i.e., , when for all and .
Finally, let
[TABLE]
Then, applying the convexity of , it can be concluded that . Meanwhile, note that and for all . Combining the above analysis completes the proof.
Proof of Theorem 3: By Theorem 1, it is known that all ’s are bounded. Therefore, according to the bounded power regularity of , one has that there exists a constant such that
[TABLE]
which leads to that
[TABLE]
Note that (87) is consistent with (58) with different coefficients. Hence, following the same argument as that of Theorem 2, the conclusions of this theorem can be asserted. The proof is thus completed.
V-B Proofs of Theorems 4 and 5
Let us first introduce several lemmas.
Lemma 6** ([45]).**
Let be a filtration. If , , , and satisfy the following inequality a.s.:
[TABLE]
then, and converges to a -valued random variable a.s.
Lemma 7** ([1]).**
Let , and let . Then
[TABLE]
The relationship between and is revealed through in the following lemma.
Lemma 8**.**
Consider the D-IBKM iteration (21). For all , there holds
[TABLE]
Proof.
It follows that
[TABLE]
where (24) has been employed in the equality and last inequality, and the nonexpansiveness of deduces the second inequality.
To proceed, let us analyze and . In doing so, for all , , and , define
[TABLE]
for . It is easy to see that is -measurable since is so.
As a result, one can obtain that for all
[TABLE]
where the third equality has used the definition (23). Similarly, one has that
[TABLE]
Now, by squaring (89), taking the conditional expectation, and inserting (91) and (92) into it, one obtains that
[TABLE]
which is as claimed. ∎
With the above lemmas at hand, we are now ready to prove Theorems 4 and 5 as follows.
Proof of Theorem 4: Throughout this subsection, let . Similar to (91), one can obtain that for all
[TABLE]
where the first equality has used the definition (23), and the nonexpansive property of has been applied to get the inequality.
Invoking Jensen’s inequality and (93), it can be concluded that
[TABLE]
Then, in view of (24), one has that
[TABLE]
Taking the conditional expectation on the above inequality yields that
[TABLE]
where the second inequality has exploited (94) and the last inequality has applied the convexity of norm . By multiplying on both sides of the above inequality and summing over , it is easy to obtain that
[TABLE]
where we have employed and in Lemma 1.
By the assumption in Theorem 4, it is straightforward to verify that
[TABLE]
Now, applying Lemma 6 to (95), one can readily obtain that and thereby ’s are bounded a.s.
Since is bounded a.s., there exists such that for all
[TABLE]
Then, it follows that
[TABLE]
where Lemma 7 has been utilized to get the last equality. Taking the conditional expectation on the above inequality, one has that
[TABLE]
where the last inequality has made use of (93) and the convexity of norm . Then, by multiplying on both sides of the above inequality and summing over , one can obtain that
[TABLE]
where Lemma 1 has been applied. Recalling (96) and in light of Lemma 6, one has that
[TABLE]
yielding that
[TABLE]
which, by the law of total expectation, gives rise to
[TABLE]
Consider the iteration (24). It can be written in a compact form
[TABLE]
where , , and for . In view of (96) and (100), it follows that and thus . Then, using the same arguments as that of Lemmas 3 and 4 in [38] and applying Lemma 3, one has that for all
[TABLE]
where is viewed as a weighted average of ’s. By resorting to Markov’s inequality, for arbitrary small , it can be claimed that
[TABLE]
which, together with (102), implies that
[TABLE]
Now, combining (92) with (99) leads to that
[TABLE]
further yielding, by the norm equivalence, that
[TABLE]
Further, one can have that for all
[TABLE]
where we have exploited for and the nonexpansive property of in the first and second inequalities, respectively. Meanwhile, by the convexity of norm, it follows that
[TABLE]
which, together with (103), results in
[TABLE]
Combining (104), (105), and (106) gives rise to that for all
[TABLE]
Finally, following the same reasoning as that between (43) and (44), the a.s. weak convergence of ’s to a common point in in Theorem 4 can be concluded.
It remains to prove the convergence result in (26). This can be similarly done as that of (12) using (98), (92), Lemma 8, (96), and the law of total expectation. This ends the proof.
Proof of Theorem 5: Let us denote by the distance from a vector to a set in space . Let and there exists such that for all
[TABLE]
due to the boundedness of ’s.
Then, in light of (24), it can be derived that
[TABLE]
where the last equality has invoked Lemma 7. Next, as similarly done for (93), it can obtain that
[TABLE]
Consequently, by multiplying on both sides of (108), summing over , using the convexity of , and taking the conditional expectation along with (91), (92), (109) and , one can get that
[TABLE]
where the parameters and the existence of is guaranteed by the boundedness of ’s, with a.s. for all . Using (56) in (110), one can obtain that
[TABLE]
In view of (27), it can be derived that , which, together with (60) and (111), results in
[TABLE]
where
[TABLE]
It is easy to verify that (71) still holds in the expectation sense. Thus, by taking the expectation on both sides of (112), one has that
[TABLE]
where and
[TABLE]
Therefore, letting with under (28), it can be concluded that
[TABLE]
In the end, invoking the similar argument for (75), the conclusions of this theorem can be established. This ends the proof.
VI Conclusion
This paper has investigated the problem of seeking a common fixed point for a family of nonexpansive operators over a time-varying multi-agent network in real Hilbert spaces, where each operator is only privately and approximately known by individual agent. In order to deal with the problem, a distributed algorithm, called D-IKM iteration, has been developed, which is shown to be weakly convergent to a common fixed point of the collection of operators, and furthermore, convergent with the rate under the (bounded) linear regularity assumption. To further make this algorithm more implementable in practice, another scenario, where only a random part of coordinate (instead of the entire coordinate) is activated and updated for each agent at each iteration, has been studied. Another distributed algorithm, called D-IBKM, has been accordingly proposed along with the convergence analysis similar to the D-IKM iteration case, but in the sense of almost surely. In addition, a novel concept, i.e., bounded power regularity for a family of operators, has been introduced, which is more relaxed than the counterparts for an operator and a family of sets. It is shown that the convergence rate can still be ensured under the assumption of the new concept. Regarding future work, it is interesting to consider the asynchronous case, i.e., all agents have their own local clocks, and to further study the convergence rate under the (bounded) power regularity.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] H. H. Bauschke and P. L. Combettes, Convex Analysis and Monotone Operator Theory in Hilbert Spaces, 2nd ed . Springer, New York, 2017.
- 2[2] A. Cegielski, Iterative Methods for Fixed Point Problems in Hilbert Spaces . Springer, Heidelberg, 2012, vol. 2057.
- 3[3] J. Eckstein and D. P. Bertsekas, “On the Douglas-Rachford splitting method and the proximal point algorithm for maximal monotone operators,” Mathematical Programming , vol. 55, no. 1-3, pp. 293–318, 1992.
- 4[4] H. Attouch, L. M. Briceno-Arias, and P. L. Combettes, “A parallel splitting method for coupled monotone inclusions,” SIAM Journal on Control and Optimization , vol. 48, no. 5, pp. 3246–3270, 2010.
- 5[5] H. Iiduka, “Convergence analysis of iterative methods for nonsmooth convex optimization over fixed point sets of quasi-nonexpansive mappings,” Mathematical Programming , vol. 159, no. 1-2, pp. 509–538, 2016.
- 6[6] J. M. Borwein, G. Li, and M. K. Tam, “Convergence rate analysis for averaged fixed point iterations in common fixed point problems,” SIAM Journal on Optimization , vol. 27, no. 1, pp. 1–33, 2017.
- 7[7] W. B. Haskell and R. Jain, “A random monotone operator framework for strongly convex stochastic optimization,” in Proceedings of IEEE 56th Conference on Decision and Control , Melbourne, Australia, 2017, pp. 3763–3768.
- 8[8] P. Yi and L. Pavel, “A distributed primal-dual algorithm for computation of generalized Nash equilibria via operator splitting methods,” in Proceedings of 56th Conference on Decision and Control , Melbourne, Australia, 2017, pp. 3841–3846.
