Finite rate distributed weight-balancing and average consensus over digraphs
Chang-Shen Lee, Nicol\`o Michelusi, Gesualdo Scutari

TL;DR
This paper introduces a novel distributed algorithm for weight-balancing and average consensus over directed graphs, using finite rate simplex communications, with proven convergence and sublinear rate, validated by numerical experiments.
Contribution
It presents the first finite rate distributed weight-balancing algorithm and extends it to average consensus, with new metrics and step-size rules for directed graphs.
Findings
Algorithm converges to weight-balanced solution at sublinear rate
Proposed consensus algorithm converges to true average almost surely
Numerical results confirm theoretical convergence and performance
Abstract
This paper proposes the first distributed algorithm that solves the weight-balancing problem using only finite rate and simplex communications among nodes, compliant with the directed nature of the graph edges. It is proved that the algorithm converges to a weight-balanced solution at sublinear rate. The analysis builds upon a new metric inspired by positional system representations, which characterizes the dynamics of information exchange over the network, and on a novel step-size rule. Building on this result, a novel distributed algorithm is proposed that solves the average consensus problem over digraphs, using, at each timeslot, finite rate simplex communications between adjacent nodes -- some bits for the weight-balancing problem and others for the average consensus. Convergence of the proposed quantized consensus algorithm to the average of the node's unquantized initial values…
| References | Quantization | Digraph | Convergence | Limit Point | # Bits/Timeslot | Initial Value |
| [13] | Deterministic | Neighborhood | Problem-Dependent | Integer | ||
| [14] | Deterministic | ✓ | Neighborhood | Problem-Dependent | Integer | |
| [15] | Deterministic | Neighborhood | Problem-Dependent | Box | ||
| [16] | Deterministic | Neighborhood | Infinite | Any | ||
| [17] | Deterministic | Neighborhood | Any | Any | ||
| [18] | Probabilistic | ✓ | Neighborhood | Any | Any | |
| [19] | Probabilistic | ✓ | Neighborhood | Any | Box | |
| [20, 21] | Deterministic | ✓ | Average | Any | Box | |
| [22] | Probabilistic | ✓ | Average | Any | Box | |
| [23] | Deterministic | ✓(Balanced) | ✓ | Average | Any | Box |
| [24] | Probabilistic | ✓(Balanced) | ✓ | Average | Any | Informative |
| [25, 26] | Deterministic | ✓ | ✓ | Weighted Average | Any | Box |
| [27, 28] | Deterministic | ✓ | ✓ | Average | Trajectory-Dependent | Integer |
| [29] | Deterministic | ✓ | Neighborhood | Infinite | Box | |
| [30] | Probabilistic | ✓ | ✓ | Average | Infinite | Any |
| Proposed | Probabilistic | ✓ | ✓ | Average | Any | Informative |
| Symbol | Description |
|---|---|
| Out- () & in-neighbors () of node | |
| Out- () & in-degrees () of node | |
| Timeslot Index | |
| Step-size (for weight-balancing) | |
| Step-size (for consensus) | |
| Weight matrix | |
| Sum of outgoing () & incoming () weights at node | |
| (Weight) balance | |
| Graph Laplacian matrices | |
| Quantization range for consensus | |
| Local estimate (cf. (11)) | |
| Clipped local estimate (cf. (10)) | |
| Initial measurements | |
| Number of bits to quantize | |
| Number of bits to quantize | |
| Distance between consecutive quantization points | |
| Weight-balancing signal sent by node | |
| Decreasing & Update events |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed Control Multi-Agent Systems · Distributed systems and fault tolerance · Advanced Memory and Neural Computing
Finite Rate Distributed Weight-Balancing and Average Consensus Over Digraphs
Chang-Shen Lee, Nicolò Michelusi, and Gesualdo Scutari Lee and Michelusi are with the School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN, USA; emails: <lee2495,michelus>@purdue.edu.Scutari is with the School of Industrial Engineering, Purdue University, West Lafayette, IN, USA; email: [email protected]. The work of Scutari and part of the work of Lee have been supported by the USA National Science Foundation under Grants CIF 1719205 and CIF 1564044; and in part by the Army Research Office under Grant W911NF1810238. A preliminary version of this paper appeared at IEEE CDC 2018 [1].
Abstract
This paper proposes the first distributed algorithm that solves the weight-balancing problem using only finite rate and simplex communications among nodes, compliant with the directed nature of the graph edges. It is proved that the algorithm converges to a weight-balanced solution at sublinear rate. The analysis builds upon a new metric inspired by positional system representations, which characterizes the dynamics of information exchange over the network, and on a novel step-size rule. Building on this result, a novel distributed algorithm is proposed that solves the average consensus problem over digraphs, using, at each timeslot, finite rate simplex communications between adjacent nodes – some bits for the weight-balancing problem and others for the average consensus. Convergence of the proposed quantized consensus algorithm to the average of the node’s unquantized initial values is established, both almost surely and in the moment generating function of the error; and a sublinear convergence rate is proved for sufficiently large step-sizes. Numerical results validate our theoretical findings.
Index Terms:
Distributed weight-balancing, distributed average consensus, directed graph, quantization, data rate.
I Introduction
Digraphs play a key role in a number of network applications, such as distributed optimization [2], distributed flow-balancing [3], distributed averaging and cooperative control [4], to name a few. In particular, distributed average consensus, whereby nodes aim at agreeing on the sample average of their local values, has received considerable attention over the years; some applications include load-balancing [5], vehicle formation [6], and sensor networks [7]. Several of the these distributed algorithms, when run on digraphs, require some form of graph regularity, such as the weight-balanced property [8]: at each node, the sum of the outgoing edge weights equals that of the incoming edge weights.
Several centralized algorithms have been proposed to balance a digraph; see, e.g., [9] and references therein. In this paper, we are interested in the design of distributed algorithms that solve the weight-balancing and average consensus problems over digraphs, using only quantized information, simplex communications,111One way, as opposed to duplex, two ways communications. and without knowledge of the graph topology other than the direct neighbor. This problem is motivated by realistic scenarios, such as wireless sensor networks, where channels may be asymmetric due to different transmit powers of nodes and interference, and where communications are subject to finite rate constraints. To date, the design of such algorithms in distributed settings remains a challenging and open problem, as documented next.
I-A Related works
Distributed weight-balancing algorithms were proposed in [3, 10, 8, 11, 12] (see Table I). With the exception of [8, Sec. IV],[11, 12], all these algorithms require infinite bits in each communication round, since nodes need to exchange either real valued, or integer but unbounded quantities. Although [8, Sec. IV] and [12] use a finite number of bits at each iteration, this number cannot be arbitrarily chosen (e.g. to satisfy some transmission constraints), it is instead the result of the algorithmic trajectory and thus it is not known a-priori. In addition, these works adopt unicast communications, whereby nodes transmit different signals to different out-neighbors. To reduce signaling overhead, broadcast communications are preferable in dense networks. Finally, while compliant with prescribed finite rate constraints, the distributed integer weight-balancing algorithm [11] requires full-duplex edge communications–each node must exchange information with both its out- and in-neighbors–which does not comply with simplex constraints. To the best of our knowledge, the design of distributed algorithms that solve the weight-balancing problem using a prescribed finite rate and simplex communications is an open problem.
Distributed average consensus algorithms have a long history, tracing back to the seminal works[31, 32, 4]. These early works assumed that nodes can reliably exchange unquantized information over undirected networks. To cope with limited data rates, quantization was later introduced, and its effect analyzed for both undirected [13, 14, 15, 16, 17, 18, 19, 22, 20, 21] or directed graphs [24, 25, 23, 26, 27, 28, 29, 30], as documented in Table II. The quantized average consensus problem based on deterministic uniform quantization and dithered (probabilistic) quantization has been considered in [13, 14, 15, 16, 17] and [18, 19], respectively. However, these schemes do not achieve exact consensus but converge to a neighborhood of the average. Exact average consensus is proved in [20, 21] for deterministic quantization and in [22] for probabilistic quantization. However, all these algorithms consider undirected graphs, which can be easily weight-balanced (e.g. using the Metropolis weights [33]). While the extensions of the above deterministic and probabilistic schemes to digraphs were studied in [25, 23, 26] and [24], respectively, all these works only achieve exact average convergence over balanced digraph. However, the weight matrices of digraphs are inherently unbalanced, thus requiring specific weight-balancing algorithms, as documented earlier; they thus suffer from the same limitations of distributed weight balancing schemes.
To address unbalanced digraphs, the idea adopted in the seminal work [34] is to estimate and compensate the bias caused by the unbalanced weights, via the so-called push-sum algorithm [34]. This algorithm requires unquantized communication. Unfortunately, applying naively a finite-bit quantization to the push-sum scheme does not lead to convergence, as we will demonstrate numerically in Section VI (cf. Q-Push-Sum). Extensions of push-sum employing quantization have been developed in [27, 28, 29, 30]. However, [29, 30] consider unbounded quantization intervals, which necessitates infinite bits to encode the signal whereas [27, 28] impose integer constraints on the initial values of the consensus signals and necessitate a trajectory-dependent number of bits. Besides quantization, other instances of imperfect communications in average consensus problems over digraphs were investigated in [35, 36] (asynchrony) and [37, 38] (link failure).
I-B Summary of the main contributions
The above literature review shows that there are no distributed algorithms solving the weight-balancing and the exact average consensus problems for real initial values over digraphs, using finite-bit quantized information with a prescribed number of bits and simplex communications. This paper provides an answer to these open questions.
1) Distributed quantized weight-balancing: The first contribution is a novel distributed quantized weight-balancing algorithm whereby nodes transfer part of their balance–the difference between the out-going and the incoming sum-weights, which should be zero for a weight-balanced graph–to their out-neighbors via quantized simplex communications; by doing so, the balance is transferred from high imbalance to low imbalance nodes, provably converging to a weight-balanced solution at sublinear rate. Differently from existing quantized weight-balancing schemes[10, 8, 12], the proposed algorithm can use at each iteration a prescribed number of bits (possibly, time-varying). The convergence analysis is also a novel technical contribution of the paper:
- i)
First, we identify necessary and sufficient conditions under which the total imbalance decreases, denoted by the decreasing event (see D.4). Roughly speaking, this event occurs when a node transfers its balance to a neighbor with balance of opposite sign. Hence, nodes closer to nodes with balance of opposite sign more directly contribute to trigger the decreasing event and thus reduce the total imbalance, and are therefore more important than those farther away. 2. ii)
The next step is to prove that the decreasing events occur often enough that the total imbalance asymptotically vanishes at sublinear rate. To this end, we show that the time interval between two consecutive occurrences of a decreasing event is uniformly bounded. This is proved by introducing a sophisticated metric, a non-negative integer-valued function of the imbalances of nodes and of their importance, which strictly increases every time there is a transfer of balance from less important nodes to more important ones, up until the next decreasing event occurs. By proving that this function is uniformly bounded, we conclude that the decreasing events occur infinitely often.
To build such a function, we use the idea of positional system representation: the value of the function at each timeslot is expressed by a number whose th digit represents the sum-imbalance of the th most important nodes. By doing so, every transfer of balance from nodes of lower importance towards those of higher importance causes this function to increase, as it induces a “carry” operation from a digit to the next more significant one in its positional representation. 3. iii)
We introduce a novel diminishing step-size rule, which guarantees that the balance at each node is expressed as an integer multiple of the current step-size. This choice greatly facilitates the convergence analysis, since it allows one to tightly control the amount of decrement of the total imbalance at each timeslot.
2) Distributed average quantized consensus: Building on the proposed weight-balancing scheme, we introduce a novel distributed algorithm that performs average consensus and weight-balancing on the same time scale with finite-bit simplex communications–some bits for consensus and some to balance the digraph. For instance, one may perform one-bit (simplex) communication per channel use, by exchanging weight-balancing and consensus information alternately. The key idea behind the algorithm is to preserve the average of the variables over time, while gradually weight-balancing the graph. We prove convergence of the nodes’ local variables to the exact average of the initial values, both almost surely and in the moment generating function of the error. A sublinear convergence rate is proved for sufficiently large step-sizes.
The rest of the paper is organized as follows. In Section II, we introduce some basic notation and preliminary definitions. Section III introduces the ideas of the proposed distributed quantized weight-balancing and average consensus algorithms, whose details are discussed in Section IV and Section V, respectively. Some numerical results are discussed in Section VI, while Section VII draws some conclusions. The proof of auxiliary lemmas is provided in the appendix.
II Notation and Background
II-A Notation
The sets of real, integer, nonnegative integer, and positive integer numbers are denoted by , , , and , respectively. The indicator function is denoted by ,with if is true, and otherwise. We define the floor and ceiling functions , , the sign of , , , and the clip function . We adopt the big-O notation, . Vectors are denoted as (lowercase, boldface), matrices as (uppercase, boldface). All equalities and inequalities involving random variables are tacitly assumed to hold almost surely (i.e., with probability ), unless otherwise stated. We use L.x, C.x, D.x, T.x, P.x, A.x and App.x for Lemma x, Corollary x, Definition x, Theorem x, Proposition x, Assumption x and Appendix x, respectively. The rest of the symbols used in the paper are summarized in Table III.
II-B Basic graph-related definitions
Consider a network with nodes, modeled as a static, directed graph , where is the set of vertices (the nodes), and is the set of edges (the communication links). A directed edge from to is denoted by , so that information flows from to . We assume , and denote the set of in- and out-neighbors of node as and , with cardinality (in-degree) and (out-degree), respectively. We will consider strongly connected digraphs.
Definition 1**.**
A digraph is strongly connected if, with , there exists a directed path from to .
Associated with the digraph , we define a weight matrix such that
[TABLE]
along with the following quantities (cf. Fig. 1) instrumental to formulate the weight-balancing problem.
Definition 2** (In-flow, out-flow and weight-balance).**
Given a digraph with weight matrix , the in-flow of node is defined as while the out-flow is . The weight-balance of node is defined as ; and the overall weight-balance vector is .
Definition 3** (Weight-balanced digraph).**
A weight matrix , associated to the digraph , is said to be weight-balanced if it induces zero balance, i.e., .
III Summary of The Proposed Algorithms
In this section, we introduce the proposed distributed quantized weight-balancing and consensus algorithms; their detailed analysis will be carried out in Sec. IV and Sec.V.
III-A System model and problem formulation
Average consensus problem
Let be a set of nodes. Each node controls and iteratively updates a local variable , whose initial value is set to . The average consensus problem consists in the following iterative algorithm (or variations of it): given at time , let
[TABLE]
where is a suitably chosen weight matrix compliant with the graph [cf.(1)]. The goal is to locally estimate the average of the initial values
[TABLE]
i.e., as .
We consider a setting where: i) communications among the agents are quantized using a finite number of bits; and ii) information exchanges flows according to the edge directions of the graph (simplex communications). This puts in jeopardy the convergence of the vanilla consensus algorithm (2), as communications therein subsume an infinite number of bits and needs to be balanced [24], a condition that cannot be enforced a priori without using a centralized controller with knowledge of . To cope with these two issues, we first introduce a distributed quantized weight-balancing algorithm solving the weight-balancing problem (cf. Sec, III-B); and then we integrate this algorithm with a distributed consensus algorithm using quantized simplex communications solving the average consensus problem (cf. Sec. III-C).
III-B Distributed quantized weight-balancing
We propose a distributed, iterative algorithm to solve the weight-balancing problem over a strongly connected digraph using only quantized information and simplex communications. Note that strong connectivity guarantees the existence of a matrix, compliant to the digraph (cf. D.1) that is weight-balanced (cf. D.3) [9]. The proposed algorithm is formally stated in Algorithm 1 and discussed next.
Each node controls the in-neighbors weights . In WB.1, each node quantizes the local balance via (4), using bits (a -bit quantizer has quantization levels), and broadcasts the quantized signal to its out-neighbors. In WB.2, each agent collects the signals from its in-neighbors, and updates the corresponding weights according to (5). The balance of each node is then updated according to (6). Roughly speaking, by (5)-(6) there is a transfer of the balance among nodes in the network: the quantity (with denoting the step-size) is subtracted from the balance of node [cf. (6)], and equally divided among its out-neighbors , which will increase their incoming weight by [cf. (5)]. Note that 1) although may be negative, remains compliant to , which will be shown in T.2; 2) Algorithm 1 is fully distributed: each node only needs to know its in- and out-degrees and , and to agree on a common step-size rule . This assumption, along with knowledge of or its equivalent information, is commonly used in distributed algorithms over directed graphs; see, e.g., [8, 12, 34, 39, 29]. Convergence of Algorithm 1 is studied in Sec. IV.
III-C Distributed quantized average consensus algorithm
We now introduce the proposed distributed quantized average consensus algorithm over non-balanced digraphs, as described in Algorithm 2. The algorithm combines Algorithm 1 with a variation of the quantized average consensus protocol based on probabilistic quantization, which we recently proposed in [24]. The algorithm is designed so that these two building blocks run on the same time-scale.
More specifically, each node controls two set of variables, namely: i) the in-neighbors weights ; and ii) the local estimate . The goal is to update these variables so that asymptotically the average consensus problem is solved while the weights converge to a balanced matrix. At each iteration , node quantizes its local estimate , by first clipping it within the quantization range [cf. (10)], followed by the probabilistic quantization (9) with bits; it then transmits the resulting quantized signal (along with for the weight-balancing) to its out-neighbors (AC.1). Upon receiving the signals from its in-neighbors, node updates its weights using (5), and the local variable according to (11). The update in (11) aims at forcing a consensus on the average among the local variables . In fact, the third term in (11) is instrumental to align the local copies , while the second term is a correction needed to preserve the average of the iterates, i.e., , for all [cf. (62)]. Hence, if all are asymptotically consensual, it must be \big{|}y_{i}(k)-(1/N)\sum_{i}y_{i}(k)\big{|}=\big{|}y_{i}(k)-(1/N)\sum_{i}y_{i}(0)\big{|}\underset{k\to\infty}{\longrightarrow}0.
Convergence of Algorithm 2 is studied in Sec. V.
IV Distributed Quantized Weight-Balancing
We study convergence of Algorithm 1 under the following mild assumptions.222The analysis can be extended to the case in which each node uses its own step-size , provided that: 1) every node knows the step-size of its in-neighbors, and 2) every satisfies A.2.
Assumption 1**.**
Let be a sequence satisfying and for all and some . Then, there exists such that the number of bits \big{\{}B_{i}^{\rm(w)}(k)\big{\}}_{k\in\mathbb{Z}_{+}} satisfies: for all ,
[TABLE]
Assumption 2**.**
The step-size and initial weight matrix satisfy:
[TABLE]
respectively, where , and .
Define , .
Note that the step-size satisfying A.2 is vanishing and non-summable, as shown below.
Lemma 1**.**
If satisfies A.2, then
[TABLE]
Proof:
Let . Note that . Then, the upper and lower bounds on are obtained by bounding with respect to this interval. ∎
A.2 is consistent with similar choices adopted in stochastic optimization [40], such as . However the diminishing and non-summability properties alone are not sufficient to prove convergence of Algorithm 1; A.2 further guarantees that the balance at each node is always an integer multiple of the current step-size (L.11, cf. App. A-C ), which will be shown to be a key property to prove that is asymptotically vanishing. An instance of satisfying A.2 is [1]
[TABLE]
Note that a fixed step-size , for all , may fail to achieve convergence. In fact, it is possible that each , so that and there is no further transfer of balance, resulting still in an unbalanced digraph.
A.1 states that at least once over a time window of duration , all nodes are simultaneously communicating at least one bit to their out-neighbors. This offers some flexibility in the design of the communication protocol. For instance, nodes can transmit one bit at each time slot [1], yielding one bit per channel use, or transmit one bit every time slots, resulting in a lower effective rate of bits per channel use.
We are now ready to state our main convergence result.
Theorem 2**.**
Let be the sequence generated by Algorithm 1 under A.1 and A.2. Then, there hold:
[TABLE]
where is weight-balanced and , are defined in (31).
IV-A Proof of Theorem 2
IV-A1 Proof of statement (a)
We begin by highlighting the main steps of the proof, with the help of Fig. 2. Step 1: We show that is non-increasing. Furthermore, we identify two key events affecting the dynamics of , namely: the so-called “decreasing event” and “update event” . , formally defined in (15), occurs if at timeslot either one of the following two facts happen: 1) a node transfers its nonzero balance to an out-neighbor with balance of opposite sign; 2) two nodes, with balance of opposite sign, transfer their balances to a common out-neighbor. On the other hand, , formally defined in (17), occurs if at timeslot a node transfers its balance to its out-neighbors (i.e., , for some ) but does not occur. Note that it can happen that neither nor occur at some ; this is the case when is “too small” or all nodes are inactive (). We show that decreases by at least , with , iff. occurs, and remains unchanged otherwise (cf. L.3). **Step 2: ** To guarantee that vanishes, the decreasing event must occur sufficiently often. Towards this end, we prove two key properties of the decreasing and update events, namely:
P1)
there are at most update events between two consecutive decreasing events;
P2)
if (roughly speaking, if does not decrease sufficiently fast), there are at most timeslots between two consecutive update events.
The decreasing step-size together with P2 guarantee that update events occur within bounded time; this property combined with P1 guarantees that decreasing events occur at uniformly bounded time intervals. Finally, Step 3 builds on the above results to prove statement (a) of the theorem. Roughly speaking, one can infer that: either 1) is below the diminishing threshold (Step 1), causing it to vanish (since the step-size vanishes); or 2) it exceeds the threshold at some timeslots, causing it to be suppressed by the decreasing event (Step 2) until it falls again below the vanishing threshold.
We proceed next with the formal proof.
Step 1: We begin by introducing the definition of and .
Definition 4** (Decreasing event and its occurrence time ).**
Let , , be defined as
[TABLE]
Furthermore, let , , be the timeslot of occurrence of the th decreasing event; recursively, and ,
[TABLE]
Definition 5** (Update event ).**
Let , , be defined as:
[TABLE]
We show next that decreases by at least iff. occurs, and remains unchanged otherwise.
Lemma 3**.**
There holds
[TABLE]
Proof:
See App.A-II.∎
Step 2: This step characterizes “how often” and occur (properties P1 and P2), and the implication on . We first provide some intuition motivating our approach.
** Intuition:** Let us look at the balance transfer within two consecutive decreasing events at (finite) times and . Let
[TABLE]
be the set of nodes with positive and negative balance. Note that iff. , since . For the next decreasing event to occur at time : either 1) a node (resp. ) has enough balance (i.e., ) to trigger the update to an out-neighbor in (resp. ); or 2) two nodes and trigger an update to an out-neighbor in . This balance is built-up throughout the update events within , during which nodes in (resp. ), , keep transferring part of their balance towards the out-neighbors in (resp. ) that are closer to nodes outside (resp. ). Hence, one can expect that the decreasing event will occur after a certain number of update events, specifically when a sufficient amount of balance has been transferred to some nodes having out-neighbors in (resp. ). To characterize this number and show that it is bounded over each interval , the proposed idea is to construct a nonnegative, integer-valued function of , denoted by , which (a) strictly increases whenever occurs; and (b) is uniformly upper bounded on . These properties guarantee that the number of update events within is bounded, which proves P1. The same function will be also used to prove P2 (cf. P.4 & C.5). Next, we build and prove P1 and P2.
** Building the function :** Let , ,
[TABLE]
We define the set (possibly empty) of nodes that are directed hops away from a node with opposite sign of balance as
[TABLE]
where is the directed distance between and . Then, we define the function
[TABLE]
Based on the above discussion, nodes in are “more important” than nodes in , in the sense that they will more immediately trigger the next decreasing event; nodes in are more important than those in , and so on. The function aims at capturing this hierarchical transfer of balance along the chain during each update event, up until occurs. In particular, we want to increase its value by (at least) one (integer) unit every time one of such transfers happens (i.e., occurs).
To motivate the choice of (22), let us look at the balance transfer during an update event at time ; for the sake of simplicity, say is triggered by node . As a result, node transfers part of its balance to its out-neighbors ,333Note that as, by definition of , at least one node in is one step closer to nodes with balance of opposite sign. according to (6). In (6), can be regarded as the unit of balance and is the normalized imbalance, an integer number (cf. L.11, App.A-III). Such a node experiences a decrease of its normalized imbalance by units while the normalized imbalance of increases by at least one unit. To encode this balance transfer as an increase of by at least one integer, we can associate it with the “carry on” operation from a digit to the next more significant one in a positional notational representation of the value. Specifically, is expressed in radix- notation wherein the sum normalized imbalance of nodes in contributes to the most significant digit, the one of nodes in contributes to the second most significant digit, and so on. By doing so, when occurs as above, the aforementioned exchange of balance triggers the transfer of one unit from the th most representative digit to the th one, so that increases by at least one unit.
** Proof of P1 and P2:** P.4 below states the desired properties of and proves P2 as a by-product; P1 follows from C.5.
Proposition 4** (Properties of ).**
Let . Then:
- (i)
* is non-negative, non-decreasing, and upper bounded by ;* 2. (ii)
* strictly increases if occurs, i.e., * 3. (iii)
If , then an update or decreasing event occurs within the next slots; hence,
.444Since we are interested in the variations of within , the case is irrelevant.
Proof:
See Appendix A-III.∎
Corollary 5**.**
- (i)
There are at most update events between two consecutive decreasing events. 2. (ii)
If , , then
[TABLE]
Proof:
(i) is a direct result of P.4(i)-(ii) and the fact that iff. . (ii): let such that , and let be the next decreasing event at or after (possibly, and/or ). Invoking P.4, we infer that i) there are at most update events in ; ii) ( is non-increasing and only decreases after decreasing events), so that the first update or decreasing event after occurs within timeslots and subsequent ones are separated by at most timeslots until the next decreasing event at . These two facts together imply ; therefore,
[TABLE]
∎
Step 3: We now prove that . Equivalently,
[TABLE]
To this end, note that it suffices to show that
[TABLE]
In fact, using L.1 in App.A-I, (24) implies
[TABLE]
so that (23) readily follows with . To prove (24), let ; we define as
[TABLE]
for some sufficiently large to be determined. The existence of such is guaranteed by L.10 in App.A-I. Let
[TABLE]
Then, it readily follows that, for all ,
[TABLE]
so that (24) holds for . It remains to prove that it holds for . We do so by induction. Assume that it holds at , for some , and that
[TABLE]
Clearly, this is true for . We show next that this condition implies that (24) holds for , and
[TABLE]
Therefore, (24) holds . To show the induction step, note that (26) implies (24), , since
[TABLE]
It remains to prove (27); we do it by contradiction. Assume (26) and (28) hold but (27) does not. Then,
[TABLE]
Choosing large enough so that, for ,
[TABLE]
(this is possible since ) we can then apply C.5(ii) recursively times, yielding
[TABLE]
where follows from A.2 and (28). This proves the contradiction, hence .
IV-A2 Proof of statement (b)
Convergence of to is a consequence of the following Lemma.
Lemma 6**.**
The sequence is a Cauchy sequence.
Proof:
See Appendix A-B.∎
IV-A3 Proof of statement (c)
First, using the fact that for any , there exists such that , it follows that
[TABLE]
Note that (12) and (5) imply have the same value. Let , where . Applying (30) to the update of , it follows that
[TABLE]
where we use the fact , which implies that
[TABLE]
Let , and be a non-trivial weight-balancing solution, i.e., . It follows that and satisfy the following properties
since ; 2. 2.
. Since is an irreducible singular M-matrix, cf. [41, C.4.24], it follows from [41, T.4.31] that 1) has rank , and 2) , and thus ; 3. 3.
since .
Let with , and . It follows that
[TABLE]
where follows from and follows from . Similarly,
[TABLE]
which proves the desired results with
[TABLE]
V Distributed Quantized Average Consensus
In this section, we study convergence of Algorithm 2. We introduce the following mild assumptions.
The first condition is on the number of bits used to quantize the consensus variables at each iteration.
Assumption 3**.**
Let be an activation sequence satisfying and for all and some given . The number of bits \big{\{}B_{i}^{\rm(c)}(k)\big{\}}_{k\in\mathbb{Z}_{+}} used by each node satisfies
[TABLE]
The above condition is almost the same as the one used in the weight-balancing algorithm (cf. A.1) except for the global upper bound, and can be coupled with it. For example, nodes can communicate for weight-balancing using one bit at odd time slots, and for average consensus using one bit at even time slots, yielding one bit per channel use. Lower effective data rates can be achieved using intermittent communications.
We next introduce the assumption on and the step-size used in the consensus updates.
Assumption 4** (Informative ).**
The average [cf. (3)] satisfies .
Assumption 5**.**
The step-size sequence satisfies:
[TABLE]
It is important to remark that A.4 neither requires to be confined within the quantization range nor its to be known. This is a major departure from the literature, which calls for to be within the quantization range – see, e.g. [22, 20, 25, 21]. We require instead the average to fall within the quantization interval , which is a less restrictive condition. For example, if nodes are estimating a common unknown parameter via noisy measurements corrupted by zero mean Gaussian noise , i.i.d. across nodes, then is the sample mean estimate across the nodes. In this case, a bound on is hard to obtain (theoretically it is unbounded), but the bound of the parameter, , is known in many cases. Even worse, for , whereas the sample average , so that it becomes more and more informative for large , whereas the initial local measurements become larger and larger. In this example, nodes can simply set , so that is informative with high probability. Herein, we are not interested in non-informative , which, as the name suggests, does not provide any information to estimate .
We are now ready to state the convergence of Algorithm 2.
Theorem 7**.**
*Let be the sequence generated by Algorithm 2 under A.2-5. Then:
(a) Almost sure convergence:*
[TABLE]
(b) Convergence in the moment generating function:
[TABLE]
*Furthermore, if and , then:
(c) Convergence rate:*
[TABLE]
where with defined in L.14.
Proof:
Let with . Using (11), the -updates become
[TABLE]
and, due to the probabilistic quantization, .
To study the dynamics of the consensus error, we define
[TABLE]
and prove that the sequence satisfies the conditions of [42, T.1], sufficient to prove our theorem.
Intermediate results: We begin by introducing some properties of , instrumental for the sequel of the proof.
Lemma 8**.**
In the setting of T.7, there holds
[TABLE]
The case holds trivially since . Otherwise () L.8 follows from the dynamics (35) and the fact that with probabilistic quantization. To bound these dynamics when , we use the fact that is uniformly bounded within a bounded set with probability 1 (cf. L.13) and L.14 to obtain
[TABLE]
where are constants defined in L.14 and in we defined
[TABLE]
for some constant . Note that the boundedness of (cf. L.6), and thus of , and that of (being the output of a finite rate quantizer), guarantee that .
We are now ready to prove T.7.
Proof of statement (a): Define , and . It is sufficient to show that in (36) satisfies the conditions of [42, T.1], namely:
[TABLE]
where is a non-negative function such that
[TABLE]
and and satisfy
[TABLE]
Conditions in 1) are trivially satisfied by definition [cf. (36)]. To prove the condition in 2), we use , L.13 in App.B, and (37), yielding,
[TABLE]
with and . Moreover,
[TABLE]
where we used (cf. A.5); and , due to (38), A.5, and T.2. Therefore, the condition in 2) holds.
Overall, we have shown that all the conditions of [42, T.1] are satisfied, implying
[TABLE]
Since and for all , statement (a) of the theorem follows.
Proof of statement (b): Since and \mathbb{P}\Bigr{(}\underset{k\rightarrow\infty}{\lim}{\|{\bf y}(k)-\bar{y}(0){\bf 1}\|^{r}=0}\Bigr{)}=1, for all (recall that is bounded for all , cf. L.13 in App.B), it follows from the dominated convergence theorem (cf. [43, T.1.6.7]) that , which implies statement (b).
Proof of statement (c): For simplicity, we assume that , and the proof can be easily generalized to the case that satisfying Assumption 3. Since and , it follows that , and there exist such that
[TABLE]
Under the conditions of the theorem, (37) holds, which implies
[TABLE]
where . Let . By induction, we can show
[TABLE]
Note that
[TABLE]
[TABLE]
[TABLE]
The first three terms are bounded, since and for . It follows that , for some . Letting , it follows that
[TABLE]
for some . To conclude, note that ,
[TABLE]
which proves the desired result.
∎
VI Numerical Results
In this section, we present some numerical results to validate our theoretical findings on strongly connected digraphs with nodes constructed by the following procedure: a directed ring links all the nodes, to ensure strong connectivity (cf. Fig. 3). Then directed edges are randomly added, with probability on each pair of nodes.
VI-A Quantized weight-balancing
We adopt (14) for . We compare the total imbalance of our proposed scheme with the integer weight-balancing and real weight-balancing schemes in [8]. The real weight-balancing scheme uses real valued communications; the integer weight-balancing scheme uses unicast transmissions to each of its out-neighbors to communicate the associated edge weight, and cannot use a prescribed number of bits. As we will see numerically, these features allow the scheme to converge to a weight-balanced solution within finite time. In contrast, our scheme uses broadcast communications with a prescribed number of bits per channel use, which in general does not guarantee convergence within finite time. The simulation results are averaged over 100 graph realizations.
Fig. 4 shows the total imbalance of Algorithm 1 with 1-bit and 5-bit of information exchange, as well as of the other two benchmark schemes. Note that in the integer weight-balancing scheme, the maximum weights in the 100 realizations are between 88 to 250, implying that 7 to 8 bits are required per edge per timeslot, which implies or or bits per node per timeslot. It is shown that the metric is non-increasing for the proposed schemes, which is consistent with our analytical results (cf. L.3). In addition, one can see that the curve of can be partitioned into nearly flat and steep line segments, for both schemes. The rationale behind this behavior is that the total imbalance decreases only when decreasing events occur (steep line segments); in between, the imbalance may be transferred within the network, but without causing the total imbalance to decrease. Compared with the two benchmark schemes, it shows that the proposed scheme with 50 bits outperforms the real weight-balancing scheme [8], which requires infinite rate communications. On the other hand, The comparison between the proposed 7-bit scheme and the integer weight-balancing scheme shows that, initially, the proposed scheme has better performance. However, as noted earlier, the integer weight-balancing scheme later outperforms the proposed 7-bit scheme since it is guaranteed to converge to a weight-balanced solution in finite timeslots.
VI-B Quantized average consensus
We compare our proposed algorithm with the following state-of-the-art schemes:
- Q-Push-Sum, where we straightforwardly apply the finite-bit probabilistic quantization to the original push-sum algorithm in [34], i.e., ,
[TABLE]
and is the estimate of the initial average, where is the quantization defined in (9); note that Q-Push-Sum can be regarded as the generalization of [27] to real valued initialization and finite rate communications; 2) Q-Run-Avg, where we apply the finite-bit probabilistic quantization to the algorithm in [30];555Note: this algorithm requires memory space to store the estimate of the eigenvector of graph Laplacian at each node. 3) Q-Monte-Carlo, where we apply the -bit quantization
[TABLE]
to the Monte-Carlo based algorithm in [29, Section 4]. In this algorithm nodes exchange quantized random values sampled from the exponential distribution with parameter related to their current states. Note that exact convergence can be achieved by [30, 29] using infinite-bit quantized communications, [34] using real value communications, and [27] using integer value communications. However, there is no theoretical guarantee for all these benchmark schemes with finite bit quantization: our proposed scheme is the first algorithm solving the distributed average consensus over unbalanced digraphs with a prescribed finite rate communications.
We adopt the mean square error (MSE) as defined in (36) as performance metric. The simulation results are averaged over 100 graph realizations and 100 initial value realizations, i.e., totally 10000 realizations.
For the proposed algorithm, we adopt: , ; (14) is adopted for , and , which satisfies A.5; for Q-Push-Sum we use 50 bits and to quantize , and 50 bits and to quantize ; for Q-Run-Avg we quantize each element of (the estimate of the left eigenvector at 0 of graph Laplacian constructed by a row stochastic weight matrix) using 4 bits (i.e., totally bits required for quantizing ) and with , and is quantized using 20 bits and ; for Q-Monte-Carlo, we use 50 bits to quantize both and , and other parameters are: . Note that the communication resource budget per node per timeslot is 100 bits in all schemes.
Fig. 5 shows the MSE performance of Algorithm 2 as well as other benchmark schemes. It is shown that only the proposed scheme and the Q-Monte-Carlo are reaching the average consensus, among all finite rate schemes. Note that Q-Run-Avg and Q-Push-Sum seem also converge for some realizations, cf. Fig. 6. However, only the proposed scheme has theoretical convergence guarantees.
Fig. 7 shows the communication cost (left y-axis) and delay (right y-axis) needed by Algorithm 2 to reach a target MSE of and , versus the total number of bits per channel use. The communication cost is defined as the product of the total number of bits per node per timeslot and the number of timeslots. For each parameter setting, we run 50 graph realizations and 10 initial value realizations. To avoid the average results affected by the outliers, we select the best of results to perform averaging. We observe that increasing the total number of bits reduces the number of timeslots required. On the other hand, there exists an optimal number of bits that minimizes the communication cost. Using more bits does not appear to be beneficial, since the communication cost becomes larger, and it is only marginally compensated by the reduction of the number of timeslots required.
VII Conclusions
In this paper, we introduced a novel distributed algorithm that solves the weight-balancing problem using only quantized information and simplex communications. Building on this scheme, a second contribution of the paper was a novel distributed average consensus algorithm over non-balanced digraphs that uses quantized simplex communications. Convergence of the algorithm was proved using a novel line of analysis, based on a novel metric inspired by the positional system representation and a new step-size rule. Finally, numerical results validated our theoretical findings.
Appendix A Intermediate Results in the Proof of Theorem 2
A-A Preliminary definitions and results
Throughout the proof, we write the updates of , of Algorithm 1 as
[TABLE]
Lemma 9**.**
Given and , defined in (19), it holds:
[TABLE]
Proof:
Let and consider . Then, (4) and imply ; (see (15)) implies . then follows from (46), so that ; hence . follows from a similar argument on . ∎
Lemma 10**.**
.
Proof:
We prove it by contradiction. Let . Suppose , for all . Invoking Corollary 5.(ii) recursively times and taking yields
[TABLE]
a contradiction since due to (13). ∎
A-B Proof of Lemma 6
By the definition of Cauchy sequence applied to each entry of , we need to prove that, such that . To this end, let and define as666Note that since and , see (13).
[TABLE]
Since is updated only at update or decreasing events, using (5) recursively, we infer
[TABLE]
With defined as in (16) and letting , we can further upper bound
[TABLE]
Since there are at most update events between the two consecutive decreasing events at times and (cf. C.5), it follows that , hence
[TABLE]
To bound , we apply recursively L.3,
[TABLE]
hence . By combining (49) with (50) and (47), we finally obtain, ,
[TABLE]
and, ,
[TABLE]
which proves that is a Cauchy sequence.
A-C Proof of Lemma 3
We first introduce the following intermediate result.
Lemma 11**.**
Let be the sequence generated by Algorithm 2. Then, and .
Proof:
We prove this lemma by induction using (46). The induction hypothesis holds at , since and (cf. A.2). Suppose that it holds at , i.e., . Then, since , with (A.2), it follows that and . Therefore, by (46), one can infer that , proving the induction step and completing the proof. ∎
Proof of Lemma 3: Let and . Using (46), we find
[TABLE]
It will be useful to note that, as can be seen from (4), , (possibly, ) and have the same signs, yielding the following inequality for all ,
[TABLE]
where stands for the triangle inequality. We now distinguish the two cases and . If , from the negation of in (15) and from (4), it follows that , and have the same signs, so that (52) holds with equality, .
Conversely, if , there exists and such that either 1) , and ; or 2) , and . In the first case (, and ), we bound (51) as
[TABLE]
In the second case (, , and, without loss of generality, ), we bound instead
[TABLE]
In both cases, since , and (see (4)), we further bound
[TABLE]
By summing (52) (with strict equality if ) and (54) over , it holds
[TABLE]
after noticing that , hence
[TABLE]
This completes the proof.
A-D Proof of Proposition 4
Property (i): Note that since , for all . To show that it is upper bounded, we use and , and write
[TABLE]
We prove that is nondecreasing as by product of the proof of Property (ii), as given below.
Property (ii): Since , we can lower bound as
[TABLE]
Case 1: . We have and , which implies . Case 2: . From the discussion following (52), (52) holds with equality:
[TABLE]
Moreover, ; this implies that there exists a non-empty set of nodes that receive at least one update from their in-neighbors, defined as
[TABLE]
It is straightforward to show that
[TABLE]
In fact, if (i.e., ), it follows that (i.e., and , cf. L.9); therefore, setting and in (55), we find that , so that and (56) follows. With this definition, let
[TABLE]
be the distance of the node closest to those of opposite sign of balance at to receive the update, and let be one of such nodes. Then, we have
[TABLE]
In fact, if for some of such , then receiving the update, which contradicts the definition of . Reading (55) at , yields
[TABLE]
We can then further lower bound as
[TABLE]
where we neglected the non-negative terms associated to . To further bound this quantity, note that (cf. (57)), which, together with for an update event to occur, implies (cf. (4)). Therefore . Finally, we use the fact that
[TABLE]
yielding
[TABLE]
Now, using and , we obtain
[TABLE]
In the last inequality, we used the fact that . In fact, if , then (57) implies that , which contradicts the occurrence of the update event. Finally, note that and (cf. L.9), hence , i.e., gets closer to nodes of opposite sign. Together with , it implies
[TABLE]
The desired result follows by using (60)-(61) in (A-D).
Property (iii): We prove it by contradiction. Assume that such that and are two consecutive update events with , and . It follows that such that
[TABLE]
which implies that due to A.1, which contradicts the assumption that and are two consecutive update events since . Hence, it follows from property (ii) that
[TABLE]
which proves property (iii).
Appendix B Auxiliary Results for Theorem 7
Lemma 12**.**
Let be the sequence generated by Algorithm 2, in the setting of T.7. Then, it holds:
[TABLE]
Proof:
From (35) and , it follows
[TABLE]
so that the statement of the lemma readily follows after noticing that . ∎
Lemma 13**.**
Let be the sequence generated by Algorithm 2, in the setting of T.7. Then,
[TABLE]
*where , , , *
[TABLE]
Proof:
We first show that . From T.2, we know that is bounded for all , which implies . On the other hand, since and , one can verify using Cauchy-Schwarz inequality that and thus . By inspection, it is also clear that and . We now prove by induction. Clearly, it for . Now, assume it holds for some , we prove that this implies (induction step). We have:
If , then
[TABLE] 2. 2.
If , then , so that (11) yields
[TABLE] 3. 3.
Similarly, if then
[TABLE] 4. 4.
If , then so that (11) yields
[TABLE]
∎
Lemma 14**.**
Let be the sequence generated by Algorithm 2, in the setting of T.7. Then,
[TABLE]
for some finite constants .
Proof:
Let be the saturation error,
, ,
, . The proof contains three steps:
- •
Step 1: We will lower bound as
[TABLE]
- •
Step 2: we will show that the last term of the RHS in Step 1 satisfies, for some ,
[TABLE]
- •
Step 3: by combining the above results, we will show that, for some constants ,
[TABLE]
In the following, we provide detailed derivations of each step.
- •
Step 1: It is easy to show that
[TABLE]
The term can be lower bounded as
[TABLE]
where comes from the fact that
[TABLE]
where the last inequality comes from the fact that (i) if , then ; (ii) if , then and ; and (iii) if , then and .
- •
Step 2: First, one can verify that
[TABLE]
Let , , . Note that to preserve the average (L.12). Since is strongly connected, there exists a path from to . Let be the set of nodes in the shortest path from to , with and . We have
[TABLE]
where follows from (T.2); comes from Cauchy-Schwarz inequality. To further bound this quantity, note that
[TABLE]
On the other hand, since the consensus algorithm preserves the average, it follows
[TABLE]
so that the first inequality in (69) is upper bounded as
[TABLE]
Consider the following two cases:
- (i)
, so that and
[TABLE] 2. (ii)
( can be solved similarly) so that : since , using (70) and it follows
[TABLE]
From (i), (ii) and (68), there exists some such that
[TABLE]
since and thus is bounded (L.13).
- •
Step 3: Let . By combining (66) and (67), we get
[TABLE]
with , where comes from the facts , , and .
∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] C. Lee, N. Michelusi, and G. Scutari, “Distributed quantized weight-balancing and average consensus over digraphs,” in Proc. IEEE Conf. on Decis. and Control , Dec. 2018, pp. 5857–5862.
- 2[2] P. D. Lorenzo and G. Scutari, “Next: In-network nonconvex optimization,” IEEE Trans. Signal Inf. Process. Netw. , vol. 2, no. 2, pp. 120–136, Jun. 2016.
- 3[3] C. N. Hadjicostis and A. D. Domínguez-García, “Distributed balancing of commodity networks under flow interval constraints,” IEEE Trans. Autom. Control , vol. 64, no. 1, pp. 51–65, Jan. 2019.
- 4[4] R. Olfati-Saber and R. M. Murray, “Consensus problems in networks of agents with switching topology and time-delays,” IEEE Trans. Autom. Control , vol. 49, no. 9, pp. 1520–1533, Sep. 2004.
- 5[5] G. Cybenko, “Dynamic load balancing for distributed memory multiprocessors,” J. Parallel and Distrib. Comput. , vol. 7, no. 2, pp. 279–301, 1989.
- 6[6] J. A. Fax and R. M. Murray, “Information flow and cooperative control of vehicle formations,” IEEE Trans. Autom. Control , vol. 49, no. 9, pp. 1465–1476, Sep. 2004.
- 7[7] G. Scutari, S. Barbarossa, and L. Pescosolido, “Distributed decision through self-synchronizing sensor networks in the presence of propagation delays and asymmetric channels,” IEEE Trans. Signal Process. , vol. 56, no. 4, pp. 1667–1684, Apr. 2008.
- 8[8] A. I. Rikos, T. Charalambous, and C. N. Hadjicostis, “Distributed weight balancing over digraphs,” IEEE Trans. Control Netw. Syst. , vol. 1, no. 2, pp. 190–201, Jun. 2014.
