A New Backpressure Algorithm for Joint Rate Control and Routing with Vanishing Utility Optimality Gaps and Finite Queue Lengths
Hao Yu, Michael J. Neely

TL;DR
This paper introduces a novel backpressure algorithm that achieves near-optimal utility with finite queue lengths, overcoming the traditional utility-delay tradeoff in multi-hop networks.
Contribution
It proposes a new backpressure algorithm that guarantees vanishing utility gaps while maintaining bounded queue lengths, a significant improvement over existing methods.
Findings
Utility gap approaches zero as the algorithm runs
Queue lengths remain bounded by a finite constant
The method uses a new convex programming approach
Abstract
The backpressure algorithm has been widely used as a distributed solution to the problem of joint rate control and routing in multi-hop data networks. By controlling a parameter in the algorithm, the backpressure algorithm can achieve an arbitrarily small utility optimality gap. However, this in turn brings in a large queue length at each node and hence causes large network delay. This phenomenon is known as the fundamental utility-delay tradeoff. The best known utility-delay tradeoff for general networks is and is attained by a backpressure algorithm based on a drift-plus-penalty technique. This may suggest that to achieve an arbitrarily small utility optimality gap, the existing backpressure algorithms necessarily yield an arbitrarily large queue length. However, this paper proposes a new backpressure algorithm that has a vanishing utility optimality gap, so…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Wireless Network Optimization · Network Traffic and Congestion Control · Wireless Networks and Protocols
A New Backpressure Algorithm for Joint Rate Control and Routing with Vanishing Utility Optimality Gaps and Finite Queue Lengths
Hao Yu and Michael J. Neely
Department of Electrical Engineering
University of Southern California
Abstract
The backpressure algorithm has been widely used as a distributed solution to the problem of joint rate control and routing in multi-hop data networks. By controlling a parameter in the algorithm, the backpressure algorithm can achieve an arbitrarily small utility optimality gap. However, this in turn brings in a large queue length at each node and hence causes large network delay. This phenomenon is known as the fundamental utility-delay tradeoff. The best known utility-delay tradeoff for general networks is and is attained by a backpressure algorithm based on a drift-plus-penalty technique. This may suggest that to achieve an arbitrarily small utility optimality gap, the existing backpressure algorithms necessarily yield an arbitrarily large queue length. However, this paper proposes a new backpressure algorithm that has a vanishing utility optimality gap, so utility converges to exact optimality as the algorithm keeps running, while queue lengths are bounded throughout by a finite constant. The technique uses backpressure and drift concepts with a new method for convex programming.
I Introduction
In multi-hop data networks, the problem of joint rate control and routing is to accept data into the network to maximize certain utilities and to make routing decisions at each node such that all accepted data are delivered to intended destinations without overflowing any queue in intermediate nodes. The original backpressure algorithm proposed in the seminal work [1] by Tassiulas and Ephremides addresses this problem by assuming that incoming data are given and are inside the network stability region and develops a routing strategy to deliver all incoming data without overflowing any queue. In the context of [1], there is essentially no utility maximization consideration in the network. The backpressure algorithm is further extended by a drift-plus-penalty technique to deal with data network with both utility maximization and queue stability considerations [2, 3, 4]. Alternative extensions for both utility maximization and queue stabilization are developed in [5, 6, 7, 8]. The above extended backpressure algorithms have different dynamics and/or may yield different utility-delay tradeoff results. However, all of them rely on “backpressure” quantities, which are the differential backlogs between neighboring nodes.
It has been observed in [9, 5, 7, 10] that the drift-plus-penalty and other alternative algorithms can be interpreted as first order Lagrangian dual type methods for constrained optimization. In addition, these backpressure algorithms follow certain fundamental utility-delay tradeoffs. For instance, the primal-dual type backpressure algorithm in [5] achieves an utility optimality gap with an queue length, where is an algorithm parameter. By controlling parameter , a small utility optimality gap is available only at the cost of a large queue length. The drift-plus-penalty backpressure algorithm [4], which has the best utility-delay tradeoff among all existing first order Lagrangian dual type methods for general networks, can only achieve an utility optimality gap with an queue length. Under certain restrictive assumptions over the network, a better tradeoff is achieved via an exponential Lyapunov function in [11], and an tradeoff is achieved via a LIFO-backpressure algorithm in [12]. The existing utility-delay tradeoff results seem to suggest that a large queueing delay is unavoidable if a small utility optimality gap is demanded.
Recently, there have been many attempts in obtaining new variations of backpressure algorithms by applying Newton’s method to the Lagrangian dual function. In the recent work [10], the authors develop a Newton’s method for joint rate control and routing. However, the utility-delay tradeoff in [10] is still ; and the algorithm requires a centralized projection step (although Newton directions can be approximated in a distributed manner). Work [13] considers a network flow control problem where the path of each flow is given (and hence there is no routing part in the problem), and proposes a decentralized Newton based algorithm for rate control. Work [14] considers network routing without an end-to-end utility and only shows the stability of the proposed Newton based backpressure algorithm. All of the above Netwon’s method based algorithms rely on distributed approximations for the inverse of Hessians, whose computations still require certain coordinations for the local information updates and propagations and do not scale well with the network size. In contrast, the first order Lagrangian dual type methods do not need global network topology information. Rather, each node only needs the queue length information of its neighbors.
This paper proposes a new first order Lagrangian dual type backpressure algorithm that is as simple as the existing algorithms in [4, 5, 7] but has a better utility-delay tradeoff. The new backpressue algorithm achieves a vanishing utility optimality gap that decays like , where is the number of iterations. It also guarantees that the queue length at each node is always bounded by a fixed constant of the same order as the optimal Lagrange multiplier of the network optimization problem. This improves on the utility-delay tradeoffs of prior work. In particular, it improves the utility-delay tradeoff in [5] and the utility-delay tradeoff of the drift-plus-penalty algorithm in [4], both of which yield an unbounded queue length to have a vanishing utility optimality gap. The new backpressure algorithm differs from existing first order backpressure algorithms in the following aspects:
The “backpressure” quantities in this paper are with respect to newly introduced weights. These are different from queues used in other backpressure algorithms, but can still be locally tracked and updated. 2. 2.
The rate control and routing decision rule involves a quadratic term that is similar to a term used in proximal algorithms [15].
Note that the benefit of introducing a quadratic term in network optimization has been observed in [16]. Work [16] considers a network utility maximization problem with given routing paths that is a special case of the problem treated in this paper. The algorithm of [16] considers a fixed set of predetermined paths for each session and does not scale well when treating all (typically exponentially many) possible paths of a general network. The algorithm proposed in [16] is not a backpressure type and hence is fundamentally different from ours. For example, the algorithm in [16] needs to update the primal variables (source session rates for each path) at least twice per iteration, while our algorithm only updates the primal variables (source session rates and link session rates) once per iteration. The prior work [16] shows that the utility optimality gap is asymptotically zero without analyzing the decay rate, while this paper shows the utility optimality gap decays like .
II System Model and Problem Formulation
Consider a slotted data network with normalized time slots . This network is represented by a graph , where is the set of nodes and is the set of directed links. Let and . This network is shared by end-to-end sessions denoted by a set . For each end-to-end session , the source node and destination node are given but the routes are not specified. Each session has a continuous and concave utility function that represents the “satisfaction” received by accepting amount of data for session into the network at each slot. Unlike [5, 10] where is assumed to be differentiable and strongly concave, this paper considers general concave utility functions , including those that are neither differentiable nor strongly concave. Formally, each utility function is defined over an interval , called the domain of the function. It is assumed throughout that either or , the latter being important for proportionally fair utilities [17] that have singularities at .
Denote the capacity of link as and assume it is a fixed and positive constant.111As stated in [10], this is a suitable model for wireline networks and wireless networks with fixed transmission power and orthogonal channels. Define as the amount of session ’s data routed at link that is to be determined by our algorithm. Note that in general, the network may be configured such that some session is forbidden to use link . For each link , define as the set of sessions that are allowed to use link . The case of unrestricted routing is treated by defining for all links .
Note that if with , then and can also be respectively written as and . For each node , denote the sets of its incoming links and outgoing links as and , respectively. Note that and are the decision variables of a joint rate control and routing algorithm. If the global network topology information is available, the optimal joint rate control and routing can be formulated as the following multi-commodity network flow problem:
[TABLE]
where is an indicator function; (2) represents the node flow conservation constraints relaxed by replacing the equality with an inequality, meaning that the total rate of flow into node is less than or equal to the total rate of flow out of the node (since, in principle, we can always send fake data for departure links when the inequality is loose); and (3) represents link capacity constraints. Note that for each flow , there is no constraint (2) at its destination node since all incoming data are consumed by this node.
The above formulation includes network utility maximization with fixed paths as special cases. In the case when each session only has one single given path, e.g., the network utility maximization problem considered in [18], we could modify the sets used in constraints (4) and (5) to reflect this fact. For example, if link is only used for sessions and , then . Similarly, the case [16] where each flow is restricted to using links from a set of predefined paths can be treated by modifying the sets accordingly. See Appendix A for more discussions.
The solution to problem (1)-(6) corresponds to the optimal joint rate control and routing. However, to solve this convex program at a single computer, we need to know the global network topology and the solution is a centralized one, which is not practical for large data networks. As observed in [9, 5, 7, 10], various versions of backpressure algorithms can be interpreted as distributed solutions to problem (1)-(6) from first order Lagrangian dual type methods.
Assumption 1
(Feasibility) Problem (1)-(6) has at least one optimal solution vector .
Assumption 2
(Existence of Lagrange multipliers) Assume the convex program (1)-(6) has Lagrange multipliers attaining the strong duality. Specifically, define convex set . Assume there exists a Lagrange multiplier vector such that
[TABLE]
where q(\boldsymbol{\lambda})=\sup_{[x_{f};\mu_{l}^{(f)}]\in\mathcal{C}}\big{\{}\sum_{f\in\mathcal{F}}U_{f}(x_{f})-\sum_{f\in\mathcal{F}}\sum_{n\in\mathcal{N}\setminus\{\text{Dst}(f)\}}\lambda_{n}^{(f)}\big{[}x_{f}\mathbf{1}_{\{n=\text{Src}(f)\}}+\sum_{l\in\mathcal{I}(n)}\mu_{l}^{(f)}-\sum_{l\in\mathcal{O}(n)}\mu_{l}^{(f)}\big{]}\big{\}} is the Lagrangian dual function of problem (1)-(6) by treating (3)-(6) as a convex set constraint.
Assumptions 1 and 2 hold in most cases of interest. For example, Slater’s condition guarantees Assumption 2. Since the constraints (2)-(6) are linear, Proposition 6.4.2 in [19] ensures that Lagrange multipliers exist whenever constraints (2)-(6) are feasible and when the utility functions are either defined over open sets (such as with ) or can be concavely extended to open sets, meaning that there is an and a concave function such that whenever .222If , such concave extension is possible if the right-derivative of at is finite (such as for or ). Such an extension is impossible for the example because the slope is infinite at . Nevertheless, Lagrange multipliers often exist even for these utility functions, such as when Slater’s condition holds [19].
Fact 1
(Replacing inequality with equality) If Assumption 1 holds, problem (1)-(6) has an optimal solution vector such that all constraints (2) take equalities.
Proof:
Note that each can appear on the left side in at most one constraint (2) and appear on the right side in at most one constraint (2). Let be an optimal solution vector such that at least one inequality constraint (2) is loose. Note that we can reduce the value of on the right side of a loose (2) until either that constraint holds with equality, or until reduces to [math]. The objective function value does not change, and no constraints are violated. We can repeat the process until all inequality constraints (2) are tight. ∎
III The New Backpresure Algorithm
III-A Discussion of Various Queueing Models
At each node, an independent queue backlog is maintained for each session. At each slot , let be the source session rates; and let be the link session rates. Some prior work enforces the constraint (2) via virtual queues of the following form:
[TABLE]
While this virtual equation is a meaningful approximation, it differs from reality in that new injected data are allowed to be transmitted immediately, or equivalently, a single packet is allowed to enter and leave many nodes within the same slot. Further, there is no clear connection between the virtual queues in (7) and the actual queues in the network. Indeed, it is easy to construct examples that show there can be an arbitrarily large difference between the value in (7) and the physical queue size in actual networks (see Appendix B).
An actual queueing network has queues with the following dynamics:
[TABLE]
This is faithful to actual queue dynamics and does not allow data to be retransmitted over multiple hops in one slot. Note that (8) is an inequality because the new arrivals from other nodes may be strictly less than because those other nodes may not have enough backlog to send. The model (8) allows for any decisions to be made to fill the transmission values in the case that , provided that (8) holds.
This paper develops an algorithm that converges to the optimal utility defined by problem (1)-(6), and that produces worst-case bounded queues on the actual queueing network, that is, with actual queues that evolve as given in (8). To begin, it is convenient to introduce the following virtual queue equation
[TABLE]
where represents a virtual queue value associated with session at node . At first glance, this model (9) appears to be only an approximation, perhaps even a worse approximation than (7), because it allows the values to be negative. Indeed, we use only as virtual queues to inform the algorithm and do not treat them as actual queues. However, this paper shows that using these virtual queues to choose the decisions ensures not only that the desired constraints (2) are satisfied, but that the resulting decisions create bounded queues in the actual network, where the actual queues evolve according to (8). In short, our algorithm can be faithfully implemented with respect to actual queueing networks, and converges to exact optimality on those networks.
The next lemma shows that if an algorithm can guarantee virtual queues defined in (9) are bounded, then actual physical queues satisfying (8) are also bounded.
Lemma 1
Consider a network flow problem described by problem (1)-(6). For all and , let be decisions yielded by a dynamic algorithm. Suppose , , evolve by (7)-(9) with initial conditions . If there exists a constant such that , then
* for all .* 2. 2.
* for all .*
Proof:
Fix . Define an auxiliary virtual queue that is initialized by and evolves according to (9). It follows that . Since by assumption, we have . This implies that also satisfies:
[TABLE]
which is identical to (8) except the inequality is replaced by an equality. Since ; and satisfies (10), by inductions, .
Since and , we have . It follows that . 2. 2.
The proof of part (2) is similar and is in Appendix C.
∎
III-B The New Backpressure Algorithm
In this subsection, we propose a new backpressure algorithm that yields source session rates and link session rates at each slot such that the physical queues for each session at each node are bounded by a constant and the time average utility satisfies
[TABLE]
where are from the optimal solution to (1)-(6). Note that Jensen’s inequality further implies that
[TABLE]
The new backpressure algorithm is described in Algorithm 1. Similar to existing backpressure algorithms, the updates in Algorithm 1 at each node are fully distributed and only depend on weights at itself and its neighbor nodes. Unlike existing backpressure algorithms, the weights used to update decision variables and are not the virtual queues themselves, rather, they are augmented values equal to the sum of the virtual queues and the amount of net injected data in the previous slot . In addition, the updates involve an additional quadratic term, which is similar to a term used in proximal algorithms [15].
III-C Almost Closed-Form Updates in Algorithm 1
This subsection shows the decisions and in Algorithm 1 have either closed-form solutions or “almost” closed-form solutions at each iteration .
Lemma 2
Let denote the solution to (12)-(13).
Suppose and is differentiable. Let . If , then ; otherwise is the root to the equation and can be found by a bisection search. 2. 2.
Suppose and for some weight . Then:
[TABLE]
Proof:
Omitted for brevity. ∎
The problem (14)-(17) can be represented as follows by eliminating , completing the square and replacing maximization with minimization. (Note that .)
[TABLE]
Lemma 3
The solution to problem (18)-(20) is given by where can be found either by a bisection search (See Appendix D) or by Algorithm 2 with complexity .
Proof:
A similar problem where (19) is replaced with an equality constraint in considered in [20]. The optimal solution to this quadratic program is characterized by its KKT condition and a corresponding algorithm can be developed to obtain its KKT point. A complete proof is presented in Appendix D. ∎
Note that step (3) in Algorithm 2 has complexity and hence the overall complexity of Algorithm 2 is dominated by the sorting step (2) with complexity .
IV Performance Analysis of Algorithm 1
IV-A Basic Facts from Convex Analysis
Definition 1** (Lipschitz Continuity)**
Let be a convex set. Function is said to be Lipschitz continuous on with modulus if there exists such that for all .
Definition 2** (Strongly Concave Functions)**
Let be a convex set. Function is said to be strongly concave on with modulus if there exists a constant such that is concave on .
By the definition of strongly concave functions, it is easy to show that if is concave and , then is strongly concave with modulus for any constant .
Lemma 4
Let be a convex set. Let function be strongly concave on with modulus and be a global maximum of on . Then, for all .
IV-B Preliminaries
Define column vector . For each , define column vector
[TABLE]
which is composed by the control actions appearing in each constraint (2); and introduce a function with respect to as
[TABLE]
Thus, constraint (2) can be rewritten as
[TABLE]
Note that each vector is a subvector of and has length where is the degree of node (the total number of outgoing links and incoming links) if node is the source of session ; and has length if node is not the source of session .
Fact 2
Each function defined in (24) is Lipschitz continuous with respect to vector with modulus
[TABLE]
where is the degree of node .
Proof:
This fact can be easily shown by noting that each is a linear function with respect to vector and has at most non-zero coefficients that are equal to . ∎
Note that virtual queue update equation (9) can be rewritten as:
[TABLE]
and weight update equation (11) can be rewritten as:
[TABLE]
Define
[TABLE]
and call it a Lyapunov function. In the remainder of this paper, double summations are often written compactly as a single summation, e.g.,
[TABLE]
Define the Lyapunov drift as
[TABLE]
The following lemma follows directly from equation (25).
Lemma 5
At each iteration in Algorithm 1, the Lyapunov drift is given by
[TABLE]
Proof:
Fix and , we have
[TABLE]
where (a) follows from (25).
By the definition of , we have
[TABLE]
where (a) follows from (29). ∎
Define . At each time , consider choosing a decision vector that includes elements in each subvector to solve the following problem:
[TABLE]
The expression (30) is a modified drift-plus-penalty expression. Unlike the standard drift-plus-penalty expressions from [4], the above expression uses weights , which arguments each by , rather than virtual queues . It also includes a “prox”-like term that penalizes deviation from the previous vector. This results in the novel backpressure-type algorithm of Algorithm 1. Indeed, the decisions in Algorithm 1 were derived as the solution to the above problem (30)-(31). This is formalized in the next lemma.
Lemma 6
At each iteration , the action jointly chosen in Algorithm 1 is the solution to problem (30)-(31).
Proof:
The proof involves collecting terms associated with the and decisions. See Appendix E for details. ∎
Furthermore, the next lemma summarizes that the action jointly chosen in Algorithm 1 provides a lower bound for the drift-plus-penalty expression at each iteration .
Lemma 7
Let be an optimal solution to problem (1)-(6) given in Fact 1, i.e., . If , where is the degree of node , then the action jointly chosen in Algorithm 1 at each iteration satisfies
[TABLE]
where \Phi[t]=\sum_{f\in\mathcal{F},n\in\mathcal{N}}\big{(}\alpha_{n}\mathbf{1}_{\{n\neq\text{Dst}(f)\}}\|\mathbf{y}_{n}^{(f),\ast}-\mathbf{y}_{n}^{(f)}[t]\|^{2}+\alpha_{n}\mathbf{1}_{\{n=\text{Dst}(f)\}}\sum_{l\in\mathcal{I}(n)}(\mu_{l}^{(f),\ast}-\mu_{l}^{(f)}[t])^{2}\big{)}.
Proof:
See Appendix F. ∎
It remains to show that this modified backpressure algorithm leads to fundamentally improved performance.
IV-C Utility Optimality Gap Analysis
Define column vector \mathbf{Q}[t]=\big{[}Q_{n}^{(f)}[t]\big{]}_{f\in\mathcal{F},n\in\mathcal{N}\setminus\{\text{Dst}(f)\}} as the stacked vector of all virtual queues defined in (9). Note that (27) can be rewritten as . Define vectorized constraints (2) as .
Lemma 8
Let be an optimal solution to problem (1)-(6) given in Fact 1, i.e., . If in Algorithm 1, where is the degree of node , then for all ,
[TABLE]
where \zeta=\Phi[-1]=\sum_{f\in\mathcal{F},n\in\mathcal{N}}\big{(}\alpha_{n}\mathbf{1}_{\{n\neq\text{Dst}(f)\}}\|\mathbf{y}_{n}^{(f),\ast}\|^{2}+\alpha_{n}\mathbf{1}_{\{n=\text{Dst}(f)\}}\sum_{l\in\mathcal{I}(n)}(\mu_{l}^{(f),\ast})^{2}\big{)} is a constant.
Proof:
By Lemma 7, we have . Summing over yields
[TABLE]
where (a) follows from the fact that .
Recall , simplifying summations and rearranging terms yields
[TABLE]
where (a) follows from the fact that and . ∎
The next theorem summarizes that Algorithm 1 yields a vanishing utility optimality gap that approaches zero like .
Theorem 1
Let be an optimal solution to problem (1)-(6) given in Fact 1, i.e., . If in Algorithm 1, where is the degree of node , then for all , we have
[TABLE]
where is a constant defined in Lemma 8. Moreover, if we define , then
[TABLE]
Proof:
Recall that . By Lemma 8, we have
[TABLE]
where (a) follows from the trivial fact that .
Dividing both sides by a factor yields the first inequality in this theorem. The second inequality follows from the concavity of and Jensen’s inequality. ∎
IV-D Queue Stability Analysis
Lemma 9
Let be the virtual queues in Algorithm 1. For any ,
[TABLE]
Proof:
This lemma follows directly from the fact that and queue update equation (9) can be written as . ∎
The next theorem shows the boundedness of all virtual queues in Algorithm 1.
Theorem 2
Let be an optimal solution to problem (1)-(6) given in Fact 1, i.e., , and be a Lagrange multiplier vector given in Assumption 2. If in Algorithm 1, where is the degree of node , then for all ,
[TABLE]
where is a constant defined in Lemma 8.
Proof:
Let q(\boldsymbol{\lambda})=\sup_{\mathbf{y}\in\mathcal{C}}\big{\{}f(\mathbf{y})-\boldsymbol{\lambda}^{\mkern-1.5mu\mathsf{T}}\mathbf{g}(\mathbf{y})\big{\}} be the Lagrangian dual function defined in Assumption 2. For all , by Assumption 2, we have
[TABLE]
where follows from the definition of . Rearranging terms yields
[TABLE]
Fix . Summing over yields
[TABLE]
where (a) follows form Lemma 9 and (b) follows from Cauchy-Schwarz inequality.
On the other hand, by Lemma 8, we have
[TABLE]
Combining the last two inequalities and cancelling the common terms yields
[TABLE]
where (a) follows from the basic inequality for any .
Thus, for any and , we have
[TABLE]
∎
This theorem shows that the absolute values of all virtual queues are bounded by a constant from above. By Lemma 1 and discussions in Section III-A, the actual physical queues evolving via (8) satisfy . This is summarized in the next corollary.
Corollary 1
Let be an optimal solution to problem (1)-(6) given in Fact 1, i.e., , and be a Lagrange multiplier vector given in Assumption 2. If in Algorithm 1, where is the degree of node , then all actual physical queues in the network evolving via (8) satisfy
[TABLE]
where is a constant defined in Lemma 8.
IV-E Performance of Algorithm 1
Theorems 1 and 2 together imply that Algorithm 1 with can achieve a vanishing utility optimality gap that decays like , where is number of iterations, and guarantees the physical queues at each node for each session are always bounded by a constant that is independent of the utility optimality gap.
This is superior to existing backpressure algorithms from [5, 4, 10] that can achieve an utility gap only at the cost of an or queue length, where is an algorithm parameter. To obtain a vanishing utility gap, existing backpressure algorithms in [5, 4, 10] necessarily yield unbounded queues. To obtain a vanishing utility gap, existing backpressure algorithms in [5, 4] yield unbounded queues. We also comment that queue bound in the primal-dual type backpressure algorithm [5] is actually of the order where is the Lagrangian multiplier vector attaining strong duality and is a constant determined by the problem parameters. A recent work [21] also shows that the queue bound in the backpressure algorithm from drift-plus-penalty is of the order where is also a constant determined by the problem parameters. Since is a constant vector independent of , both algorithms are claimed to have or queue bounds. By Corollary 1, Algorithm 1 guarantees physical queues at each node are bounded by , where is constant given a problem. Thus, the constant queue bound guaranteed by Algorithm 1 is typically smaller than the or queue bounds from [5] and [21] even for a small . (A small can yield a poor utility performance in the backpressure algorithms in [5, 4].)
V Numerical Experiment
In this section, we consider a simple network with nodes and links and sessions as described in Figure 1. This network has two sessions: session from node to node has utility function and session from node to node has utility function . (The log utilities are widely used as metrics of proportional fairness in the network [17].) The routing path of each session is arbitrary as long as data can be delivered from the source node to the destination node. For simplicity, assume that each link has capacity . The optimal source session rate to problem (1)-(6) is and and link session rates, i.e., static routing for each session, is drawn in Figure 2.
To compare the convergence performance of Algorithm 1 and the backpressure algorithm in [4] (with the best utility-delay tradeoff among all existing backpressure algorithms), we run both Algorithm 1 with \alpha_{n}=\frac{1}{2}\big{(}d_{n}+1),\forall n\in\mathcal{N} and the backpressure algorithm in [4] with to plot Figure 3. It can be observed from Figure 3 that Algorithm 1 converges to the optimal source session rates faster than the backpressure algorithm in [4]. The backpressure algorithm in [4] with takes around iterations to converges to source rates close to while Algorithm 1 only takes around iterations to converges to (as shown in the zoom-in subfigure at the top right corner.) In fact, the backpressure algorithm in [4] with can not converge to the exact optimal source session rate but can only converge to its neighborhood with a distance gap determined by the value of . This is an effect from the fundamental utility-delay tradeoff of the the backpressure algorithm in [4]. In contrast, Algorithm 1 can eventually converge to the the exact optimal source session rate . A zoom-in subfigure at the bottom right corner in Figure 1 verifies this and shows that the source rate for Session in Algorithm 1 converges to while the source rate in the backpressure algorithm in [4] with oscillates around a point slightly larger than .
Corollary 1 shows that Algorithm 1 guarantees each actual queue in the network is bounded by constant . Recall that the backpressure algorithm in [4] can guarantee the actual queues in the network are bounded by a constant of order . Figure 4 plots the sum of actual queue length at each node for Algorithm 1 and the backpressure algorithm in [4] with and . (Recall a larger in the backpressure algorithm in [4] yields a smaller utility gap but a larger queue length.) It can be observed that Algorithm 1 has the smallest actual queue length (see the zoom-in subfigure) and the actual queue length of the backpressure algorithm in [4] scales linearly with respect to .
VI Conclusion
This paper develops a new first-order Lagrangian dual type backpressure algorithm for joint rate control and routing in multi-hop data networks. The new backpressure algorithm can achieve vanishing utility optimality gaps and finite queue lengths. This improves the state-of-art or utility-delay tradeoff attained by existing backpressure algorithms [5, 9, 7, 10].
Appendix A Network Utility Maximization with Predetermined Multi-Path
Consider multi-path network utility maximization in [16] where each session has multiple given paths, then the source session rate in problem (1)-(6) becomes a vector where is the set of paths for session and the link session rate becomes a vector . Define as the set of paths for session that are allowed to use link . Note that if all paths of session are forbidden to use link , then . The multi-path network utility maximization problem can be formulated as follows:
[TABLE]
The above formulation is in the form of problem (1)-(6) except that the variable dimension is extended.
Appendix B An Example Illustrating the Possibly Large Gap between Model (7) and Model (8)
Consider a network example shown in Figure 5. The network has nodes where only node [math] is a destination; and and can have exogenous arrivals. Assume all link capacities are equal to 1; and the exogenous arrivals are periodic with period 2k, as follows:
- •
Time slot : One packet arrives at node .
- •
Time slot : One packet arrives at node .
- •
- •
Time slot : One packet arrives at node .
- •
Time slot : One packet arrives at node .
- •
Time slot : One packet arrives at node .
- •
- •
Time slot : One packet arrives at node .
Under dynamics (7), each packet arrives on its own slot and traverses all links of its path to exit on the same slot it arrived. The queue backlog in each node is [math] for all time.
Under dynamics (8), the first packet arrives at time slot to node . This packet visits node at time slot , when the second packet also arrives at . One of these packets is delivered to node at time slot , and another packet also arrives to node . The nodes do not have any exogenous arrivals and act only to delay the delivery of all packets from the ai nodes. It follows that the link from node to node [math] will send exactly one packet over each slots . Similarly, the link from to [math] sends exactly one packet to node [math] over each of these same slots. Thus, node [math] receives packets on each slot , but can only output packet per slot. The queue backlog in this node grows linearly and reaches at time . Thus, the backlog in node [math] can be arbitrarily large when is large. This example demonstrates that, even when there is only one destination, the deviation between virtual queues under dynamics (7) and actual queues under dynamics (8) can be arbitrarily large, even when the in-degree and out-degree of and an in-degree of at most .
Appendix C Proof of Part (2) in Lemma 1
Fix . By (10),
[TABLE]
where (a) follows from the fact that are non-negative. Note that the right side of the above equation is identical to the right side of (7) and recall that . By inductions, we have . Since and , we have . It follows that .
Appendix D Proof of Lemma 3
Note that problem (18)-(20) satisfies Slater’s condition. So the optimal solution to problem (18)-(20) is characterized by KKT conditions [22]. Introducing Lagrange multipliers for inequality constraint and for inequality constraints . Let and be any primal and dual pair with the zero duality gap. By KKT conditions, we have z_{k}^{\ast}-a_{k}+\theta^{\ast}-\nu_{k}^{\ast}=0,\forall k\in\{1,2,\ldots,K\};\sum_{k=1}^{K}z_{k}^{\ast}\leq b;\theta^{\ast}\geq 0;\theta^{\ast}\big{(}\sum_{k=1}^{K}z_{k}^{\ast}-b\big{)}=0;z_{k}^{\ast}\geq 0,\forall k\in\{1,2,\ldots,K\};\nu_{k}^{\ast}\geq 0,\forall k\in\{1,2,\ldots,K\};\nu_{k}^{\ast}z_{k}^{\ast}=0,\forall k\in\{1,2,\ldots,K\}.
Eliminating in all equations yields \theta^{\ast}\geq a_{k}-z_{k}^{\ast},k\in\{1,2,\ldots,K\};\sum_{k=1}^{K}z_{k}^{\ast}\leq b;\theta^{\ast}\geq 0;\theta^{\ast}\big{(}\sum_{k=1}^{K}z_{k}^{\ast}-b\big{)}=0;z_{k}^{\ast}\geq 0,\forall k\in\{1,2,\ldots,K\};(z_{k}^{\ast}-a_{k}+\theta^{\ast})z_{k}^{\ast}=0,\forall k\in\{1,2,\ldots,K\}.
For all , we consider and separately:
If , then holds only when , which by implies that . 2. 2.
If , then is impossible, because implies that , which together with contradicts the slackness condition . Thus, if , we must have .
Summarizing both cases, we have , where is chosen such that , and \theta^{\ast}\big{(}\sum_{k=1}^{K}z_{k}^{\ast}-b\big{)}=0.
To find such , we first check if . If is true, the slackness condition \theta^{\ast}\big{(}\sum_{k=1}^{K}z_{k}^{\ast}-b\big{)} is guaranteed to hold and we need to further require . Thus if and only if . Thus, Algorithm 2 check if holds at the first step and if this is true, then we conclude and we are done!
Otherwise, we know . By the slackness condition \theta^{\ast}\big{(}\sum_{k=1}^{K}z_{k}^{\ast}-b\big{)}=0, we must have . To find such that , we could apply a bisection search by noting that all are decreasing with respect to .
Another algorithm of finding is inspired by the observation that if , then . Thus, we first sort all in a decreasing order, say is the permutation such that ; and then sequentially check if is the index such that and . To check this, we first assume is indeed such an index and solve the equation to obtain ; (Note that in Algorithm 2, to avoid recalculating the partial sum for each , we introduce the parameter and update incrementally. By doing this, the complexity of each iteration in the loop is only .) then verify the assumption by checking if , and . The algorithm is described in Algorithm 2 and has complexity . The overall complexity is dominated by the step of sorting all .
Appendix E Proof of Lemma 6
The objective function (30) can be rewritten as
[TABLE]
where (a) follows from the fact that and ; and (b) follows by collecting each linear term and each quadratic term . Note that each link session rate appears twice with opposite signs in the summation term \sum_{f\in\mathcal{F},n\in\mathcal{N}\setminus\{\text{Dst}(f)\}}W_{n}^{(f)}[t]\big{(}x_{f}\mathbf{1}_{\{n=\text{Src}(f)\}}+\sum_{l\in\mathcal{I}(n)}\mu_{l}^{(f)}-\sum_{l\in\mathcal{O}(n)}\mu_{l}^{(f)}\big{)} unless link flows into and recall that . The quadratic terms are collected in a similar way. Note that the term introduced to the objective function (30) is necessary to guarantee each quadratic term with the same link index but different flow indices have the same coefficient in the last line of (32).
Note that equation (32) is now separable for each scalar and vector . Thus, problem (30)-(31) can be decomposed into independent smaller optimization problems in the form of problem (12)-(13) with respect to each scalar , and in the form of problem (14)-(17) with respect to each vector .
Appendix F Proof of Lemma 7
Note that appears as a known constant in (12). Since is concave and is linear, it follows that (12) is strongly concave with respect to with modulus . Since is chosen to solve (12)-(13), by Lemma 4, , we have
[TABLE]
Similarly, we know (14) is strongly concave with respect to vector with modulus . By Lemma 4, , we have
[TABLE]
Recall that each column vector defined in (23) is composed by control actions that appear in each constraint (2); column vector is the collection of all control actions; and . Summing term (33)-I over all and term (34)-I over all and using an argument similar to the proof of Lemma 6 (Recall that is jointly chosen to minimize (30) by Lemma 6.) yields
[TABLE]
Recall that \Phi[t]=\sum_{f\in\mathcal{F},n\in\mathcal{N}}\big{(}\alpha_{n}\mathbf{1}_{\{n\neq\text{Dst}(f)\}}\|\mathbf{y}_{n}^{(f),\ast}-\mathbf{y}_{n}^{(f)}[t]\|^{2}+\alpha_{n}\mathbf{1}_{\{n=\text{Dst}(f)\}}\sum_{l\in\mathcal{I}(n)}(\mu_{l}^{(f),\ast}-\mu_{l}^{(f)}[t])^{2}\big{)}. Summing term (33)-II over all and term (34)-II over all yields
[TABLE]
Combining (33)-(36) and rearranging terms yields
[TABLE]
where (a) follows because and ; (b) follows from the fact that .
Recall that for any . Thus, for all , we have
[TABLE]
Substituting (38) into (37) yields
[TABLE]
where (a) follows from the Fact 2, i.e., each is Lipschitz with modulus and (b) follows because , and \frac{1}{2}\big{(}g_{n}^{(f)}(\mathbf{y}_{n}^{(f)}[t-1])\big{)}^{2}\geq 0.
Subtracting (28) from (39) and cancelling the common terms on both sides yields
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] L. Tassiulas and A. Ephremides, “Stability properties of constrained queueing systems and scheduling policies for maximum throughput in multihop radio networks,” IEEE Transactions on Automatic Control , vol. 37, no. 12, pp. 1936–1948, 1992.
- 2[2] M. J. Neely, “Dynamic power allocation and routing for satellite and wireless networks with time varying channels,” Ph.D. dissertation, Massachusetts Institute of Technology, 2003.
- 3[3] L. Georgiadis, M. J. Neely, and L. Tassiulas, “Resource allocation and cross-layer control in wireless networks,” Foundations and Trends in Networking , 2006.
- 4[4] M. J. Neely, Stochastic network optimization with application to communication and queueing systems . Morgan & Claypool Publishers, 2010.
- 5[5] A. Eryilmaz and R. Srikant, “Joint congestion control, routing, and mac for stability and fairness in wireless networks,” IEEE Journal on Selected Areas in Communications , vol. 24, no. 8, pp. 1514–1524, 2006.
- 6[6] A. L. Stolyar, “Maximizing queueing network utility subject to stability: Greedy primal-dual algorithm,” Queueing Systems , vol. 50, no. 4, pp. 401–457, 2005.
- 7[7] X. Lin and N. B. Shroff, “Joint rate control and scheduling in multihop wireless networks,” in Proceedings of IEEE Conference on Decision and Control (CDC) , 2004.
- 8[8] J.-W. Lee, R. R. Mazumdar, and N. B. Shroff, “Opportunistic power scheduling for dynamic multi-server wireless systems,” IEEE Transactions on Wireless Communications , vol. 5, no. 6, pp. 1506–1515, 2006.
