Approximate Gradient Descent Convergence Dynamics for Adaptive Control on Heterogeneous Networks
Jean Carpentier, Sebastien Blandin

TL;DR
This paper studies the convergence of adaptive control algorithms on complex networks, revealing limitations of standard methods and proposing rescaling techniques that improve system optimality, supported by theoretical analysis and extensive simulations.
Contribution
It introduces a rescaling approach to enhance backpressure algorithms for heterogeneous networks, with detailed convergence analysis and validation on synthetic and real-world network models.
Findings
Rescaling improves system optimality by a factor of up to O(k).
Theoretical convergence properties are established for generalized backpressure algorithms.
Simulations confirm the theoretical results on synthetic and Manhattan grid networks.
Abstract
Adaptive control is a classical control method for complex cyber-physical systems, including transportation networks. In this work, we analyze the convergence properties of such methods on exemplar graphs, both theoretically and numerically. We first illustrate a limitation of the standard backpressure algorithm for scheduling optimization, and prove that a re-scaling of the model state can lead to an improvement in the overall system optimality by a factor of at most depending on the network parameters, where characterizes the network heterogeneity. We exhaustively describe the associated transient and steady-state regimes, and derive convergence properties within this generalized class of backpressure algorithms. Extensive simulations are conducted on both a synthetic network and on a more realistic large-scale network modeled on the Manhattan grid on which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAge of Information Optimization · Smart Grid Energy Management · Advanced Wireless Network Optimization
Approximate Gradient Descent Convergence Dynamics
for Adaptive Control on Heterogeneous Networks
Jean Carpentier1 and Sebastien Blandin2
1Ecole Polytechnique, Palaiseau, France
2IBM Research, Singapore
[email protected], [email protected]
Abstract
Adaptive control is a classical control method for complex cyber-physical systems, including transportation networks. In this work, we analyze the convergence properties of such methods on exemplar graphs, both theoretically and numerically. We first illustrate a limitation of the standard backpressure algorithm for scheduling optimization, and prove that a re-scaling of the model state can lead to an improvement in the overall system optimality by a factor of at most depending on the network parameters, where characterizes the network heterogeneity. We exhaustively describe the associated transient and steady-state regimes, and derive convergence properties within this generalized class of backpressure algorithms. Extensive simulations are conducted on both a synthetic network and on a more realistic large-scale network modeled on the Manhattan grid on which theoretical results are verified.
Introduction
We consider the scheduling problem on queuing networks, and specifically on urban road networks. Concretely, the problem consists of the allocation of time slots to traffic lights at intersections. While the routing policy may impact the stability of the scheduling solution [2], the routing problem is often considered decoupled [15] and is not addressed here.
The computational complexity of the scheduling problem on large-scale road networks has motivated the search for efficient decentralized algorithms only requiring local knowledge of network properties and efficient in the absence of coordination. Such decentralized approaches have proven quite efficient in practice [18, 27], have connections with fluid dynamic models [4], and are amenable to agent-based learning methods such as reinforcement learning [16].
In the context of communication networks, the backpressure algorithm [21] provides a throughput-maximizing control policy, i.e. a policy that guarantees that given any feasible flow, the maximal network queue size is asymptotically bounded. Furthermore, the backpressure policy requires only evaluation of queue size on neighboring road links.
Several properties of the backpressure algorithm make it appealing for adaptive control of dynamical road networks. First, the backpressure solution is a policy, which by definition is able to handle variability of the network state. Second it can be implemented and deployed in a fully decentralized manner since it requires only local information. Lastly, it comes with theoretical guarantees on the queue size, and has been shown to perform very well in practice.
In the context of intelligent transport systems, significant research efforts have been dedicated to extensions of the backpressure algorithm in recent years, in particular to address the specifics of traffic light scheduling on road networks [23]. The case of unknown routing rates was also investigated [8], as well as the case of queues with finite capacity [9], see also [11] and [12].
In-depth theoretical analysis of the backpressure properties, such as its behavior under heavy load conditions [3], or its link with game theory [22] have also been investigated [17, 14]. Similar results exist for the closely related max weight algorithm [20] and an adapted backpressure algorithm [24] has been devised in this context. The backpressure algorithm was also shown to be a greedy gradient descent over the quadratic potential [26]. In this context, acceleration methods have been proposed [29]. We refer to [13] for application to wireless sensor networks.
One of the most explicit drawbacks of the backpressure algorithm occurs at a network scale: in steady state, for certain model networks, queue sizes strictly decrease from the origin to the destination along every possible path [5], [28], meaning that commuters traveling over longer paths incur longer queues. In this body of work, adjustments have been proposed via the consideration of an additional design cost explicitly accounting for path lengths, hence attempting to compensate that drawback. Other limitations have been investigated for specific network configurations [19].
In this work, we propose to analyze the convergence dynamics of the backpressure algorithm. Specifically, we are interested in a fine-grained analysis of the backpressure algorithm in different regimes, and associated convergence properties. We first explicitly re-cast the backpressure algorithm as a more general approximate gradient descent method, and show that in that class of methods, significant performance gaps exist depending on the choice of parameters. These general conclusions are derived on the case of a fundamental building block for network flow analysis, namely the network including two upstream links connected to one downstream link.
We then conduct an in-depth theoretical analysis of the network, and verify these results in simulation. We confirm experimentally that the results from the theoretical analysis obtained on a simple network apply to realistic networks such as the Manhattan grid.
The main contributions of this work include:
- •
illustration of arbitrarily large performance gaps in the backpressure class of adaptive control algorithms,
- •
theoretical identification and characterization of transient and stationary regimes of the network under backpressure algorithms,
- •
numerical validation of theoretical properties and illustration on realistic networks such as the Manhattan grid.
The rest of this article is organized as follows. We first introduce notations and formulate the problem considered. We then present our main results on the convergence dynamics on the network. We subsequently analyze the relative convergence of two instances of approximate gradient descent algorithms. Finally, we present detailed numerical results of the algorithm performance, and conclude.
Preliminaries
Network model
We consider a discrete-time network of queues with denoting the (continuous) number of vehicles queuing at location at time with the intention of traveling to the downstream link next. In the transportation context, each represents a distinct queue of vehicles waiting to cross an intersection with segregated movements (e.g. turn left, go straight, turn right). We also note the vector of and omit the time dependency for compactness. Queuing networks can also model public transport and multi-modal networks [10], although the emergence of mobile data often requires hybrid approaches [1].
The outflow of queue at time is the maximum number of vehicles able to cross the intersection within a time slot, defined as the minimum of the queue size and the queue capacity, assumed static:
[TABLE]
When this minimum is reached at the queue size , the intersection is in unsaturated regime, and when the minimum is reached at the queue capacity , the intersection is in saturated regime.
Given initial conditions for the queues, and prescribed source and sink flows specifying the number of vehicles entering and leaving the network, the conservation of vehicles reads:
[TABLE]
where is the proportion of vehicles reaching node intending to visit node next, and such that , and is a control variable specifying whether queue is activated, i.e. has green light, at time . For each intersection, the activation set follows standard constraints encoding compatible movements, (e.g. in the case of left-hand driving, turn right movements can be activated simultaneously, but not go straight and turn left movements).
Given an objective function satisfying Lyapunov properties, the scheduling problem is concerned with the design of an activation policy with good properties with respect to .
Backpressure algorithm
In this section we recall some existing results. The backpressure algorithm [21] provides maximal throughput stability in the sense that if the inputs flows are feasible in expectation, then the queue sizes are asymptotically bounded.
Definition 1**.**
The backpressure policy is the solution to the maximization problem :
[TABLE]
It can be shown, e.g. see [25], that this objective function leads to activating at each decision point the traffic movement maximizing the difference between its upstream queue and its downstream queue, hence the term “backpressure”.
We first show that the back pressure policy, although arising from a local greedy formulation, corresponds to an approximate gradient descent step.
Proposition 1**.**
The backpressure algorithm (3) is an approximate gradient descent step update on the objective function .
Proof.
If we note , the one-step temporal difference in the objective function reads:
[TABLE]
Expanding using the conservation equation (2) we can re-write as:
[TABLE]
where E is the vector of , and the time-dependence is omitted on the right-hand side. In the saturated regime, the first term on the right-hand side of equation (4) dominates, and a steepest gradient descent step on the approximate temporal difference reads:
[TABLE]
Approximating the throughput (1) as leads to the definition of the backpressure (3), which corresponds to making the approximation that the queues are in the saturated regime. ∎
Motivated by expression (5), in the following we define the priority of a queue as:
[TABLE]
This view of backpressure as a general one step update for an approximate gradient descent in the context of adaptive control motivates us to consider a generalization of the objective function via re-scaling. Specifically, given , we consider a generalized objective function associated with the generalized priorities:
[TABLE]
We now illustrate that this re-scaling can impact the performance of the approximate gradient descent method by an arbitrary factor depending on the network heterogeneity. We focus the analysis on the comparison between two values of , the case of which corresponds to the classical backpressure, and the case of which corresponds to a variant of the backpressure algorithm where time spent in the queue is the quantity to be optimized (since is the steady-state saturated regime approximation of time spent in the queue).
Heterogeneous Flows
We now define a simple but fundamental example and show the limitations of the backpressure algorithm on that case. Consider a simple network with upstream queues , and downstream queue , where heterogeneity between the upstream queues is parameterized by a factor . This topology corresponds to the classical merge junction in traffic engineering [6], see [7] for the underlying mathematical theory of network fluid-dynamics model.
Given a reference capacity , link capacities are defined as . Link inflows are defined as . Without loss of generality, we also assume that the downstream queue is constant since the main object of this study is the competing dynamics of upstream queues given an arbitrary downstream queues. For stability, we also make the classical assumption that the inflow is lower than the uniform capacity, i.e. .
We now prove that the classical backpressure algorithm (3) can perform arbitrary poorly for heterogeneous networks. We present the results for the case .
Proposition 2**.**
Consider the network from Figure 1, with and . A basic scheduling alternating activation of each upstream queue leads to:
[TABLE]
while the backpressure activation rule (3) yields:
[TABLE]
Proof.
First, given that in this discrete time setting each vehicle spends at least one time step in the queue, we have . Here, since the demand is feasible, an alternating schedule would result in , and similarly for , which proves (8).
Second, on this example, the backpressure priorities from equation (6) read . Since , we have . The queue is activated only if , or equivalently . When goes below that value, queue is activated because it has higher priority.
Hence the minimal queue size over time under backpressure is times the queue size under the alternating schedule. A similar analysis can be conducted with non-zero , the result is obtained for a sufficient upstream queue heterogeneity compared to the downstream queue value. ∎
In the rest of the article, we investigate the convergence dynamics of the approximate gradient descent dynamics under the generalized backpressure priorities (7).
Stability Domain
We first characterize the asymptotic convergence of the parametric network from Figure 1. To simplify the notations on this network, we index the upstream queues by .
Regime Types
For , the demand is feasible, capacities exceed input flows, which means that at least one queue is activated more often than needed to process the input flow, which means . For the junction, either only one upstream queue is in the unsaturated regime, and we call this network regime R1, or both upstream queues are in the unsaturated regime, and we call this network regime .
Due to space limitation, we focus on the regime which exhibits more complex behavior (indeed under the condition regime can be proven to be a transient regime evolving into eventually) and serves as illustrative example of a limitation of the classical backpressure algorithm in the example of previous section. With a slight abuse of notations, we use the saturation state, s for saturated and u for unsaturated, to index the queue (e.g. when queue is unsaturated and queue is in the saturated state).
A re-scaled backpressure algorithm
A discussed earlier, a queue is bounded below by a which is the value of its input flow:
[TABLE]
The expressions of the associated minimal priority can be derived by instantiating equation (7) with the priority value (10):
[TABLE]
In order for a queue to be activated, its priority must be at least greater than the minimal priority of every competing queue. We call this a-priori minimal priority to be activated:
[TABLE]
and the expression associated follows from (7) as:
[TABLE]
In regime one queue is saturated and the other queue is unsaturated. It follows from (12) that:
[TABLE]
where is such that . If , meaning that queue is in the saturated state, then equation (13) simplifies to , which is impossible by assumption. Hence the queue must be in the unsaturated state when reaching its activation priority, and the other queue is in the saturated state:
[TABLE]
We can now express sufficient conditions for each of the queues to be in saturated or unsaturated state, depending on the exogenous network parameters.
Proposition 3**.**
In the regime, the following states can exist:
- •
* when the following conditions are satisfied:*
- –
for :
- –
for :
- •
* when the following conditions are satisfied:*
- –
for :
- –
for :
Proof.
The bounds follow from instantiating equation (13) on the cases and . ∎
We represent the different phases characterized by Proposition 3 in Figure 2.
We observe that with the proposed non-uniform weights , the saturation region decreases, which suggests a better utilization of the network capacity.
Convergence properties
In this section we characterize the transient and stationary phases of the system, and provide theoretical results on the impact of the weights on the limit of the gradient descent.
Transient state
In regime , under the assumption that , we first prove that in general, there exists a transient regime during which the maximal priority decreases.
Definition 2**.**
The rolling min-max over 2 time slots reads:
[TABLE]
Lemma 16 provides conditions to ensure the overall decrease in queue size during the transient phase.
Lemma 1**.**
For the network, with , we have:
[TABLE]
Furthermore, with and the other queue, and with , we have :
[TABLE]
Proof.
The technical proof can be found in the supplemental material.
∎
We can now use the specificity of the R1 regime to describe the dynamics of the network:
Corollary 1**.**
If , i.e. , the saturated queue is activated for the time slot and:
[TABLE]
and
[TABLE]
In other words is activated at most once consecutively and it is followed by two activations of .
Proof.
The proof is obtained via a disjunction over the values of , details included in the supplemental material.
∎
Corollary 17 states first that the end of the transient state is related to the time when the unsaturated queue takes its minimal value. Equation (17) describes the queuing process in steady state: the saturated queue cannot be activated more than once consecutively. This implies that the unsaturated queue remains very close to its minimum value.
Characterization of the steady state
We now generalize the result from Corollary 17.
Proposition 4**.**
The steady state is reached after a finite time and is characterized by:
[TABLE]
where is defined by (14).
Proof.
First we prove that if queue is regularly activated then strictly decreases overtime. It eventually reaches the minimal value .
Lemma 16 states that is non-increasing therefore bounded above by . The input flow within one time slot is bounded (actually equal to ) so between two time slots, remains bounded above with . In the network, is affine of , so queues are also bounded above. Hence there exists an interval such that every queue is activated at least once within this interval otherwise the constants input flows would make it diverge.
Let us consider such an interval and a time in that interval at which is activated. Equation (15) states that is non increasing. Corollary 17 states that while , is decreasing strictly. Let us consider the other case, for which , corresponding to according to (14). We have that :
- •
if , equivalently and (16) yields strictly decreasing,
- •
else, , , so which is the lower bound.
In any case, either decreases or has reached its lower bound after the activation of , which means that in finite time, is arbitrarily close to . ∎
It follows that being constant bounds the size of the queues below and above. Experimentally, these bounds are quite tight, see Figure 3.
Theorem 1**.**
In the regime, given , the network converges to a steady state where under a generalized backpressure algorithm with :
- •
for the unsaturated queue :
[TABLE]
- •
and for the saturated queue:
[TABLE]
with
[TABLE]
Proof.
The proof consists of expanding the results from Proposition 4.
For the unsaturated queue we obtain:
[TABLE]
Equation (14) states that Consequently the queues associated with the priorities from equation (18) read:
[TABLE]
and the lower bound corresponds to (10).
For the saturated queue:
[TABLE]
We define as well the upper bound of s.t. . Consequently:
[TABLE]
For the minimum , we note that it is reached after an activation done with (defined as the minimal priority to be activated):
[TABLE]
This corresponds to associated queue values and the result follows. ∎
We illustrate the results of this section using a numerical simulation reported in Figure 3.
After a transient state corresponding to priority values higher than , the system stabilizes to a periodic state. Priorities oscillate around and is constant, as expected from the theory.
Total time spent in the network
We now derive the total time spent in the network, assuming that the average queue size is the average of the bounds from Theorem 1.
Definition 3**.**
The average queue size of is approximated by:
[TABLE]
The queue size translates directly into time spent given our implicit choice of a unit time step. We can now compare the classical backpressure algorithm and the proposed backpressure algorithm based on total time spent in the network.
Theorem 2**.**
In regime , the ratio of total time spent in the network as the heterogeneity increases is:
- •
with :
- •
with :
Proof.
We expand the expression of total time spent from Definition 3 using the detailed values from Theorem 1. ∎
Numerical Results
We first introduce the experimental setup, then go over results of benchmark experiments, and finally analyze performance on scenarios mimicking realistic conditions.
Experimental setup
We consider a heterogeneous Manhattan grid wherein some links have high capacity (major arterial roads) and other links have low capacity (secondary arterial roads). At each junction, distinct movements are authorized (left, straight, right) with the straight movement having double capacity.
The average demand for each origin node, destination node pair is drawn from an exponential distribution. For every origin-destination pair, the path minimizing the travel-time at the speed limit is computed, and flow is assigned accordingly.
Routing rates at each junction for the aggregate flow are computed from the full assignment, by computing the proportion of flow using each movement compared to the incoming flow. Input flow is drawn from a Poisson law with mean defined for each origin-destination pair as explained above.
Performance metrics such as time spent in the network are computed by summing over time the sizes of queues.
Benchmark experiments
In this section we perform controlled experiments to investigate and validate various properties of the proposed algorithm, compared to the classical backpressure algorithm. We consider a Manhattan grid with a major arterial every 5 blocks, and a time step of seconds.
First we analyze the impact of the parameter characterizing the magnitude of the demand. Figure 4 displays the ratio of time spent in the network for time steps.
These results confirm the theoretical results on the existence of three regimes depending on the values taken by : for low demand (low values), the flow is not really constrained so both algorithms have similar performance, for demand around and slightly above capacity there is little available supply and the new algorithm improves on the classical backpressure, and for higher demand the network is too saturated to leave room for optimization and both algorithms have similar performance. In this setting, the network is unstable for . The improvement is at most .
Second we analyze the impact of the parameter characterizing the network heterogeneity, with being the density of major arterial roads; corresponds to no major arterial, corresponds to all roads being major arterial, 2 to one over 2, etc. The results of simulations are presented in Figure 5.
Low values of correspond to a homogeneous network and the experiments confirm () that the proposed algorithm has no significant impact on traffic compared to the classical backpressure. For higher the network is more heterogeneous and as expected the proposed algorithm has better relative performance.
We now analyze the impact of the ratio of the capacities between major arterial roads and secondary arterial roads. The results of the simulations are displayed in Figure 6.
For a homogeneous network (capacity ratio close to ), the performance of both algorithms is similar. The comparative performance of the new algorithm with respect to the regular backpressure increases as the capacity ratio increases. The higher the capacity ratio the higher the flow on the major arterials (because their attractivity increases) so the more flow heterogeneity there is between major arterial and other roads.
These cases correspond to the lower part of Figure 2 for the single junction case where the proposed algorithm was proven theoretically to have better performance. In order to understand how the proposed algorithm achieves better performances we compare the average size of each queue for a heterogeneous network in Figure 7.
The results highlight that the proposed algorithm results in a moderate queue size increase on major arterials but yields significant queue size reduction on secondary arterial roads, meaning that the proposed algorithm is better able to take advantage of the network heterogeneity. In the next section we further explore the algorithm performance in realistic scenarios.
Peak hour scenario
In this section, we analyze a scenario modeled after a peak hour. The specific network considered is a Manhattan network with a grid with a major arterial road every 4 blocks. The factor varies over time in a triangular shape from [math] to to model a demand temporally exceeding the network capacity ( corresponds to an instable network).
Figure 8 illustrates that the excess demand causes the cumulative time spent in the network to increase in all cases. However the increase is less important in the case of the proposed algorithm, in particular when the demand returns to a low value, and subsequently. This can be explained by the fact that in the first part of the peak period, the network is capacity-constrained, hence no algorithm is given sufficient freedom to optimize. However in the second phase of the peak time, and subsequently, the proposed algorithm is able to take better advantage of available capacity.
Incident scenario
The second scenario that we consider is an incident modeled as a link with zero capacity for a one hour period in a Manhattan network. The network size and simulation parameters are chosen such that the boundary links are never impacted, and the incident happens after the network loading period, and clears before the end of the simulation.
We first analyze in Figure 9 the extent to which queues are smoothed in space via the backpressure effect of preserving already large queues from further inflow.
For both the classical backpressure and the proposed algorithm, the queues at the incident location are much lower compared to the fixed cycle policy, which in practice reduces the chances of grid-lock. This is achieved at the cost of having slightly longer queues upstream of the incident.
We now consider the cumulative queue length across the network as a function of time in Figure 10.
We observe that both the classical backpressure and the proposed algorithm outperform the fixed cycle policy, and the proposed algorithm slightly improves on the classical backpressure, with the difference between all methods increasing as the network gets more saturated, since the key benefit of the adaptive scheduling algorithms lies in efficiently using available link capacities around saturated conditions.
Conclusion
In this work we investigated the problem of the convergence properties of an approximate gradient descent method for network adaptive control.
Using a fundamental network, we proved that different regimes exist depending on exogenous parameters such as the magnitude of the demand with respect to the network capacity, and the heterogeneity of the flow across competing links. We characterized each of these regimes theoretically, and verified in simulation on realistic network the expected theoretical properties.
As part of this analysis, we also showed that appropriate calibration of the weights in the objective function can significantly improve the asymptotic objective value, by up to with the ratio of competing queues capacity.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Sebastien Blandin, Laura Wynter, Hasan Poonawala, Sean Laguna, and Basile Dura. FASTER: Fusion Analytic S for public Transport Event Response. In Proceedings of the 18 th Conference on Autonomous Agents and Multi Agent Systems . International Foundation for Autonomous Agents and Multiagent Systems, 2019.
- 2[2] Sebastien Boyer, Sebastien Blandin, and Laura Wynter. Stability of Transportation Networks Under Adaptive Routing Policies. Transportation Research Procedia , 7:578–597, 2015.
- 3[3] Maury Bramson. Stability and heavy traffic limits for queueing networks. In Perplexing Problems in Probability , pages 247–267. Springer, 1999.
- 4[4] Charles Brett, Saif Jabari, Sebastien Blandin, and Laura Wynter. In 95th Annual Meeting of the Transportation Research Board, Washington, D.C. , 2016.
- 5[5] Loc Bui, R Srikant, and Alexander Stolyar. Novel architectures and algorithms for delay reduction in back-pressure scheduling and routing. In INFOCOM 2009, IEEE , pages 2936–2940, 2009.
- 6[6] C. Daganzo. The cell transmission model, part II : Network traffic. Transportation Research Part B: Methodological , 29(2):79–93, 1995.
- 7[7] M. Garavello and B. Piccoli. Traffic flow on networks . American Institute of Mathematical Sciences, Springfield, MO, 2006.
- 8[8] Jean Gregoire, Emilio Frazzoli, Arnaud De La Fortelle, and Tichakorn Wongpiromsarn. Back-pressure traffic signal control with unknown routing rates. IFAC Proceedings Volumes , 47(3):11332–11337, 2014.
