Provably Safe Cruise Control of Vehicular Platoons
Sadra Sadraddini, Sivaranjani S, Vijay Gupta, Calin Belta

TL;DR
This paper develops provably safe cruise control strategies for autonomous vehicle platoons, ensuring collision avoidance and bounded platoon parameters despite disturbances, using set-invariance theory.
Contribution
It introduces both centralized and distributed control policies for safe platoon motion with formal safety guarantees.
Findings
Guarantees infinite-time collision avoidance.
Ensures platoon length and speed are within bounds.
Validates policies with numerical examples.
Abstract
We synthesize performance-aware safe cruise control policies for longitudinal motion of platoons of autonomous vehicles. Using set-invariance theories, we guarantee infinite-time collision avoidance in the presence of bounded additive disturbances, while ensuring that the length and the cruise speed of the platoon are bounded within specified ranges. We propose (i) a centralized control policy, and (ii) a distributed control policy, where each vehicle's control decision depends solely on its relative kinematics with respect to the platoon leader. Numerical examples are included.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Provably Safe Cruise Control of Vehicular Platoons
Sadra Sadraddini
Sivaranjani S
Vijay Gupta and Calin Belta
Sadra Sadraddini and Calin Belta are with the Department of Mechanical Engineering, Boston University, Boston, MA. {sadra,cbelta}@bu.edu. Sivaranjani S and Vijay Gupta are with the Department of Electrical Engineering, University of Notre Dame, South Bend, IN. {sseethar,vgupta2}@nd.edu
Calin Belta and Sadra Sadraddini were partially funded by NSF grants CPS-1446151 and CMMI-1400167. Vijay Gupta and Sivaranjani S were partially funded by the AFOSR grant FA 9550-15-1-0186 and NSF grants ECCS-1550016 and CNS 1239224. Sivaranjani S was partially supported by the Schlumberger Foundation Faculty for the Future Fellowship.
Abstract
We synthesize performance-aware safe cruise control policies for longitudinal motion of platoons of autonomous vehicles. Using set-invariance theories, we guarantee infinite-time collision avoidance in the presence of bounded additive disturbances, while ensuring that the length and the cruise speed of the platoon are bounded within specified ranges. We propose (i) a centralized control policy, and (ii) a distributed control policy, where each vehicle’s control decision depends solely on its relative kinematics with respect to the platoon leader. Numerical examples are included.
Index Terms:
Autonomous vehicles; Constrained control; Robust control
I Introduction
Platooning strings of autonomous vehicles equipped with sensing, communication and computation capabilities has two promising benefits. First, there is a potential to dramatically increase traffic flow capacity since autonomous vehicles can move with small separations at high velocities [1], a behavior that cannot be safely executed by human drivers. Second, platooning with tight spacing impacts the aerodynamics of vehicles in a manner that decreases fuel consumption [2].
Control-theoretic aspects of platoons have been studied for several decades (see [3] and [4] for a survey). The typical control objective has been asymptotic string stability, that is, the convergence of inter-vehicular spacings to desired set-points, specified in terms of constant spacing [5, 6] or a constant time headway [9, 10]. String stability has been studied for various information architectures such as predecessor following [11, 12], leader and predecessor following [8, 13], and nearest neighbor interaction [14, 15, 16]. methods have also been proposed to deal with exogenous disturbances [14, 17]. Recently, multi-agent control paradigms have been employed to analyze vehicular platoons from a scalability perspective for various information topologies [18] and design distributed controllers to enhance stability margins in large platoons [19].
While asymptotic string stability is important to guarantee that small disturbances do not propagate along the platoon, it is more important to guarantee the safety of the platoon in the sense that vehicles do not collide with each other. Early works on safe longitudinal platooning [20, 21, 22] focused on simple two-car scenarios where the input to the leader vehicle is considered as an adversarial input and the objective of the follower vehicle is to avoid collision with the leader for all allowable behaviors of the leader. Such a formalism may be too conservative since it does not assume intelligent control over the leader. A similar paradigm was considered in [23, 24] for platooning of heavy duty trucks, where the platoon is considered safe, if for all possible behaviors of a vehicle in the platoon, there exist control inputs for other vehicles to avoid collisions. A control invariant set was synthesized to restrict the evolution of the system accordingly.
Instead of considering driver behavior of the predecessor vehicle as adversarial, a more realistic and less conservative approach is to consider platooning when fast, time varying disturbances act in an adversarial manner on the vehicles. Disturbances are physically attributed to hard-to-model nonlinear effects such as changes of gears, side wind effects and actuator imperfections. Further, adversarial disturbances may also be used, to an extent, to capture the effect of communication delays and other delays in the platoon dynamics. The objective, then, is to design controllers such that collisions are avoided for all times and all allowable disturbances. It is well known that such a formulation is closely related to optimal control [25], viability theory [26] and set-invariance methods [27]. Controllers satisfying set-invariance specifications are often nonlinear even if the system is linear [28], while no invariance inducing linear controller may exist. Therefore, stabilizing linear controllers may not be sufficient for platoon safety. While recent works like [29], [30] guarantee safety by designing controllers to bound the spacing error of vehicles, these methods do not provide specific bounds on the control inputs. Therefore, exhaustive simulations and tuning of parameters are required to verify whether the platoon operation is collision-safe and actuator limits are respected.
We provide a formal correct-by-design approach to synthesize platoon safety control policies using set-invariance methods. We model each vehicle as a discrete-time double integrator subject to bounded additive disturbances and bounded control inputs. We specify the system requirements as infinite-time collision avoidance while restricting the length and the cruise speed of the platoon to user-defined ranges. We propose both centralized and distributed optimal control policies. For the centralized case, we compute a robust control invariant (RCI) set inside the safe set corresponding to the platoon specifications. We use the computational methods from [31] to synthesize RCI sets by solving a single linear program, which has polynomial complexity in the size of platoon. However, this method is still not scalable to large platoons. Therefore, using ideas from separable RCI sets [32], we propose a (conservative) distributed control policy with complexity, where each vehicle’s control decision depends solely on its relative kinematics with respect to the platoon leader.
This paper is organized as follows. In Sec. II, we formulate the problem. In Sec. III, we use set invariance theories to design centralized and distributed controllers that solve the safety problem, and provide numerical simulations.
Notation
We denote the sets of real and non-negative real numbers by and , respectively. We denote the vector of all ones by and the identity matrix in by . Given a set , its cardinality is denoted by . For , the projection of on the -th Cartesian direction is denoted by . Given sets , their Minkowski sum is represented by . Given a matrix and , is interpreted as . Given sets , , we use to denote Cartesian product . Given functions , , we denote the product function as , such that .
II Problem Statement and Approach
Consider a platoon of autonomous vehicles moving on a single lane road. The index of the leader vehicle is designated to be [math] and the indices of the follower vehicles are in increasing order (see Fig. 1). The discrete-time dynamics of each vehicle is modeled as:
[TABLE]
where , and are the absolute (in the ground (road) frame) position (measured as the front of the vehicle), velocity and control input respectively, and are the additive disturbances affecting the position and the velocity respectively, and is the sampling time. The platoon velocity is defined as the velocity of the leader, denoted by , and the platoon length is defined as the difference between the positions of the leader and the -th vehicle, . We denote the length of the ’th vehicle by . The platoon length is physically lower-bounded by .
The control input of the platoon is defined as , where each vehicle has a bounded range of admissible control inputs:
[TABLE]
Similarly, we define the platoon disturbance vector as , where the admissible range of disturbances on each vehicle is bounded:
[TABLE]
Note that and , and both are hyper-rectangles in their respective domains. The bounds in may depend on the vehicle velocities, leading to nonlinearities in (1). For simplicity, we assume that the bounds in are constant and known given a nominal operating point of (1). While the determination of these bounds is out of scope of this paper, a suitable experimental setup can be used to compute a (tight) hyper-rectangle containing all measured disturbances. The exact knowledge of is not required. As it will be clear from subsequent discussion, over-approximating retains collision avoidance guarantees, but is conservative. Under-approximation, on the other hand, may lead to collision.
We assume that the vehicles do not have (accurate, real-time) access to their absolute positions, but have real-time access to relative distances using on-board radars and relative velocities. Vehicles can measure their absolute velocities using on-board speedometers. Relative velocities can then be computed from data acquired by Vehicle-to-Vehicle (V2V) communication. We define the relative position and velocity of the -th vehicle with respect to the leader to be
[TABLE]
Substituting (4) in (1), we obtain:
[TABLE]
The platoon state is defined as the vector , . The evolution of the platoon state is given by:
[TABLE]
where , and are constant matrices obtained from (5).
Remark 1** (Information Structure)**
The requirement that all vehicles possess information about the leader’s position and velocity may lead to performance issues in large platoons due to communication delays. However, taking into account imperfections through disturbances can potentially allow the use of larger sampling times , thereby accommodating greater communication times and hence, longer platoons. Future work will involve relaxing the V2V communication demand on the platoon by considering local information from preceding and following vehicles.
We now state the problem addressed in this paper.
Problem 1
Given a platoon of autonomous vehicles whose dynamics is described by (6), a desired upper-bound for platoon length and a desired range of platoon speed , find a feedback control policy such that:
- •
collisions are avoided111Theoretically, collision-avoidance inequality constraints are strict. However, our computational methods (e.g., linear programming) can only handle non-strict inequalities. In practice, due to over-approximations of the disturbances, inequality constraints are satisfied with some margin and non-strict formulation does not raise a problem.:
[TABLE]
- •
the platoon length is bounded:
[TABLE]
- •
the platoon velocity is bounded:
[TABLE]
for all times and for all allowable sequences of admissible disturbances acting on the platoon. Denote by the set of all control policies such that (7)-(9) are satisfied. If , select an optimal control policy such that , where is a cost function.
Note that we have not specified desired speed ranges for the follower vehicles, as these naturally follow from the bounds on the platoon length and the collision avoidance requirements. Nevertheless, speed specifications for individual vehicles can be easily accommodated in our framework. Constraints (7)-(9) define a convex safe set . The problem of finding a safe control policy is equivalent to finding a robust control invariant set inside , which is formalized in Section III-A.
III Safe Cruise Control
We provide two solutions to Problem 1. First, we propose a centralized solution where is computed by a central coordinator (which can be placed on one of the vehicles, say the leader). Even though the computation of this solution has polynomial complexity in the size of platoon, it is not applicable to very large platoons. Second, we provide a distributed solution in which the control decision for each of the following vehicles is computed solely using their kinematics with respect to the leader.
III-A Controlled Invariance
Definition 1
A set is a robust control invariant (RCI) set for system (6) if
[TABLE]
Given a RCI set , it is well known that there exists a unique, maximal RCI which contains all the RCI sets in . The computation of requires implementing the well known fixed-point algorithm [28]:
[TABLE]
where . We have . The algorithm terminates at if , which concludes . Then, we have:
[TABLE]
Notice that if and only if . There is no guarantee that there exists a finite to terminate (10). Therefore, determining a safe control policy belongs to the semi-decidable class of decision problems in the sense that the corresponding algorithm eventually terminates if a safe control policy exists, but may run indefinitely if one does not exist. In addition to the termination issue, implementing (10) is computationally difficult even for small dimensions due to the quantifier elimination process and numerical issues that arise from handling a large number of constraints with many potential redundancies [28]. Therefore, we follow a different approach to find a RCI set for (6).
The idea is to find a RCI set instead of the maximal RCI set. Therefore, safety is guaranteed (soundness) but completeness may be lost. There exist methods that (under-)approximate the maximal RCI set [27], [33]. We use the method introduced by Raković et al. [31] to compute a RCI set.
Due to space limitations, we briefly explain the method to find a RCI set for a linear system characterized by the pair and the additive disturbance set . For system (6), we have and .
Consider the following set:
[TABLE]
where is an offset vector, is the matrix of parameters and , where is an integer, greater than the controllability index of the pair , that determines the number of parameters characterizing the RCI set, and is the contraction factor.
Proposition 1
[31]** The set is a RCI set in if there exists such that
[TABLE]
where is an offset vector for the controls.
It can be shown that checking the feasibility of the constraints in (12), alongside , is equivalent to feasibility checking of a linear program (LP). Moreover, by solving the corresponding LP with a zero cost vector or some ad-hoc cost criteria222See [31] for some useful cost vector suggestions., numerical values for and that characterize the RCI set are found333Our software implementing the method in [31] is publicly available at https://github.com/sadraddini/PARCIS. In many cases, the RCI sets computed using this method finely under-approximate, or are exactly equal to, the maximal RCI set. However, this method has the disadvantage of not providing an explicit representation for the RCI set, instead providing an implicit representation in high dimensions. Nevertheless, the invariance-inducing controller can be computed in high dimensions and mapped back to the original dimensions. The details of this method are not provided here as they are well documented in [31].
In this paper, the admissible disturbance set plays a major role in determining whether a safe control policy exists. We let
[TABLE]
where is a fixed set (assumed to be a hyper-rectangle, as previously stated), and is a scalar that is used to scale the set. Using the bisection method over , we search for the largest , denoted by , such that a RCI set exists. In this context, is the largest set of admissible disturbances that the platoon can accommodate. We denote the corresponding RCI set by .
III-B Centralized Control
We compute a RCI set . Define
[TABLE]
For all , constraints (7)-(9) are satisfied for all times if , where is the initial state. We select the suboptimal invariance-inducing control policy such that (global optimality is achieved if , or ). It can be shown that the resulting control policy is a piecewise affine function of state, which can be computed explicitly in an offline manner using parametric programming [34] or solved online using a linear/quadratic program. This control policy is centralized and the control decision is computed using the full knowledge of the state of the system.
Example 1
Consider a platoon of vehicles. Let and the desired platoon speed range be . We assume , , and . We use the method in [31] with and .
Let and let the upper-bound for platoon length be . Note that the platoon length cannot be smaller than m. The projection of the safe set onto the - space is the following triangle:
[TABLE]
We found (with precision). The projection of on -* is shown in Fig. 2.* 2. 2.
Let and . We found . We simulated the system for 120 time steps, which equals 60 seconds. The initial conditions were the centers of the intervals for allowed spacings and speeds. The disturbances at each time were selected randomly from the set , with a bias toward the boundary of (to simulate the system under heavy disturbances). We used , which penalizes the total control effort. Thus, the control inputs at each time were computed using the following optimization problem:
[TABLE]
*which is mapped into a quadratic program and is feasible by construction for all . The results in Fig. 2 show that the system specifications are met. The headways (the distances of the fronts of the vehicles from the backs of the vehicles immediately preceding them) are always greater than zero. Thus, collision avoidance is ensured. The platoon speed, length and controls are bounded within their specified ranges. The bottom plot in Fig. 2 shows the trajectory of the vehicles in the leader frame, where the vertical axis is time and the horizontal axis is space. Each vehicle is depicted by a black rectangle. The leader, which is the vehicle on the right, is fixed in this frame and the follower vehicles move due to the disturbances. * 3. 3.
We study how and the computation time of vary with . We keep the density constant and vary . We used the Gurobi **[35*]** LP solver on a dual core 2.8GHz iMac. The results, shown in Table I, indicate that grows with , even though is constant. For example, 4 vehicles following a leader within **m can accommodate larger disturbance magnitudes than 8 vehicles within *m. This is attributed to the advantage gained from coordinating the vehicles in a centralized manner. We also empirically observe that converges to a particular value as increases.
III-C Distributed Control
In this section, we propose a distributed solution to Problem 1. As discussed earlier, our approach requires V2V communication to measure relative distances and velocities. Note that distributed sensing is usually much cheaper than distributed decision making. We follow a divide and conquer approach to distributed control. We impose restrictions on the evolution of each vehicle such that control decisions can be made without a central coordination. The main advantage of our distributed framework is that the computation of control policies has complexity.
We make the following observations from inspecting (5). The state of each vehicle is not directly influenced by other vehicles (matrix is block-diagonal). However, the control decision and disturbances of the leader affects all the follower vehicles. Define and , as the new control input and disturbances,, respectively, of the follower vehicles. Also, define , , and and as the admissible sets of and , respectively. Note that . Now, and are no longer in hyperrectanglular form similar to (2) and (3), but are still polyhedrons. We examine the relationship between and - the case of and is similar but in higher dimensions. We have , where
[TABLE]
A graphical representation of the transformation of a hyper-rectangle under the linear transformation is illustrated in Fig. 3. In order to obtain hyper-rectangular admissible sets, we under-approximate and over-approximate by hyper-rectangles and , respectively (see Fig. 3). Over/under-approximating polyhedra by hyper-rectangles is conservative but facilitates a decentralized architecture since hyper-rectangles are cartesian products of intervals. Therefore, we add conservatism but retain the hyper-rectangular structure of the set of admissible controls and disturbances. Basic properties of hyper-rectangles imply:
[TABLE]
Next, we under-approximate the non-rectangular constraints in , i.e., the constraints in (7), by rectangular constraints. For simplicity, we assume , for the rest of the paper. We define by replacing (7) with:
[TABLE]
The constraints in (16) can be physically interpreted as envelopes for each vehicle’s position with respect to the leader (see Fig. 4). Every vehicle has to remain within its envelope for all times. Note that follows from (16). From a divide and conquer point of view, every vehicle assumes that its neighbors do not enter its envelope (assumption), while every vehicle promises to not move beyond its envelope (guarantee). Therefore, correctness is guaranteed with no centralized coordination. Note that the computations of ’s are independent, but the computation of each (actual control input) requires the control input of the leader, . Hence every vehicle has to communicate with the leader. Having fully decoupled the dynamics and all constraints for each follower vehicle, we compute separate RCI sets , where , \mathbb{S}_{i}=\Big{\{}(\tilde{x}_{i},\tilde{v}_{i})\big{|}il+(i-1)\frac{L-Nl}{N}\leq\tilde{x}_{i}\leq il+i\frac{L-Nl}{N}\Big{\}}, and we have
[TABLE]
We then define the invariance inducing control policy Finding the RCI set for the leader, where , we have:
[TABLE]
Similarly, we define the invariance inducing control policy
Proposition 2
We have , where , and is a RCI set in .
Proposition 3
We have .
Proof:
(sketch) Correctness follows from the fact that we have under-approximated the admissible control inputs and over-approximated the admissible disturbances. ∎
Example 2
*Consider the specification in Example 1. We implement the procedure explained in this section to design a distributed control policy. For every vehicle , the admissible set for is
- ranges halved as compared to in Example 1 - and the admissible sets for and are and , respectively - ranges doubled as compared to in Example 1. The computation time is negligible as the dimension of each RCI set is 2 (1 for the leader). We vary the density , find the corresponding and compare the result to the one obtained using the centralized control policy (Fig. 5). It is observed that the distributed control architecture can accommodate a smaller magnitude of disturbances. We expect the ranges in to have an increasing relationship with average speed of the platoon. Therefore, Fig. 5 suggests that platoons that move faster should have wider inter-vehicular spacings to be considered safe.*
We end this section with a final remark. Since the control inputs and disturbances of the leader affect all the vehicles in the platoon, a perfect leader with zero disturbances (and subsequently controls, as explained shortly) will eliminate the dynamical coupling between the vehicles in the platoon. This observation leads to the following statement.
Proposition 4** (Perfect Leader)**
Consider a platoon with identical follower vehicles and disturbances as described in Section II. If there are no disturbances acting on the leader, that is, , then the s for the centralized and distributed control policies are equal.
Proof:
(sketch) If there are no disturbances acting on the leader, then is a valid control policy that maintains the platoon speed within the desired range, provided that it is initially within the range. With , the only coupling between the vehicles is through the collision constraints in (7), which are replaced by (16) in the distributed architecture. We need to show that centralized controllers cannot handle larger by violating (16). Suppose a valid centralized control policy violating (16). It follows from (8) and Fig. 4 that at least one vehicle (e.g., the last vehicle) has to remain in its envelope when other vehicles move out of theirs. As the dynamics are independent, there exists a control policy to keep the (last) vehicle inside its envelope. Since the vehicles are identical and the disturbances are independent of each other, such a control policy should be valid to keep any vehicle in its envelope. Therefore, the existence of any centralized policy implies the existence of distributed control policies. Hence, a distributed control policy must exist for the same . ∎
IV Conclusion and Future Work
We utilized results in set-invariance to formally synthesize platoon cruise control policies subject to collision avoidance requirements. We found a centralized policy as well as a computationally attractive (but conservative) distributed policy. We will consider more relaxed platoon information structures such as predecessor-following in future work.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] P. Varaiya, “Smart cars on smart roads: problems of control,” IEEE Transactions on automatic control , vol. 38, no. 2, pp. 195–207, 1993.
- 2[2] A. Al Alam, A. Gattami, and K. H. Johansson, “An experimental study on the fuel reduction potential of heavy duty vehicle platooning,” in Intelligent Transportation Systems (ITSC), 2010 13th International IEEE Conference on . IEEE, 2010, pp. 306–311.
- 3[3] M. R. Jovanović, “Vehicular chains,” Encyclopedia of Systems and Control , pp. 1524–1531, 2015.
- 4[4] S. E. Li, Y. Zheng, K. Li, and J. Wang, “An overview of vehicular platoon control under the four-component framework,” in Intelligent Vehicles Symposium (IV), 2015 IEEE . IEEE, 2015, pp. 286–291.
- 5[5] R. Caudill and W. Garrard, “Vehicle-follower longitudinal control for automated transit vehicles,” Journal of Dynamic Systems, Measurement, and Control , vol. 99, no. 4, pp. 241–248, 1977.
- 6[6] D. Swaroop and J. Hedrick, “Constant spacing strategies for platooning in automated highway systems,” Journal of dynamic systems, measurement, and control , vol. 121, no. 3, pp. 462–470, 1999.
- 7[7] J. Hedrick, M. Tomizuka, and P. Varaiya, “Control issues in automated highway systems,” Control Systems, IEEE , vol. 14, no. 6, pp. 21–32, 1994.
- 8[8] S. Sheikholeslam and C. A. Desoer, “Control of interconnected nonlinear dynamical systems: The platoon problem,” Aut. Control, IEEE Trans. on , vol. 37, no. 6, pp. 806–810, 1992.
