An ADMM Algorithm for MPC-based Energy Management in Hybrid Electric Vehicles with Nonlinear Losses
Sebastian East, Mark Cannon

TL;DR
This paper introduces a convex formulation and an ADMM algorithm for energy management in hybrid electric vehicles, achieving significant speed improvements over traditional dynamic programming methods.
Contribution
It presents a novel convex formulation that linearizes nonlinear dynamics and applies ADMM for efficient optimization in hybrid vehicle energy management.
Findings
ADMM outperforms DP in solution time by up to two orders of magnitude
Convex formulation enables efficient optimization with nonlinear dynamics
Simulation results validate the effectiveness of the proposed method
Abstract
In this paper we present a convex formulation of the Model Predictive Control (MPC) optimisation for energy management in hybrid electric vehicles, and an Alternating Direction Method of Multipliers (ADMM) algorithm for its solution. We develop a new proof of convexity for the problem that allows the nonlinear dynamics to be modelled as a linear system, then demonstrate the performance of ADMM in comparison with Dynamic Programming (DP) through simulation. The results demonstrate up to two orders of magnitude improvement in solution time for comparable accuracy against DP.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
An ADMM Algorithm for MPC-based Energy Management in Hybrid Electric Vehicles with Nonlinear Losses
Sebastian East and Mark Cannon This work was funded by the UK Engineering and Physical Sciences Research Council.S. East and M. Cannon are with the Department of Engineering Science, University of Oxford, OX1 3JP, [email protected],[email protected]
Abstract
In this paper we present a convex formulation of the Model Predictive Control (MPC) optimisation for energy management in hybrid electric vehicles, and an Alternating Direction Method of Multipliers (ADMM) algorithm for its solution. We develop a new proof of convexity for the problem that allows the nonlinear dynamics to be modelled as a linear system, then demonstrate the performance of ADMM in comparison with Dynamic Programming (DP) through simulation. The results demonstrate up to two orders of magnitude improvement in solution time for comparable accuracy against DP.
I INTRODUCTION
Automotive powertrains have become increasingly electrified in recent decades, in an effort to reduce both hydrocarbon consumption and tailpipe emissions [1]. Hybrid electric vehicles are a common configuration as they can reduce fuel consumption through regenerative braking, engine shut down, and reduced engine size, but the second power source introduces an additional degree of freedom that makes the powertrain underconstrained, so the fraction of power delivered by the engine and motor must be controlled. The total energy consumed during a journey can be significantly reduced by varying this fraction [2], but a control system that is optimal (in the sense of minimising fuel consumption) whilst also being implementable is still an open problem [3].
Model Predictive Control (MPC) is a control framework where for each control input update, the solution of an open-loop finite-horizon optimal control problem is solved online, and the first element of the optimal predicted control vector is implemented as the current control input. It has been applied to the hybrid vehicle energy management problem as the optimisation can explicitly account for constraints on system variables, and the recursive feedback provides a degree of robustness to modelling, measurement, and disturbance uncertainty [4, 5].
An important issue with MPC is the online solution of the optimal control problem: it must capture the important dynamics of the controlled system, but the optimisation problem may be high-dimensional, constrained, and possibly nonconvex. Dynamic Programming (DP) has been used to provide the globally minimising argument [6], however the approach is too slow to be implemented in real time and is computationally expensive. Conversely, linearisation has been used to reduce the optimisation to a quadratic program [7], which is a well-known problem and can be solved rapidly, however the linearisation process renders the applied control inputs sub-optimal. To alleviate these difficulties, the problem can be reformulated as a convex optimisation [8, 9, 10]. This allows the dynamics of the system to be accurately modelled using convex functions, and enables a guarantee of convergence to the global minimum.
The first contribution of this paper is an alternative proof of convexity for the energy management problem. Previous work has achieved convexity by relaxing the system dynamic constraints to inequalities, and demonstrating that the optimal solution is obtained when the dynamic constraints are satisfied with equality [8]. We instead show that the problem is convex when expressed in terms of battery power output. This modification allows the dynamics to be represented without approximation as a linear system, which simplifies the implementation of convex optimisation algorithms.
The second contribution is an Alternating Direction Method of Multipliers (ADMM) [11] algorithm for the solution of the convex problem. Our work in [4] demonstrated a projected Newton method that only considers the terminal state of charge constraint, and other work in the area has made use of generic convex optimisation solvers; the method presented here enforces state of charge constraints at all times, and is specifically designed to exploit separability of the cost and dynamics. We have previously demonstrated the properties of the algorithm without the assumption of convexity in [12]. Here, we show the comparative performance against DP in a convex formulation through simulation.
The paper is organised as follows: Chapter 2 describes an MPC framework for energy management, and Chapter 3 provides a proof of convexity for the associated optimisation problem. Chapter 4 details the DP and ADMM algorithms, Chapter 5 presents numerical simulations comparing their performance, and Chapter 6 provides conclusions.
II Model Predictive Control Framework
At each time-instant that the control variable is updated, we assume that a prediction is made of future vehicle velocity, , and road gradient, , at a sampling frequency of 1Hz and a horizon length of . The first derivative of is approximated using central difference, and the predicted driver power demand can be calculated using a backwards-causal longitudinal model
[TABLE]
where is the vehicle mass, is air density, is the drag coefficient, is the frontal area, is the rolling resistance coefficient, and is the acceleration due to gravity.
Figure 1 shows a simplified diagram of a parallel plug-in hybrid electric vehicle powertrain. Fuel power, , is fed to an internal combustion engine that delivers mechanical power, , through a clutch, which can be disengaged so that the engine can idle (or be turned off) and the vehicle can be driven in an all-electric mode. Chemical power, , from the battery is delivered as electrical power, , through a circuit to a motor that delivers mechanical power, , to the drivetrain. This flow of power can be reversed to charge the battery. It is assumed that all drivetrain components are 100% mechanically efficient, so the motor and engine output powers are combined additively through a mechanical coupling device to provide the driver power demand, . We ignore the effect of high speed dynamics such as clutch engagement and engine lag.
A controller is required to regulate the behaviour of this system as it is under-constrained in three degrees of freedom: the fraction of total driver demand power delivered from the engine, the clutch engagement, and the gear selection. Both the clutch disengagement and gear selection (assuming a discrete variable transmission) introduce integer decision variables that make the resulting control problem non-convex (and significantly more complex), so we assume here that these variables are determined by heuristics.
We assume a simple shifting strategy by which the powertrain speed can be found from
[TABLE]
where are constants and is the number of gear ratios. We also assume that when the vehicle is braking, the clutch is open, the engine is idle, and the brakes provide a fixed fraction of the braking force . Finally, we assume that when the demand power is positive and the powertrain speed is lower than the minimum engine speed , the clutch is open and the electric motor provides all of the driving power. Therefore, the engine power fraction only needs to be calculated when the driver power demand is positive, and the powertrain speed is greater than the minimum engine speed. These statements define the sets:
[TABLE]
from which predicted motor and engine speeds are given by
[TABLE]
The engine loss map is modelled as a set of convex quadratic functions of of the form
[TABLE]
for in the interval , and the motor loss map is modelled using convex quadratic functions as
[TABLE]
for in the interval . Loss functions are then found of the form
[TABLE]
by linear interpolation between the loss map coefficients on the basis of and . The battery dynamics are
[TABLE]
where we assume constant and for convexity. Limits on engine power are given for as
[TABLE]
where and are lower and upper limits on engine and motor torque, is the largest real root of , and upper and lower limits on battery state of charge are given by and . The MPC optimisation is then given by
[TABLE]
and the first element of is then applied as the control input to the engine.
III CONVEXITY
We now demonstrate a convex reformulation of (2). Firstly, we restrict the domain of each and so that they are non-decreasing, i.e impose new lower limits:
[TABLE]
This is expected as an increase in motor or engine output power would require an increase in input power, and it is an assumption also made in [8]. This domain restriction ensures that is a one-to-one function, so
[TABLE]
where is given by
[TABLE]
Lemma III.1
Under the assumptions on (convex, twice differentiable, non-decreasing, one-to-one), the inverse map is concave, twice differentiable, and increasing.
Proof:
The assumptions on imply that (where and are the first and second derivatives of ), and, if , then is unique for all . Therefore
[TABLE]
for all , which demonstrates that is differentiable and monotonically increasing, and it is known that
[TABLE]
for all , which demonstrates that is twice differentiable and concave. ∎
Using this definition of , the cost function in problem (2) can be rewritten as
[TABLE]
Lemma III.2
Under the assumptions on (convex, non-decreasing), the function is convex and non-increasing with .
Proof:
The composition of a non-decreasing function and decreasing function is necessarily non-increasing. The concave property of implies that for any (where is a domain such that is concave), and ,
[TABLE]
Therefore, using the properties that is non-decreasing and convex, we have
[TABLE]
for all , which demonstrates that is convex in . ∎
Problem (2) can now be re-written as
[TABLE]
This is now a convex optimisation problem, where
[TABLE]
Existing analyses of convexity for similar optimal energy management problems replace the system dynamic equations, which appear in (2) as equality constraints, with inequalities, and then show that for a given solution the dynamic equations will be satisfied with equality [8]. The formulation presented here is superior for this application as it renders the dynamics as a linear system, which simplifies the implementation of convex optimisation algorithms.
IV OPTIMISATION METHODS
From Lemma III.2 it is known that is non-increasing in , so it can be shown that if , then the minimising argument of (3) is given trivially by , and for . If this is not the case, an optimisation procedure must be applied.
Previous research on convex optimisation for the energy management problem has used general convex optimisation solvers to obtain the control inputs; here, we instead implement an ADMM algorithm similarly to [12]. To demonstrate the relative performance characteristics of the algorithm we compare it in simulation with DP.
IV-A Alternating Direction Method of Multipliers
First, we introduce a dummy variable , and re-write (3) as
[TABLE]
where is an column of ones, is an lower triangular matrix of ones, and an indicator function is defined by
[TABLE]
The augmented Lagrangian associated with problem (4) is
[TABLE]
where are constants discussed in Section V-A. We define projection functions for as
[TABLE]
then the ADMM iteration is given by
[TABLE]
which is initialised with the values
[TABLE]
The variable is included in the formulation to ensure that the update is separable, and the individual updates are convex optimisation problems that we solve using an unconstrained Newton method with a backtracking line-search. The matrix inversion associated with the update can be computed off-line as it involves no decision variables, and the remaining variable updates are trivial. This iteration is repeated until the residuals and , where is the convergence threshold, and
[TABLE]
In [12], we demonstrate that this algorithm will converge to a point satisfying the first order conditions of problem (3), which must be the optimal point in this convex formulation. The solution for problem (2) is then given by
[TABLE]
IV-B Dynamic Programming
Although it is well known that DP is computationally expensive and unlikely to perform as well as a dedicated convex optimisation algorithm, we use it here to demonstrate baseline performance as it is commonly used to solve various forms of the energy management problem and is guaranteed to provide the optimal solution for a sufficiently high mesh density. In the absence of other tailored algorithms for this problem, a comparison with a general purpose convex optimization solver would be similarly flawed, as the results would be obfuscated by the performance of the differing software implementations.
Definition IV.1
Define as a matrix of costs, where is the cost-to-go at state index (we assume the values of are evenly spaced) and timestep index . The function returns the cost-to-go, linearly interpolated from the available values of and , and returns outside the feasible range of . We use the notation to refer to column of .
Definition IV.2
Define as a matrix of optimal control inputs, where the value is the optimal control input at state index and timestep index . The function returns the optimal control input linearly interpolated from the available values of and , and the notation refers to the column of .
Definition IV.3
Define as the vector with elements equal to the discrete, evenly-spaced values that may be assigned to the control input , such that and .
The algorithm is initialised by setting , then for the cost to go is calculated at by solving
[TABLE]
The integer that minimises this function is found by evaluating all possible values, and selecting the minimum. The same method is used to evaluate
[TABLE]
The optimal trajectory and control inputs can be found from
[TABLE]
for .
V SIMULATIONS
We are interested in the comparative performance of ADMM and DP and not the properties of the MPC framework itself, so we only consider single-shot instances of the MPC optimisation with assumed driver demand profiles.
We use the FTP-75 cycle to define a vector, , of velocity samples, and generate a vector , of gradient samples
[TABLE]
for at 1Hz, where is a constant defined for each optimisation. The predicted velocity and gradient vectors used for each problem instance are given by
[TABLE]
where is rounded to the nearest integer, and . The predicted power demand is then calculated from (1). Essentially, is a variable used to modulate the magnitude of the predicted power, and is a variable used to modulate the length of the predicted power vector.
We use a vehicle model to represent a 1800kg parallel plug-in hybrid passenger vehicle with a 100kW petrol internal combustion engine, a 50kW electric motor, and a 21.5Ah lithium-ion battery. We assume a fixed open circuit voltage of 350V and internal resistance of for the battery, and the fixed braking fraction, , is set at 0.4. An initial state of charge is used to represent battery discharge as the drive cycle is shortened, and upper and lower limits on state of charge are set as 62% and 50%.
V-A Optimisation Parameters
For DP we assume that , as the feasible battery storage band is approximately an order of magnitude greater than the maximum power available from the motor, and this ensures that the and meshes have similar numerical density. We then use the value of to modulate the desired accuracy of the solution. From numerical experiments we find that is sufficient to achieve a high accuracy approximation of the global minimum, and all further references to the global minimum/optimum refer to control inputs and state trajectories that are found this way.
For the ADMM algorithm, and must be tuned to match the dynamics of the optimisation problem, and we tune these parameters to the instance where and . First we obtain the global minimum, then run 300 ADMM iterations and record a cost that measures the deviation of the fuel consumption vector at algorithm termination, , from the global optimum fuel consumption vector, , with
[TABLE]
The results shown in Figure 2 illustrate that for this particular instance of the optimisation problem, variations of within two orders of magnitude and variations of within one order of magnitude induce little suboptimality in the resulting optimisation. We use the parameters and for subsequent ADMM optimisations. The value may appear low, however it is appropriate given the relative scaling of the fuel consumption for each timestep (kJ) and the battery capacity (MJ) in (5).
V-B Optimisation with variation in power demand
A practically useful algorithm should demonstrate consistent performance across a broad range of optimisation scenarios, and to test this we generate drive cycles with varied magnitudes using and , then solve using DP with and ADMM with . These ranges of and are chosen to produce solutions with comparable accuracy that are computationally tractable. We run each instance of both algorithms 10 times and record the average time for completion and the deviation from the optimum cost measured by (7). The results are shown in Fig. 3.
The ADMM algorithm solves the optimisation fastest with (the cycle it is tuned to) and is slowest to converge for . Between these scenarios, all results lie within a linear band with a maximum variation in computation time of less than an order of magnitude, displaying a degree of robustness to variations in drive cycle. Comparatively, the results for DP display more consistency as they are contained within a tighter band, but also show worse performance: for the range of parameters tested, DP takes approximately two orders of magnitude more time for a comparative level of accuracy. For example, using the cycle, ADMM terminates with after with , whereas DP takes to achieve using . The bands also diverge, suggesting that ADMM performs even better for higher accuracy, although it is not clear to what extent this relationship can be extrapolated.
V-C Optimisation with variation in prediction horizon
To investigate robustness to variations in horizon length, we generate drive cycles with and , then solve the optimisation and record parameters using the same method as above. The results are shown in Figure 4.
It is shown that the time taken for the ADMM algorithm to terminate decreases dramatically as the length of the horizon is reduced. For example, the time taken with is for and for , then for . As the time taken for DP decreases in a similar manner, the key observation is that the significant speed improvement with ADMM is sustained as the horizon is reduced. This suggests that if the algorithm can deliver a performance benefit for a broad range of drive cycle characteristics at the start of a journey, it will then also deliver the same performance benefit as the journey is completed.
In Figure 5 we show state trajectories for the and cycle that are obtained with DP and ADMM using a comparative level of computational time, as well as the global optimum. These plots clearly illustrate the superior performance of ADMM for this problem, but also show a significant limitation. Whilst the ADMM state trajectory follows the optimum much more closely, it is also shown to violate the SOC constraints by . This is because there is no explicit guarantee in the ADMM termination criteria that the constraints are met: this is only ensured when the primal residual is exactly zero. We do not, however, believe that this is a significant problem with ADMM for this application, as for all optimisations shown in Figures 3 and 4, no state of charge violation was greater that , and this level of violation is likely to be lower in magnitude than the uncertainty in the current state of charge.
VI CONCLUSION
A convex formulation of the MPC optimisation associated with energy management in hybrid electric vehicles is presented, with a demonstration of convexity that allows the state dynamics to be modelled as a linear system. We propose an ADMM algorithm for the solution of this problem, and demonstrate approximately two orders of magnitude improvement in computation time compared to DP. A degree of robustness to variations in the magnitude and length of the predicted drive cycle is also demonstrated.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. A. Malikopoulos, “Supervisory power management control algorithms for hybrid electric vehicles: A survey,” IEEE Transactions on Intelligent Transportation Systems , vol. 15, no. 5, pp. 1869–1885, 2014.
- 2[2] A. Sciarretta and L. Guzzella, “Control of hybrid electric vehicles,” IEEE Control Systems , vol. 27, no. 2, pp. 60–70, 2007.
- 3[3] C. Marina Martinez, X. Hu, D. Cao, E. Velenis, B. Gao, and M. Wellers, “Energy Management in Plug-in Hybrid Electric Vehicles: Recent Progress and a Connected Vehicles Perspective,” IEEE Transactions on Vehicular Technology , vol. 66, no. 6, 2017.
- 4[4] J. Buerger, S. East, and M. Cannon, “Fast dual loop nonlinear receding horizon control for energy management in hybrid electric vehicles,” IEEE Transactions on Control System Technology , vol. PP, no. 99, 2018.
- 5[5] C. Xiang, F. Ding, W. Wang, and W. He, “Energy management of a dual-mode power-split hybrid electric vehicle based on velocity prediction and nonlinear model predictive control,” Applied Energy , vol. 189, pp. 640–653, 2017.
- 6[6] C. Sun, S. J. Moura, X. Hu, J. K. Hedrick, and F. Sun, “Dynamic Traffic Feedback Data Enabled Energy Management in Plug-in Hybrid Electric Vehicles,” IEEE Transactions on Control Systems Technology , vol. 23, no. 3, pp. 1075–1086, 2015.
- 7[7] H. Borhan, A. Vahidi, A. M. Phillips, M. L. Kuang, I. V. Kolmanovsky, and S. Di Cairano, “MPC-based energy management of a power-split hybrid electric vehicle,” IEEE Transactions on Control Systems Technology , vol. 20, no. 3, pp. 593–603, 2012.
- 8[8] B. Egardt, N. Murgovski, M. Pourabdollah, and L. Johannesson Mardh, “Electromobility Studies Based on Convex Optimization: Design and Control Issues Regarding Vehicle Electrification,” IEEE Control Systems , vol. 34, no. 2, pp. 32–49, 2014.
