Recursively Feasible Stochastic Model Predictive Control using Indirect Feedback
Lukas Hewing, Kim P. Wabersich, Melanie N. Zeilinger

TL;DR
This paper introduces a stochastic MPC method for linear systems with unbounded, correlated disturbances, ensuring recursive feasibility and providing performance bounds, demonstrated through building control examples.
Contribution
The paper proposes a recursive feasibility initialization scheme for stochastic MPC that handles correlated disturbances and guarantees chance constraint satisfaction.
Findings
Recursive feasibility is maintained in stochastic MPC with correlated disturbances.
The method achieves an average asymptotic performance bound under i.i.d. disturbance assumptions.
Illustrative examples demonstrate the approach's effectiveness in building control scenarios.
Abstract
We present a stochastic model predictive control (MPC) method for linear discrete-time systems subject to possibly unbounded and correlated additive stochastic disturbance sequences. Chance constraints are treated in analogy to robust MPC using the concept of probabilistic reachable sets for constraint tightening. We introduce an initialization of each MPC iteration which is always recursively feasibility and thereby allows that chance constraint satisfaction for the closed-loop system can readily be shown. Under an i.i.d. zero mean assumption on the additive disturbance, we furthermore provide an average asymptotic performance bound. Two examples illustrate the approach, highlighting feedback properties of the novel initialization scheme, as well as the inclusion of time-varying, correlated disturbances in a building control setting.
| Controller | nom | rec | df |
|---|---|---|---|
| Controller | nom | rec | df | recSC |
|---|---|---|---|---|
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Recursively Feasible Stochastic Model Predictive Control using Indirect Feedback
Lukas Hewing, Kim P. Wabersich, Melanie N. Zeilinger This work was supported by the Swiss National Science Foundation under grant no. PP00P2 157601 / 1.All authors are with the Institute for Dynamic Systems and Control, ETH Zurich. [lhewing|wkim|mzeilinger]@ethz.ch
Abstract
We present a stochastic model predictive control (MPC) method for linear discrete-time systems subject to possibly unbounded and correlated additive stochastic disturbance sequences. Chance constraints are treated in analogy to robust MPC using the concept of probabilistic reachable sets for constraint tightening. We introduce an initialization of each MPC iteration which is always recursively feasibility and thereby allows that chance constraint satisfaction for the closed-loop system can readily be shown. Under an i.i.d. zero mean assumption on the additive disturbance, we furthermore provide an average asymptotic performance bound. Two examples illustrate the approach, highlighting feedback properties of the novel initialization scheme, as well as the inclusion of time-varying, correlated disturbances in a building control setting.
I Introduction
Most real world control applications are subject to uncertainty and external disturbances, which can severely deteriorate both performance and safety of the system. In model predictive control (MPC) this problem can be addressed by explicitly assessing worst-case disturbances, leading to robust MPC approaches [1]. These approaches, however, can be overly conservative, for example in cases where occasional constraint violations are permissible. Using additional information about the disturbances in form of distributions, stochastic MPC offers advantages through a less conservative treatment of constraints, as well as by improving average performance e.g. by optimizing the expected cost [2].
Stochastic MPC approaches can typically be divided into two classes. While randomized methods rely on the generation of suitable disturbance realizations or scenarios, analytic approximation methods reformulate the problem into a deterministic one [3]. In this paper, we consider an analytic approximation for linear discrete-time systems subject to additive disturbances which are potentially correlated in time and have unbounded support.
Introducing feedback from state measurements into the MPC in a stochastic setting leads to the question of recursive feasibility of the underlying optimization problem. Typically, this issue is addressed in one of two ways [4]. Either it is ensured using a robust constraint tightening, which is generally restricted to the case of bounded support disturbance distributions, see e.g. [5, 6, 7], or the original MPC problem is allowed to become infeasible and a suitable recovery mechanism is employed, e.g. in [8, 9, 10]. These recovery mechanisms usually come with a loss of strict guarantees on the closed-loop chance constraint satisfaction [4]. In [11], strict guarantees were recovered under a unimodality assumption on the additive noise distribution. Additionally, a recent approach guarantees recursive feasibility also with unbounded disturbances for the case of suitably discounted violation probabilities [12].
The approach presented in this paper offers strong guarantees w.r.t. closed-loop chance constraint satisfaction and guarantees recursive feasibility, even for unbounded disturbance distributions, while incorporating feedback from the measured state in the MPC problem. To the best of our knowledge, these properties have not been established in previous approaches. The results are achieved by introducing feedback of the currently measured state through the cost function only, while tightened constraints are satisfied w.r.t. a nominal system state , ensuring feasibility. This results in the fact that the system error evolves linearly in closed-loop, facilitating straightforward analysis of performance and chance constraint satisfaction. Related concepts and their implications for stochastic MPC have recently been discussed in [13]. We demonstrate that this form of feedback has strong influence also on the nominal state trajectory and results in closed-loop performance comparable to previous stochastic MPC methods—while requiring significantly fewer assumptions on the disturbance distribution for chance constraint satisfaction. For constraint tightening, we employ techniques related to tube-based MPC [14], making use of the concept of probabilistic reachable sets (PRS) [11, 15, 16]. These PRS take a similar role as robust invariant sets in robust MPC such that they contain the error dynamics with a specified probability. In this paper, we extend this concept to non-zero mean disturbance sequences correlated in time, enabling the treatment of a broad class of problems. We finally demonstrate the flexibility resulting from these properties in a building control task, for which the disturbance sequence is non-i.i.d., non-zero mean, and strongly correlated in time.
II Preliminaries
II-A Notation
We refer to quantities of the system realized in closed-loop at time using parentheses, e.g. is the state measured at time step , while quantities used in the MPC prediction are indexed with subscript, e.g. is the system state predicted time steps ahead. In order to specify the time at which the prediction is made, we use . The weighted 2-norm is , and () refers to a positive (semi-)definite matrix. The notation refers to the Pontryagin set difference. The distribution of a random variable is specified as . The probability density of is denoted , conditioned on another random variable it is . Similarly, probabilities and conditional probabilities are denoted , and the expected value and variance of w.r.t. a random variable are and , respectively. Two random variables , that share the same distribution are equal in distribution, denoted .
II-B Considered System
We consider a linear time-invariant (LTI) system under additive disturbances
[TABLE]
with state , inputs and randomly distributed disturbance realizations taking values in . The system is subject to chance constraints on states and inputs
[TABLE]
where and are convex sets. The probabilities are to be understood w.r.t. knowledge at time step 0, i.e. conditioned on the given initial state. Hard constraints, e.g. on the inputs, can be included in the formulation by imposing a probability of 1. In general, however, these can only be satisfied for disturbance distributions of bounded support.
We consider control problems of arbitrarily large, but finite111We choose a finite control horizon mainly to avoid technicalities. Most properties are easily carried over to an infinite-horizon control problem, e.g. by considering the limit of [17]., horizon and a disturbance sequence distributed according to distribution , of which at least the first two moments are known. Note that the individual disturbances are therefore not necessarily independent, identically distributed (i.i.d.) or zero mean. Using a cost function the resulting stochastic (finite-horizon) optimal control problem can be stated as {mini!}[4] {π_k} E_W ( ∑_k=0^¯N l_k(x(k), u(k)) )
\addConstraintx(k+1)= A x(k) + B u(k) + w(k) \addConstraintu(k) = π_k(x(0),w(0),…,w(k)) \addConstraintW= [w(0)^T, …, w(¯N)^T]^T∼Q^W \addConstraintPr(x(k)∈X — x(0))≥p_x \addConstraintPr(u(k)∈U — x(0)) ≥p_u ,
for all , in which is a sequence of control laws using information up to time step .
This paper presents a feasible approximate solution to control problem (1) based on receding horizon or model predictive control over a shortened horizon . We will show that the receding horizon controller derived in the following sections satisfies all constraints of optimization problem (1), in particular, it satisfies closed-loop chance constraints (2).
In the following section, we recall probabilistic reachable sets (PRS) and extend the concept to non-i.i.d. disturbance sequences. These PRS form the basis for constraint tightening used in the stochastic model predictive control approach presented in Section IV.
III Probabilistic Reachable Sets
The concept of PRS for stochastic MPC was used in [11] and is related to probabilistic set invariance [18, 15, 19]. In the following, we recall definitions of PRS and consider the extension to non-i.i.d., i.e. time-varying and correlated disturbance sequences. Note that there exists rich literature on related problems, in particular for stochastic reachability of hybrid system [16] and stochastic reach-avoid problems [20, 21].
III-A Definitions
For the following definitions, consider a stochastic process
[TABLE]
where takes values in .
Definition 1** (Probabilistic -step reachable set).**
A set with is an -step probabilistic reachable set (-step PRS) of probability level for process (3) initialized at if
[TABLE]
Definition 2** (Probabilistic reachable set).**
A set is a probabilistic reachable set (PRS) of probability level for for process (3) initialized at if
[TABLE]
Note that this probability bound needs to hold at all time steps individually, as opposed to holding all time steps jointly, which would define much more restrictive sets. It is implicit in the definition of PRS that a PRS is an -step PRS for all . Conversely, it follows that a set is a PRS of level if it satisfies
[TABLE]
III-B PRS for State
Consider now the case in which the process (3) is defined through linear dynamics under disturbance sequence , i.e.
[TABLE]
In [11], it is shown that under the assumption of unimodal i.i.d. disturbances , a convex -step PRS is also an -step PRS for all . In this case, the relation (4) simplifies to
[TABLE]
In many applications of PRS, e.g. when using PRS for constraint tightening, it is desirable to find sets that are small in a suitable sense. In the following, we present a variance-based computational method for processes of form (5), which in particular for the special case of Gaussian disturbance sequences provides tight sets, in the sense that any down-scaling of the sets would violate the probability guarantees.
III-C Variance-based PRS Computation
We present a straightforward approach for PRS computation based on mean-variance information of under a correlated disturbance sequence with , . Due to the linear dynamics (5) we have for the sequence that
[TABLE]
with
[TABLE]
Using the multivariate Chebyshev inequality we find that
[TABLE]
is an -step PRS of probability level . The marginal expectation and variance , are directly available from and .
Since in the case of correlated disturbance sequences the i.i.d. assumption does not hold, it is not possible to construct PRS along Definition 2 using (6). Instead, one way to construct an ellipsoidal PRS is to find a minimum size ellipsoid which contains all ellipses , thereby satisfying (4). This can be formulated as a semidefinite program, see e.g. [22].
Remark 1* (Zero mean i.i.d. disturbances).*
When is a zero mean i.i.d. disturbance sequence with , a PRS can be constructed by solving the Lyapunov equation as for arbitrary horizons .
Remark 2* (Gaussian distributions).*
If is normally distributed, can be set to , where is the quantile function of the chi-squared distribution with degrees of freedom, resulting in significantly smaller sets.
III-D PRS for Input
The methods for PRS computations outlined in the previous section are suitable for bounding a state error evolving along (5), and are ultimately used to guarantee chance constraint satisfaction on the states (2a). For the satisfaction of input chance constraints (2b) we need to similarly bound input errors, which we assume to be given by a feedback policy on the state errors . A possibility of computing (-step) PRS for is to transform a PRS on , i.e.
[TABLE]
where . This, however, can lead to significant conservatism, especially if maps to a lower dimensional space, i.e. when , as is often the case.
A less conservative approach is therefore to first construct the distribution (or mean and variance information) of and to find PRS based on this information directly. In the case of linear feedback laws this is often straightforward and PRS construction can be similarly carried out along (7) as well as Remark 1 and 2 by considering and .
IV Stochastic MPC using Probabilistic Reachable Sets
The goal is to find an approximate solution to the optimal control problem in (1) by solving it over a shortened horizon in a receding horizon fashion, i.e. as an MPC controller. To highlight the difference between predicted quantities and the closed-loop quantities in (1) we make use of the subscript , resulting in
[TABLE]
where the predictions dynamics are initialized at the currently measured state at each time step, i.e. . The predicted disturbance sequence includes the information about up to time step . Assuming access to past disturbance realizations it is therefore distributed according to defined by the conditional distribution
[TABLE]
The resulting predicted state sequence is therefore similarly a random variable.
The differences between closed-loop distributions of
[TABLE]
and predictive distributions of
[TABLE]
are illustrated in Figure 1. The upper plot displays different samples of the disturbance sequence as well as a confidence region for the disturbance, such that realizations lie within the shaded red region with a certain probability. Additionally, the predictive distribution at time step , including confidence region and samples, is shown for a particular realization of . It is evident that for highly correlated sequences, the predicted distribution strongly depends on past disturbance realizations.
The lower plot illustrates the resulting state distribution from a dynamic system under disturbance and receding horizon control, including the predictive distribution for a specific realization. Note that the distribution of is typically not available, since it results from a receding horizon control law. Under a suitable restriction of predictive control policies , however, the predictive distribution is typically available, since the predicted system is subject to simple, often linear, predictive dynamics. Of particular interest is the relation to the stated chance constraints (2), which are indicated by the dashed line. While the closed-loop dynamics satisfy the displayed chance constraints with high probability, this is not necessarily the case for the predicted distributions, as evident from the example distribution of . Constraining the predictive constraint violation probabilities can therefore lead to an overly conservative controller, and often feasibility issues, especially when considering unbounded disturbances.
With these consideration we formulate a shortened horizon optimization problem with chance constraints conditioned on the initial state , i.e. {mini!}[4] {π_i} E_W_k ( l_f(x_N) + ∑_i=0^N-1 l_k+i(x_i,u_i) )
\addConstraintx_i+1 = A x_i + B u_i + w_i \addConstraintu_i = π_i(x_0,w_0,…,w_i) \addConstraintW_k= [w_0^T, …, w_N^T]^T∼Q^W_k \addConstraintPr(x_i∈X — x(0) ) ≥p_x \addConstraintPr(u_i∈U — x(0) ) ≥p_u \addConstraintx_0= x(k) ,
for all , where we use a terminal cost to approximate the remainder of the horizon of the stochastic optimal control problem (1) and to establish asymptotic performance bounds (see Section IV-D).
Remark 3*.*
The formulation of chance constraints (IV), (IV) considers the probability conditioned on the initial state , i.e. it is subject to random variables as well as for the -th step constraint. This is in contrast to predictive chance constraints conditioned on the currently measured state , which are often considered in stochastic MPC formulations [3, 11] and are only subject to .
IV-A Nominal Dynamics and MPC Initialization
In order to reformulate the probabilistic cost and constraints, we split dynamics (8) into a nominal part and error . We furthermore restrict the class of control policies in the prediction to affine feedback laws acting on the error with a predefined gain . This results in predictive dynamics in nominal and error state given by
[TABLE]
We couple these predicted dynamics to the closed-loop system by defining
[TABLE]
where we initialize the nominal state at time [math] to . The nominal predicted state is hence updated to its value predicted in the previous time step , while is updated to the measured state , introducing feedback into the optimization problem, as further discussed in Section IV-B and demonstrated in simulation in Section V-A. Since we assume a fixed gain , the optimization is then carried out over and the resulting input applied to the closed-loop system (1) is
[TABLE]
where and are the state error and optimal nominal control input, respectively, at time step .
Defining the nominal state and the error state also for the closed-loop system (1) and assuming feasibility of the optimization (which will be shown in Section IV-C), we notice that due to the choice of and control law (11) we have
[TABLE]
meaning that the closed-loop error dynamics remain linear, even under the receding horizon controller, which constitutes a nonlinear control law.
IV-B Stochastic MPC Formulation
In order to satisfy chance constraints (IV), (IV), we propose an analytic approximation using tightened deterministic constraints on the nominal system state and input . To this end, we make use of suitable PRS of the error system, which are employed similarly to an invariant error set in robust MPC. To accommodate different probability levels for input and state constraints, we make use of two reachable sets of different probability levels for each time step in the control horizon , i.e.
[TABLE]
and arrive at tightened constraints of
[TABLE]
In order to guarantee recursive feasibility of the optimization problem, we furthermore introduce a terminal set and terminal tightening PRS with and , satisfying the following properties:
Assumption 1** (Terminal invariance).**
The terminal set is positively invariant for system (9a) under the control law , i.e. for all we have , and .
Combining these ingredients, the resulting tractable stochastic MPC optimization problem is defined as follows: {mini!}[4] {v_i} E_W_k ( l_f(x_N) + ∑_i=0^N-1 l_k+i(x_i,u_i))
\addConstraintx_i+1= z_i+1 + e_i+1 \addConstraintz_i+1= Az_i + B v_i \addConstrainte_i+1= (A+BK)e_i + w_i \addConstraintW_k= [w_0^T, …, w_N^T]^T∼Q^W_k \addConstraintz_i∈X ⊖R^x_i+k \addConstraintv_i∈U ⊖R^u_i+k \addConstraintz_N∈Z_f \addConstraintx_0= x(k), z_0 = z_1(k-1) , e_0 = x_0 - z_0 ,
for all . Different from other robust and stochastic MPC approaches there is no direct feedback from the measured state on the updated nominal state . Note that feedback also on the nominal trajectory is nevertheless introduced via the cost in optimization problem (1). We will show in the following that, due to the linear evolution of the closed-loop error (12), this indirect form of feedback offers benefits for the theoretical properties in terms of recursive feasibility, closed-loop chance constraint satisfaction and stability properties, as discussed in the following sections.
IV-C Recursive Feasibility and Chance Constraint Satisfaction
Since the stochastic variables in optimization problem (1) only affect the cost, recursive feasibility can be established in terms of the nominal state and input . Due to the considered update law in (10) this follows standard arguments in predictive control.
Theorem 1** (Recursive feasibility).**
Consider system (1) under the control law (11) resulting from (1). If optimization problem (1) is feasible for , then it is recursively feasible, i.e. it is feasible for all times .
Proof.
Let be the optimal solution of optimization problem (1) at time step with the resulting nominal state trajectory, satisfying the terminal constraint (1) as well as constraints (1) and (1). We want to find a candidate solution which similarly satisfies the terminal constraint (1) and constraints (1), (1) with and , i.e. for the next time step . We choose this candidate solution by shifting and applying the linear control gain in the final time step, i.e. . The first entries evidently fulfill input constraints (1) again, since . Due to Assumption 1 we furthermore have that the final entry and since for all it similarly satisfies the constraint for . Since the resulting candidate trajectory for the nominal state is . Due to an analogue argument this satisfies constraints (1), and due to Assumption 1. ∎
We can furthermore establish that optimization problem (1) results in a feasible solution to (IV) by noting the following.
Lemma 1**.**
Consider system (1) under the control law (11) resulting from (1). Conditioned on , the predicted error has the same distribution as the closed-loop error, i.e. for .
Proof.
See Appendix. ∎
From Lemma 1 it follows directly that if is an step PRS of level for the error system (12), then
[TABLE]
resulting in feasibility in (IV), i.e. obtained from (1) is a feasible solution in (IV). This directly relates to the fact that recursive feasibility due to Theorem 1 implies satisfaction of closed-loop chance constraints (2).
Theorem 2** (Chance constraint satisfaction).**
Consider system (1) under the control law (11) resulting from (1). The resulting states and inputs satisfy the closed-loop chance constraints (2).
Proof.
Due to the linear evolution of the closed-loop error (12) and by definition of -step PRS we have that for all . Due to feasibility of (1) we furthermore have . With it therefore holds . The same argument holds for the input constraints. ∎
We have therefore shown, that an MPC controller based on formulation (1) satisfies closed-loop chance constraints and is therefore a feasible solution to the stochastic optimal control problem (1).
Remark 4*.*
In (1), feedback on the nominal system is introduced through the cost only, while constraints are satisfied w.r.t. a nominal state. A straightforward extension is to consider a soft-constraint term in the cost and thereby introduce feedback from measurements also w.r.t. the constraints, which can be beneficial e.g. in cases of model mismatch while maintaining theoretical guarantees, as demonstrated in examples in Section V-A.
The presented MPC optimization problem requires the evaluation of an expected cost (IV-B). Depending on the specific form, evaluation of the expected value can be computationally expensive. For some cost functions, e.g. linear or quadratic costs, however, this can be done cheaply based on the moments of the predicted state , which will be outlined in the following section, together with a resulting average asymptotic performance bound.
IV-D Quadratic Costs and Asymptotic Average Bound
Consider the special case of regularization with zero mean i.i.d. disturbances, that is
[TABLE]
for all and a quadratic cost function
[TABLE]
with , . For the terminal cost we assume the following to hold.
Assumption 2** (Terminal cost weight).**
The terminal weight is chosen as the solution to the Lyapunov equation
[TABLE]
If is stabilizing, this solution always exists.
For the quadratic cost function, evaluation of the expected value can be carried out in terms of mean and variance of , , denoted , and , , respectively, as
[TABLE]
Due to the linear prediction dynamics, the evolution of state and input mean and variance can be readily expressed as
[TABLE]
Note that variances and are not affected by and can therefore be neglected in an implementation of problem (1) with cost function (16). After computation of the constraint tightening, the problem is therefore of similar complexity as nominal MPC.
For cost function (16) under zero mean i.i.d. noise, we can establish an asymptotic average performance bound, based on a cost decrease in expectation.
Theorem 3** (Cost decrease).**
Consider system (1) subject to i.i.d. disturbances (15) under the control law (11) resulting from (1) with cost function (16). Let be the optimal cost of (1), then
[TABLE]
Proof.
See Appendix. ∎
Using the cost decrease in Theorem 3 a standard argument leads to the following asymptotic cost bound.
Corollary 1** (Average asymptotic cost bound).**
Consider system (1) subject to i.i.d. disturbances (15) under the control law (11) resulting from (1) with cost functions (16). We have
[TABLE]
Proof.
See Appendix and [23, 7, 11] for similar derivations. ∎
Remark 5*.*
Note that the SMPC formulation is therefore guaranteed to provide better or equal asymptotic average cost as under linear feedback gain .
V Simulation Examples
We present two simulation studies to demonstrate the proposed approach. The first illustrative example demonstrates feedback properties in a simple regulation task, while the second demonstrates the approach for time-varying correlated disturbance sequences in a building control setting.
V-A Double integrator
To demonstrate the feedback properties on the nominal system trajectory through the state update scheme (10), we consider a simple regulation problem for a double integrator under i.i.d. noise and quadratic cost with , , i.e. the setup considered in Section IV-D, specifically
[TABLE]
The system is subject to constraints on the absolute velocity
[TABLE]
and we use the tube controller . For simplicity, we do not consider constraints on the inputs and make use of a constant tightening based on along Remark 1 and 2. Considering only the velocity state we get for all
[TABLE]
We choose terminal set and consider a prediction horizon of . We compare the approach, which we call PRS-rec, to the following two variants.
PRS-nom:
Cost (IV-B) is computed for the nominal states , resulting in no feedback from on the nominal system trajectory.
PRS-df:
Approach with direct feedback, in which whenever feasible and otherwise, as previously presented in [11].
V-A1 Nominal Results
We simulate the system times starting from initial condition over a horizon of time steps. We compare the resulting average closed-loop cost
[TABLE]
again averaged over all trials, as well as the maximum empirical constraint violation rate in a time step with
[TABLE]
Additionally we report the closed-loop cost starting at time step to evaluate the long term behavior without initial transient regularization. The results are given in Table I. All approaches have guaranteed closed-loop chance constraint satisfaction and it is evident that also empirically all constraints are (conservatively) fulfilled. By comparing PRS-rec and PRS-df to PRS-nom it is evident that feedback on the nominal trajectory can significantly reduce the closed-loop cost, particularly , which Corollary 1 asymptotically guarantees to be smaller than , i.e. the cost under only the linear control law . The example therefore exemplifies how without feedback on the nominal trajectory (PRS-nom) the controller degenerates to a linear control law over time. The considered novel form of feedback (PRS-rec) achieves similar cost to the direct feedback form (PRS-df), which, however, requires stronger assumptions for strict satisfaction of chance constraints, namely i.i.d. unimodal disturbance distributions and symmetric reachable sets [11].
V-A2 Model Mismatch and Soft Constraints
As mentioned in Remark 4, in the presented approach feedback only acts through the cost function which can have adverse effects on chance constraint satisfaction in cases of model mismatch, as is often the case in practical applications. We investigate this case by reducing the input matrix in simulation to , such that the controllers are designed w.r.t. misspecified actuator gain. We repeat the regulation experiments for the aforementioned controllers, as well as a soft constrained variant
PRS-recSC:
Includes an additional cost for constraint violation in the predictive distribution. Specifically, we penalize the predicted expected value of the state if it lies outside the tightened constraint set , i.e. such that . The cost on the slack variable is and chosen such that the soft constraint is always satisfied, if feasible [24].
Note that the theoretical results w.r.t. constraint satisfaction for the nominal system model similarly hold for PRS-recSC.
The results of this simulation are summarized in Table II. With respect to the incurred cost, the results from this simulation are qualitatively similar to the nominal case without modeling error. PRS-nom incurs the highest cost, since no feedback acts on the nominal trajectory and the controller degenerates to the linear feedback law . Additionally it is unable to achieve the specified level of constraint satisfaction due to the large model mismatch. The recursively feasible approach without soft constraints PRS-rec displays the lowest achieved cost, but does incur similar constraint violations as PRS-nom. For the direct feedback controller PRS-df this is not the case, since the direct feedback also acts w.r.t. to the constraints. This does, however, come at significantly higher cost. Finally, the soft constrained recursively feasible controller PRS-recSC results in similar chance constraint satisfaction, as well as good performance in terms of cost, even in this case of significant model mismatch.
In the following example we demonstrate the flexibility of the presented method by considering a building control problem, subject to non i.i.d. disturbance distributions, for which the strong guarantees in terms of closed-loop chance constraint satisfaction similarly hold.
V-B Building Control
Consider the simple building temperature control example shown in Figure 2, consisting of four rooms with individual temperatures (states) , combined heating/cooling units for each room (inputs) , and uncertain outside temperature (disturbances), represented by . The system dynamics are modeled using a resistance network (see e.g. [25]), in which each room is characterized by a thermal capacity, while the interactions are governed by the thermal conductance, corresponding to the resistances.
The resulting system description is given by
[TABLE]
in which captures the thermal conductances between the rooms, represents the thermal inertia with respect to heating/cooling, is the effect of the outside temperature on each room, is given by the overall average outside temperature, which is included into the nominal system for an intuitive presentation of the results222Note that can be similarly included in the disturbance sequence , yielding equivalent results. A zero mean representation is used for increased interpretability., and is the uncertain deviation from the mean outside temperature, see Appendix for parameters and matrices. We consider a normally distributed disturbance sequence which is highly correlated in time, i.e. the current outside temperature significantly influences the temperature in the following hours. For illustration, Figure 3 (top) displays samples from the conditional distributions used in prediction.
We consider regulation with respect to 21.75\text{,}\mathrm{\SIUnitSymbolCelsius}$$ in each room, and a quadratic cost on the temperature deviation and a cost on the absolute value of each input, corresponding to an economic cost for heating or cooling, i.e. with . We choose the tube controller feedback using an LQR design with weight matrices and consider for simplicity the desired set point as terminal set for each state, i.e. , where denotes the one vector. Comfort constraints on the room temperature and physical input constraints on the heating/cooling power are given by
[TABLE]
Simulating the system with initial condition yields the closed-loop behavior for room 1 shown in Figure 3. Because of the economic cost on the heating power, the applied input is close to zero for the first 10 hours until chilly outside temperatures during night cool the rooms below the desired set point. At hours, the reduced room temperature before nighttime requires an increased heating effort during the predicted future time steps.
Note the time-varying constraint tightening on the nominal system state , which results from considering time-varying correlated disturbance sequence. This helps to significantly reduce conservatism of the MPC controller compared to time-invariant disturbance bounds. The approach is able to regulate the room temperature close to the reference point while satisfying all comfort and input constraints. The example therefore illustrates the potential applicability of the presented approach to a number of interesting engineering applications, while providing theoretical guarantees on chance constraint satisfaction.
VI Conclusion
We presented a stochastic model predictive control approach for additive correlated disturbance sequences. Using the concept of probabilistic reachable sets for constraint tightening, as well as a novel initialization scheme of the MPC, we were able to show chance constraint satisfaction for the closed-loop system. We demonstrated in simulation that this novel update scheme achieves similar or better performance to established forms of feedback in MPC, while facilitating theoretical analysis. Finally, we demonstrated the presented framework for a building control task, highlighting the applicability of the approach to a broad class of interesting engineering applications.
Acknowledgments
The authors would like to thank A. Bollinger, C. Gaehler, J. Lygeros, and R. Smith for providing the building control simulation model.
Appendix
Proof of Lemma 1.
Since we have and due to (12) that . For the predicted errors we similarly have with that
[TABLE]
Since has the same distribution as this is equal in distribution to
[TABLE]
∎
Proof of Theorem 3.
For notational convenience we will omit the conditioning on , . Let be the cost of optimization problem (1) under input sequence . We have
[TABLE]
where is a candidate solution with resulting nominal state trajectory , see also proof of Theorem 1. Note that since has the same distribution as , we have for the resulting initial error state
[TABLE]
such that
[TABLE]
The candidate state sequence and inputs therefore result in
[TABLE]
where the last element of follows from .
We therefore have
[TABLE]
where the last equation follows from the fact that and are independent and is zero mean. Due to Assumption 2 this further simplifies to
[TABLE]
∎
Proof of Corollary 1.
Let . By the law of iterated expectations, we have
[TABLE]
since is independent of previous and given . Using the cost decrease of Theorem 3 this means
[TABLE]
From this it follows that the asymptotic average is bounded from below by zero (since is finite) and from above by
[TABLE]
and the claim follows. ∎
Building control example
The dynamics are discretized using Euler forward with step size which yields
[TABLE]
with , where
[TABLE]
and , are given by
[TABLE]
and .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] W. Langson, I. Chryssochoos, S. Raković, and D. Mayne, “Robust model predictive control using tubes,” Automatica , vol. 40, no. 1, pp. 125 – 133, 2004.
- 2[2] A. Mesbah, “Stochastic Model Predictive Control: An Overview and Perspectives for Future Research,” IEEE Control Systems , vol. 36, no. 6, pp. 30–44, 2016.
- 3[3] M. Farina, L. Giulioni, and R. Scattolini, “Stochastic linear Model Predictive Control with chance constraints - A review,” Journal of Process Control , vol. 44, pp. 53–67, 2016.
- 4[4] B. Kouvaritakis and M. Cannon, Model Predictive Control: Classical, Robust and Stochastic . Springer, 2016.
- 5[5] M. Cannon, B. Kouvaritakis, S. V. Rakovic, and Q. Cheng, “Stochastic tubes in model predictive control with probabilistic constraints,” IEEE Transactions on Automatic Control , vol. 56, no. 1, pp. 194–200, 2011.
- 6[6] M. Korda, R. Gondhalekar, F. Oldewurtel, and C. N. Jones, “Stochastic MPC framework for controlling the average constraint violation,” IEEE Transactions on Automatic Control , vol. 59, no. 7, pp. 1706–1721, 2014.
- 7[7] M. Lorenzen, F. Dabbene, R. Tempo, and F. Allgöwer, “Constraint-Tightening and Stability in Stochastic Model Predictive Control,” IEEE Transactions on Automatic Control , vol. 62, no. 7, pp. 3165–3177, 2017.
- 8[8] M. Farina, L. Giulioni, L. Magni, and R. Scattolini, “A probabilistic approach to Model Predictive Control,” Conference on Decision and Control , pp. 7734–7739, 2013.
