MPC for Humanoid Gait Generation: Stability and Feasibility

Nicola Scianca; Daniele De Simone; Leonardo Lanari; Giuseppe Oriolo

arXiv:1901.08505·cs.RO·December 20, 2024

MPC for Humanoid Gait Generation: Stability and Feasibility

Nicola Scianca, Daniele De Simone, Leonardo Lanari, Giuseppe Oriolo

PDF

TL;DR

This paper introduces IS-MPC, a stable model predictive control framework for humanoid gait generation that guarantees stability and feasibility through explicit constraints, validated by simulations and real robot experiments.

Contribution

It proposes a novel stable MPC formulation with explicit stability constraints for humanoid gait generation, ensuring bounded CoM trajectories and recursive feasibility.

Findings

01

Guarantees stability of CoM/ZMP dynamics.

02

Ensures recursive feasibility of the MPC algorithm.

03

Validated on NAO and HRP-4 humanoid robots.

Abstract

We present IS-MPC, an intrinsically stable MPC framework for humanoid gait generation which incorporates an explicit stability constraint in the formulation. The proposed method uses as prediction model a dynamically extended LIP where ZMP velocities are the control inputs, producing in real time a gait (including footsteps with the associated timing) that realizes omnidirectional motion commands coming from an external source. The stability constraint links the future ZMP velocities to the current system state so as to guarantee the essential requirement that the generated CoM trajectory is bounded with respect to the ZMP trajectory. Since the control horizon of the MPC algorithm is finite, only part of the future ZMP velocities are decision variables of the QP problem; the remaining part, called tail, must be either conjectured or anticipated using preview information on the reference…

Figures37

Click any figure to enlarge with its caption.

Equations131

\hat{X}_{f}^{k}

\hat{X}_{f}^{k}

\hat{Y}_{f}^{k}

Θ_{f}^{k}

T_{s}^{k} = {T_{s}^{1}, \dots, T_{s}^{F}},

T_{s}^{k} = {T_{s}^{1}, \dots, T_{s}^{F}},

v = \overset{v}{ˉ} + Δ v = \frac{L ˉ _{s} + Δ L _{s}}{T _{s} - Δ T _{s}},

v = \overset{v}{ˉ} + Δ v = \frac{L ˉ _{s} + Δ L _{s}}{T _{s} - Δ T _{s}},

T_{s} = \overline{T}_{s} \frac{α + v ˉ}{α + v} .

T_{s} = \overline{T}_{s} \frac{α + v ˉ}{α + v} .

t_{s}^{j} = t_{s}^{j - 1} + \overline{T}_{s} \frac{α + v ˉ}{α + v ( t _{s}^{j - 1} )},

t_{s}^{j} = t_{s}^{j - 1} + \overline{T}_{s} \frac{α + v ˉ}{α + v ( t _{s}^{j - 1} )},

\left(\begin{array}[]{c}\dot{x}\\ \dot{y}\\ \dot{\theta}\end{array}\right)=\left(\begin{array}[]{ccc}\cos\theta&-\sin\theta&0\\ \sin\theta&\cos\theta&0\\ 0&0&1\end{array}\right)\left(\begin{array}[]{c}v_{x}\\ v_{y}\\ \omega\end{array}\right).

\left(\begin{array}[]{c}\dot{x}\\ \dot{y}\\ \dot{\theta}\end{array}\right)=\left(\begin{array}[]{ccc}\cos\theta&-\sin\theta&0\\ \sin\theta&\cos\theta&0\\ 0&0&1\end{array}\right)\left(\begin{array}[]{c}v_{x}\\ v_{y}\\ \omega\end{array}\right).

\left(\begin{array}[]{c}\Delta x^{j}\\ \Delta y^{j}\end{array}\right)=\int_{t^{j-1}_{s}}^{t^{j}_{s}}R_{\theta}\left(\begin{array}[]{c}v_{x}(\tau)\\ v_{y}(\tau)\end{array}\right)d\tau\pm R_{j}\left(\begin{array}[]{c}0\\ \ell/2\end{array}\right),

\left(\begin{array}[]{c}\Delta x^{j}\\ \Delta y^{j}\end{array}\right)=\int_{t^{j-1}_{s}}^{t^{j}_{s}}R_{\theta}\left(\begin{array}[]{c}v_{x}(\tau)\\ v_{y}(\tau)\end{array}\right)d\tau\pm R_{j}\left(\begin{array}[]{c}0\\ \ell/2\end{array}\right),

\overset{x}{¨}_{c} = η^{2} (x_{c} - x_{z}),

\overset{x}{¨}_{c} = η^{2} (x_{c} - x_{z}),

\left(\begin{array}[]{c}\dot{x}_{c}\\ \ddot{x}_{c}\\ \dot{x}_{z}\end{array}\right)=\left(\begin{array}[]{ccc}0&1&0\\ \eta^{2}&0&-\eta^{2}\\ 0&0&0\end{array}\right)\left(\begin{array}[]{c}x_{c}\\ \dot{x}_{c}\\ x_{z}\end{array}\right)+\left(\begin{array}[]{c}0\\ 0\\ 1\end{array}\right)\dot{x}_{z}.

\left(\begin{array}[]{c}\dot{x}_{c}\\ \ddot{x}_{c}\\ \dot{x}_{z}\end{array}\right)=\left(\begin{array}[]{ccc}0&1&0\\ \eta^{2}&0&-\eta^{2}\\ 0&0&0\end{array}\right)\left(\begin{array}[]{c}x_{c}\\ \dot{x}_{c}\\ x_{z}\end{array}\right)+\left(\begin{array}[]{c}0\\ 0\\ 1\end{array}\right)\dot{x}_{z}.

\overset{x}{˙}_{z} (t) = \overset{x}{˙}_{z}^{i}, t \in [t_{i}, t_{i + 1}) .

\overset{x}{˙}_{z} (t) = \overset{x}{˙}_{z}^{i}, t \in [t_{i}, t_{i + 1}) .

x_{z} (t) = x_{z}^{i} + (t - t_{i}) \overset{x}{˙}_{z}^{i}, \mbox w i t h ∣ \overset{x}{˙}_{z}^{i} ∣ \leq γ,

x_{z} (t) = x_{z}^{i} + (t - t_{i}) \overset{x}{˙}_{z}^{i}, \mbox w i t h ∣ \overset{x}{˙}_{z}^{i} ∣ \leq γ,

R_{j}^{T}\left(\begin{array}[]{c}\delta\sum_{l=0}^{i}\dot{x}_{z}^{k+l}-x_{f}^{j}\\[10.0pt] \delta\sum_{l=0}^{i}\dot{y}_{z}^{k+l}-y_{f}^{j}\end{array}\right)\leq\frac{1}{2}\left(\begin{array}[]{c}d_{z,x}\\[5.0pt] d_{z,y}\end{array}\right)-R_{j}^{T}\left(\begin{array}[]{c}x_{z}^{k}\\[5.0pt] y_{z}^{k}\end{array}\right).

R_{j}^{T}\left(\begin{array}[]{c}\delta\sum_{l=0}^{i}\dot{x}_{z}^{k+l}-x_{f}^{j}\\[10.0pt] \delta\sum_{l=0}^{i}\dot{y}_{z}^{k+l}-y_{f}^{j}\end{array}\right)\leq\frac{1}{2}\left(\begin{array}[]{c}d_{z,x}\\[5.0pt] d_{z,y}\end{array}\right)-R_{j}^{T}\left(\begin{array}[]{c}x_{z}^{k}\\[5.0pt] y_{z}^{k}\end{array}\right).

R_{j-1}^{T}\left(\begin{array}[]{c}x_{f}^{j}-x_{f}^{j-1}\\[5.0pt] y_{f}^{j}-y_{f}^{j-1}\end{array}\right)\leq\pm\left(\begin{array}[]{c}0\\[5.0pt] \ell\end{array}\right)+\frac{1}{2}\left(\begin{array}[]{c}d_{a,x}\\[5.0pt] d_{a,y}\end{array}\right),

R_{j-1}^{T}\left(\begin{array}[]{c}x_{f}^{j}-x_{f}^{j-1}\\[5.0pt] y_{f}^{j}-y_{f}^{j-1}\end{array}\right)\leq\pm\left(\begin{array}[]{c}0\\[5.0pt] \ell\end{array}\right)+\frac{1}{2}\left(\begin{array}[]{c}d_{a,x}\\[5.0pt] d_{a,y}\end{array}\right),

x_{s}

x_{s}

x_{u}

\overset{x}{˙}_{s}

\overset{x}{˙}_{s}

\overset{x}{˙}_{u}

x_{u}^{k} = η \int_{t_{k}}^{\infty} e^{- η (τ - t_{k})} x_{z} (τ) d τ .

x_{u}^{k} = η \int_{t_{k}}^{\infty} e^{- η (τ - t_{k})} x_{z} (τ) d τ .

x_{u}^{k + C} = η \int_{t_{k + C}}^{\infty} e^{- η (τ - t_{k + C})} x_{z} (τ) d τ .

x_{u}^{k + C} = η \int_{t_{k + C}}^{\infty} e^{- η (τ - t_{k + C})} x_{z} (τ) d τ .

x_{u}^{k} = x_{z}^{k} + \frac{1 - e ^{- η δ}}{η} i = 0 \sum \infty e^{- i η δ} \overset{x}{˙}_{z}^{k + i},

x_{u}^{k} = x_{z}^{k} + \frac{1 - e ^{- η δ}}{η} i = 0 \sum \infty e^{- i η δ} \overset{x}{˙}_{z}^{k + i},

x_{u}^{k + C} = x_{z}^{k + C} + \frac{1 - e ^{- η δ}}{η} e^{C η δ} i = C \sum \infty e^{- i η δ} \overset{x}{˙}_{z}^{k + i} .

x_{u}^{k + C} = x_{z}^{k + C} + \frac{1 - e ^{- η δ}}{η} e^{C η δ} i = C \sum \infty e^{- i η δ} \overset{x}{˙}_{z}^{k + i} .

x_{z} (t) = x_{z}^{k} + i = 0 \sum \infty (ρ (t - t_{k + i}) - ρ (t - t_{k + i + 1})) \overset{x}{˙}_{z}^{k + i},

x_{z} (t) = x_{z}^{k} + i = 0 \sum \infty (ρ (t - t_{k + i}) - ρ (t - t_{k + i + 1})) \overset{x}{˙}_{z}^{k + i},

\int_{t_{k}}^{\infty} e^{- η (τ - t_{k})} (ρ (τ - t_{k + i}) - ρ (τ - t_{k + i + 1})) d τ = \frac{1 - e ^{- η δ}}{η ^{2}} e^{- i η δ} .

\int_{t_{k}}^{\infty} e^{- η (τ - t_{k})} (ρ (τ - t_{k + i}) - ρ (τ - t_{k + i + 1})) d τ = \frac{1 - e ^{- η δ}}{η ^{2}} e^{- i η δ} .

x_{z} (t) = x_{z}^{k} +

x_{z} (t) = x_{z}^{k} +

+

i = 0 \sum C - 1 e^{- i η δ} \overset{x}{˙}_{z}^{k + i} = - i = C \sum \infty e^{- i η δ} \overset{x}{˙}_{z}^{k + i} + \frac{η}{1 - e ^{- η δ}} (x_{u}^{k} - x_{z}^{k}) .

i = 0 \sum C - 1 e^{- i η δ} \overset{x}{˙}_{z}^{k + i} = - i = C \sum \infty e^{- i η δ} \overset{x}{˙}_{z}^{k + i} + \frac{η}{1 - e ^{- η δ}} (x_{u}^{k} - x_{z}^{k}) .

i = 0 \sum C - 1 e^{- i η δ} \overset{x}{˙}_{z}^{k + i} = - i = C \sum \infty e^{- i η δ} \dot{\tilde{x}}_{z}^{k + i} + \frac{η}{1 - e ^{- η δ}} (x_{u}^{k} - x_{z}^{k}),

i = 0 \sum C - 1 e^{- i η δ} \overset{x}{˙}_{z}^{k + i} = - i = C \sum \infty e^{- i η δ} \dot{\tilde{x}}_{z}^{k + i} + \frac{η}{1 - e ^{- η δ}} (x_{u}^{k} - x_{z}^{k}),

x_{u}^{k + C} = x_{z}^{k + C} + \frac{1 - e ^{- η δ}}{η} e^{C η δ} i = C \sum \infty e^{- i η δ} \dot{\tilde{x}}_{z}^{k + i} .

x_{u}^{k + C} = x_{z}^{k + C} + \frac{1 - e ^{- η δ}}{η} e^{C η δ} i = C \sum \infty e^{- i η δ} \dot{\tilde{x}}_{z}^{k + i} .

\dot{\tilde{x}}_{z}^{k + i} = 0 \mbox f or i \geq C .

\dot{\tilde{x}}_{z}^{k + i} = 0 \mbox f or i \geq C .

i = 0 \sum C - 1 e^{- i η δ} \overset{x}{˙}_{z}^{k + i} = \frac{η}{1 - e ^{- η δ}} (x_{u}^{k} - x_{z}^{k}),

i = 0 \sum C - 1 e^{- i η δ} \overset{x}{˙}_{z}^{k + i} = \frac{η}{1 - e ^{- η δ}} (x_{u}^{k} - x_{z}^{k}),

x_{u}^{k + C} = x_{z}^{k + C} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

MPC for Humanoid Gait Generation:

Stability and Feasibility

Nicola Scianca, Daniele De Simone, Leonardo Lanari, Giuseppe Oriolo The authors are with the Dipartimento di Ingegneria Informatica, Automatica e Gestionale, Sapienza Università di Roma, Via Ariosto 25, 00185 Rome, Italy. E-mail: {lastname}@diag.uniroma1.it. This work was supported by the European Commission through the H2020 project 645097 COMANOID.

Abstract

We present IS-MPC, an intrinsically stable MPC framework for humanoid gait generation that incorporates a stability constraint in the formulation. The method uses as prediction model a dynamically extended LIP with ZMP velocities as control inputs, producing in real time a gait (including footsteps with timing) that realizes omnidirectional motion commands coming from an external source. The stability constraint links future ZMP velocities to the current state so as to guarantee that the generated CoM trajectory is bounded with respect to the ZMP trajectory. Being the MPC control horizon finite, only part of the future ZMP velocities are decision variables; the remaining part, called tail, must be either conjectured or anticipated using preview information on the reference motion. Several options for the tail are discussed, each corresponding to a specific terminal constraint. A feasibility analysis of the generic MPC iteration is developed and used to obtain sufficient conditions for recursive feasibility. Finally, we prove that recursive feasibility guarantees stability of the CoM/ZMP dynamics. Simulation and experimental results on NAO and HRP-4 are presented to highlight the performance of IS-MPC.

I Introduction

Many gait generation approaches for humanoids guarantee that balance is maintained during locomotion by enforcing the condition that the Zero Moment Point (ZMP, the point where the horizontal component of the moment of the ground reaction forces becomes zero) remains at all times within the support polygon of the robot. Correspondingly, these approaches identify the ZMP as the fundamental variable to be controlled.

Due to the complexity of full humanoid dynamics, however, direct control of the ZMP is very difficult to achieve. In view of this, simplified models are generally used to relate the evolution of the ZMP to that of the Center of Mass (CoM) of the robot, which can be instead effectively controlled. Widely adopted linear models are the Linear Inverted Pendulum (LIP), in which the ZMP represents an input, and the Cart-Table (CT), where the ZMP appears as the output [1]. The first is appropriate for inversion-based control approaches: given a sequence of footsteps, and thus a ZMP trajectory interpolating them, the LIP is used to compute a CoM trajectory which corresponds to the ZMP trajectory; see, e.g., [2, 3, 4]. The CT model lends itself more naturally to the design of feedback laws for tracking ZMP trajectories, the most successful example in this context being the LQ preview controller of [5].

Regardless of the adopted model, there is a potential instability issue at the heart of the problem. In particular, a certain ZMP trajectory may be realized by an infinity of CoM trajectories, which, due to the nature of the CoM/ZMP dynamics, will in general be divergent with respect to the ZMP trajectory itself. In this situation, dynamic balance can be in principle achieved by properly choosing the ZMP trajectory, but internal instability indicates that such motion will not be feasible in practice for the humanoid.

The seminal paper [6] reformulates the gait generation problem in a Model Predictive Control (MPC) setting. This is convenient because it allows to generate simultaneously the ZMP and the CoM trajectories while satisfying constraints, such as the ZMP balance condition as well as kinematic constraints on the maximum step length and foot rotation [7]. Moreover, the MPC approach guarantees a certain robustness against perturbations. It is therefore not surprising that it has been adopted in many methods for gait generation; e.g., see [8, 9, 10, 11] for linear MPC and [12, 13] for nonlinear MPC.

As for all control schemes, a fundamental issue in MPC approaches is the stability of the obtained closed-loop system, especially in view of the previous remark about the instability of the CoM/ZMP dynamics. As discussed in [14], two main approaches have emerged for achieving stability when MPC is used for humanoid gait generation. The first is heuristic in nature and consists in using a sufficiently long control horizon [15], so that the optimization process can discriminate against diverging behaviors, as done for example in [7]. The second approach has been to enforce a terminal state constraint (i.e., a constraint on the state at the end of the control horizon), based on the fact that the MPC literature highlights the beneficial role of such constraints for closed-loop stability in set-point control problems [16].

In particular, terminal constraints were used for humanoid balancing in [17] and for gait generation in [18]. The latter makes use of a LIP model, requiring its unstable component to stop at the end of the control horizon, a kind of terminal constraint referred to as capturability constraint (from the concept of capture point [19]). This constraint has also been used in [20], where it is imposed only at the foot landing instant, and in [21], which addresses locomotion in a multi-contact setting.

Another approach focusing on the instability issue relies on the concept of Divergent Component of Motion (DCM), used in [22] to identify an initial condition for stable execution of regular gaits, and in [23] to realize transitions between bipedal and quadrupedal gaits. The DCM concept has also been extended to the 3D context in [24, 25]. More relevant to our review is [26], which presents an MPC scheme for gait generation that enforces a terminal constraint (actually converted to a terminal cost for the sake of feasibility) on the DCM component.

In this paper, we move from the fundamental observation that the control problem addressed in MPC-based gait generation is neither a set-point nor a tracking problem. In fact, since the ZMP control objective is encoded via time-varying state constraints, there is no error to be regulated to (or close to) zero. The only significant stability issue in this context is internal stability, i.e., the boundedness of the CoM trajectory with respect to the ZMP trajectory. Therefore, one cannot simply claim that the use of a terminal constraint will automatically entail internal stability. In fact, to the best of our knowledge, no MPC-based gait generation method exists in the literature for which a rigorous analysis of the stability issue has been performed in connection with the use and the choice of a terminal constraint.

Another tightly related aspect to be considered is that terminal constraints may have a detrimental effect on feasibility, i.e., the existence of solutions for the optimization problem which is at the core of any MPC scheme [27]. A particularly desirable property is recursive feasibility, which entails that if the optimization problem is feasible at a certain iteration it will remain such in future iterations. It appears that this also crucial issue has seldom been explored for MPC-based gait generation, with the notable exceptions of [28, 29].

In [30] we have introduced a novel MPC approach for humanoid gait generation which relies on the inclusion of an explicit stability constraint in the formulation of the problem. In particular, the idea was to enforce a condition on the future ZMP velocities (representing the control inputs) so as to guarantee that the generated CoM trajectory remains bounded with respect to the ZMP trajectory. Since the control horizon of the MPC algorithm is finite, only part of the future ZMP velocities are decision variables and can therefore be subject to a constraint; the remaining part, called tail, must be conjectured.

Here, we fully develop our approach into a complete, Intrinsically Stable MPC (IS-MPC) framework for gait generation. In particular, the paper adds the following contributions with respect to [30]:

we describe a footstep generation module that can be used in conjunction with our MPC scheme in order to modify step timing and length in real time in response to omnidirectional motion commands coming from a higher-level module; 2. 2.

depending on the available preview information on the commanded motion, we discuss several versions of the tail (truncated, periodic, anticipative) to be used in the stability constraint, and show that each of them corresponds to a specific terminal constraint; 3. 3.

we analyze in detail the impact of the new constraint on feasibility, and show analytically how, under certain assumptions, it is possibile to guarantee recursive feasibility of the IS-MPC scheme; 4. 4.

we prove that recursive feasibility of IS-MPC implies the desired internal stability of the CoM/ZMP dynamics; 5. 5.

we validate our findings by providing dynamic simulations and actual experiments on two different humanoid robots: an HRP-4 and a NAO.

The results on tails, recursive feasibility and internal stability are the main contributions of this paper. We consider them particularly important because they indicate that, contrarily to what is often claimed in the literature, simply adding a terminal constraint (e.g., the capturability constraint) does not per se guarantee stability of MPC-based gait generation schemes. Indeed, the appropriate tail to be used in the stability constraint — equivalently, the appropriate terminal constraint — depends upon the future characteristics of the commanded motion. In this sense, to guarantee recursive feasibility one should always choose the anticipative tail, which makes the most use of the available preview information on such motion. Once recursive feasibility is achieved, CoM/ZMP stability is automatically ensured in IS-MPC.

Another potential benefit of the theoretical analysis of feasibility is that it paves the road for a formal study of the robustness of IS-MPC. Although this is out of the scope of this paper, by relying on this analysis it should be possible to devise modifications of the basic scheme which will preserve recursive feasibility in the presence of quantified bounded uncertainties and/or disturbances.

The paper is organized as follows. In the next section, we formulate the considered gait generation problem and discuss the structure of the proposed approach. Section III describes the algorithm which generates timing and locations of the candidate footsteps. In Sect. IV we introduce the prediction model and the constraints used in the IS-MPC scheme, with the exception of the stability constraint which is given a thorough discussion in the dedicated Sect. V. The IS-MPC algorithm is described in detail in Sect. VI. Section VII addresses the central issues of stability and feasibility of the proposed method; in particular, a theoretical analysis of the feasibility of the generic IS-MPC iteration is presented and used to obtain sufficient conditions for recursive feasibility, whose role in guaranteeing stability is rigorously established. Simulations on the HRP-4 humanoid are presented in Sect. VIII, while experimental results on both the NAO and the HRP-4 humanoids are shown in Sect. IX. Section X offers a few concluding remarks.

II Problem and Approach

Consider the problem of generating a walking gait for a humanoid in response to high-level reference velocities, which are given as the driving ( $v_{x}$ , $v_{y}$ ) and steering ( $\omega$ ) velocities of an omnidirectional single-body mobile robot chosen as a template model for motion generation. These velocities, which may encode a persistent trajectory or converge to a stationary point, are produced by an external source; this could be a human operator in a shared control context, or another module of the control architecture working in open-loop (planning) or in closed-loop (feedback control).

The proposed MPC-based framework, whose block scheme is shown in Fig. 1, works in a digital fashion over sampling intervals of duration $\delta$ . Throughout the paper, it is assumed that the reference velocities $v_{x}$ , $v_{y}$ , $\omega$ are made available for gait generation with a preview horizon $T_{p}=P\cdot\delta$ , with $P$ the number of intervals within the preview horizon. At the generic instant $t_{k}=k\cdot\delta$ , the high-level references velocities over $[t_{k},t_{k}+T_{p}]$ are then sent to the footstep generation module, which uses Quadratic Programming (QP) to generate candidate footsteps over the same interval. In particular, vectors $\hat{X}_{f}^{k}$ , $\hat{Y}_{f}^{k}$ collect the Cartesian positions of the footsteps, with the ‘hat’ indicating that these are candidates which can be modified by the MPC module; whereas vector $\Theta_{f}^{k}$ collects the footstep orientations, which will not be modified. The footstep generation module also generates the timing ${\cal T}_{s}^{k}$ of the sequence.

The output of the footstep generation module is sent to the Intrinsically Stable MPC (IS-MPC) module, which solves another QP problem to produce in real time the actual footstep positions $X_{f}^{k}$ , $Y_{f}^{k}$ and the trajectory ${\mbox{\boldmath$ p $}}^{\ast}_{c}$ of the humanoid CoM over the control horizon $T_{c}=C\cdot\delta$ , with $C$ the number of intervals within the control horizon. It is assumed that $T_{c}\leq T_{p}$ , i.e., $C\leq P$ . The inclusion of a stability constraint in the formulation guarantees that the CoM trajectory will be bounded, in a sense to be made precise later.

The pose (position and orientation) of the footsteps with the associated timing is used to generate — still in real time — the swing foot trajectory ${\mbox{\boldmath$ p $}}^{\ast}_{\it swg}$ over the control horizon. Together with the CoM trajectory, this is sent to the kinematic control block, which generates velocity inputs at the joint level in order to achieve output tracking (we are assuming that the humanoid robot is velocity- or position-controlled).

In the next sections we will discuss in detail the proposed control scheme. We will first describe the footstep generation scheme, and then turn our attention to the IS-MPC algorithm, which is our core contribution. The kinematic control block can use any standard pseudoinverse-based feedback law and therefore will not be discussed further.

III Candidate Footstep Generation

The proposed footstep generation module runs synchronously with the IS-MPC scheme and chooses both the timing and the candidate location of the next footsteps in response to the high-level reference velocities. Timing is determined first by a simple rule expressing the fact that a change in the reference velocity should affect both the step duration and length. The candidate footstep locations are then chosen through quadratic optimization.

Note that generating the timing and the orientation of the candidate footsteps outside the IS-MPC is essential to retain the linear structure of the latter. The IS-MPC scheme will still be able to adapt the position of the footsteps to guarantee reactivity to disturbances.

At each sampling instant $t_{k}$ , the candidate footstep generation module receives in input the high-level reference velocities over the preview horizon, i.e., from $t_{k}$ to $t_{k}+T_{p}=t_{k+P}$ (see Fig. 1). In output, it provides the candidate footstep sequence $(\hat{X}^{k}_{f},\hat{Y}^{k}_{f},\Theta^{k}_{f})$ over the same interval with the associated timing ${\cal T}_{s}^{k}$ . In particular, these quantities are defined111To keep a light notation, the $k$ symbol identifying the current sampling instant is used for the sequence vectors but not for their individual elements. as

[TABLE]

and

[TABLE]

where $(x_{f}^{j},y_{f}^{j},\theta_{f}^{j})$ is the pose of the $j$ -th footstep in the preview horizon and $T^{j}_{s}$ is the duration of the step between the $(j-1)$ -th and the $j$ -th footstep, taken from the start of the single support phase to the next. Since the duration of steps is variable, the number $F$ of footsteps falling within the preview horizon $T_{p}$ may change at each $t_{k}$ .

Below, we discuss first how timing is determined and then describe the procedure for generating the candidate footsteps.

III-A Candidate Footstep Timing

In our method, the duration $T_{s}$ of each step is related to the magnitude $v=(v_{x}^{2}+v_{y}^{2})^{1/2}$ of the reference Cartesian velocity at the beginning of that step.

Assume that a triplet of cruise parameters $(\bar{v},\overline{T}_{s},\bar{L}_{s})$ has been chosen, where $\bar{v}$ is a central value of $v$ and $\overline{T}_{s}$ , $\bar{L}_{s}$ are the corresponding values of the step duration and length, respectively, with $\bar{v}=\bar{L}_{s}/\overline{T}_{s}$ . The choice of these parameters will depend on the specific kinematic and dynamic capabilities of the humanoid robot under consideration.

The idea is that a deviation from $\bar{v}$ should reflect on a change in both $T_{s}$ and $L_{s}$ . In formulas:

[TABLE]

with $\Delta L_{s}=\alpha\Delta T_{s}$ . One easily obtains

[TABLE]

Figure 2 shows the resulting rule for determining $T_{s}$ as a function of $v$ in comparison to other possible rules. For illustration, we have set $\bar{v}=0.15$ m/s, $\overline{T}_{s}=0.8$ s, $\bar{L}_{s}=0.12$ m and $\alpha=0.1$ m/s. It is confirmed that an increase of $v$ , for example, corresponds to both a decrease of $T_{s}$ and an increase in $L_{s}$ .

Note that the reference angular velocity $\omega$ does not enter into rule (1). The rationale is that the step duration and length along curved and rectilinear paths do not differ significantly if the Cartesian velocity $v$ is the same. For a purely rotational motion ( $v=0$ ) where the humanoid is only required to rotate on the spot, the above rule would yield the maximum value of $T_{s}$ .

In practice, equation (1) is iterated along the preview horizon $[t_{k},t_{k}+T_{p}]$ in order to obtain the footstep timestamps:

[TABLE]

with $t^{0}_{s}$ equal to the timestamp of the last footstep before $t_{k}$ . Iterations must be stopped as soon as $t^{j}_{s}>t_{k+P}$ , discarding the last generated timestamp since it will be outside the preview horizon. The resulting step timing will be ${\cal T}^{k}_{s}=\{T^{1}_{s},\ldots,T^{F}_{s}\}$ , with $T^{j}_{s}=t_{s}^{j+1}-t^{j}_{s}$ .

III-B Candidate Footstep Placement

Once the timing of the steps in the preview horizon $[t_{k},t_{k}+T_{p}]$ has been chosen, the poses of candidate footsteps are generated. To this end, we use a reference trajectory obtained by integrating the following template model under the action of the high-level reference velocities over $T_{p}$ :

[TABLE]

This is an omnidirectional motion model which allows the template robot to move along any Cartesian path with any orientation, so as to perform, e.g., lateral walks, diagonal walks, and so on.

The idea is to distribute the candidate footsteps around the reference trajectory in accordance to the timing ${\cal T}_{s}^{k}$ while taking into account the kinematic constraints of the robot. These constraints will also be used in the IS-MPC stage, and therefore we will provide their description directly in Sect. IV-C (see also Fig. 7).

A sequence of two QP problems is solved. The first is

$\left\{\begin{minipage}{433.62pt} $$\min_{\Theta^{k}_{f}}\>\sum_{j=1}^{F}(\theta^{j}_{f}-\theta^{j-1}_{f}-\int_{t^{j-1}_{s}}^{t^{j}_{s}}\omega(\tau)d\tau)^{2}$$ $$\mbox{subject to}\quad|\theta^{j}_{f}-\theta^{j-1}_{f}|\leq\theta_{\rm max}$$ \end{minipage}\right.$

Here, $\theta_{\rm max}$ is the maximum allowed rotation between two consecutive footsteps. The second QP problem is

$\left\{\begin{minipage}{433.62pt} $$\min_{\hat{X}_{f}^{k},\hat{Y}_{f}^{k}}\>\sum_{j=1}^{F}(\hat{x}_{f}^{j}-\hat{x}_{f}^{j-1}-\Delta x^{j})^{2}+(\hat{y}_{f}^{j}-\hat{y}_{f}^{j-1}-\Delta y^{j})^{2}$$ \centerline{\hbox{subject to kinematic constraints~{}(\ref{eq:footposcon})}} \end{minipage}\right.$

Here, $(\hat{x}_{f}^{0},\hat{y}_{f}^{0})$ is the known position of the support foot at $t_{k}$ and $\Delta x^{j}$ , $\Delta y^{j}$ are given by

[TABLE]

where $R_{\theta}$ , $R_{j}$ are the rotation matrices associated respectively to $\theta(\tau)$ (the orientation of the template robot at any given time $\tau$ ) and to the footstep orientation $\theta_{j}$ , and $\ell$ is the reference coronal distance between consecutive footsteps. The sign of the second term alternates for left/right footsteps.

At the end of this procedure, the candidate footstep sequence $(\hat{X}^{k}_{f},\hat{Y}^{k}_{f},\Theta^{k}_{f})$ with the associated timing ${\cal T}_{s}^{k}$ is sent to the IS-MPC stage. The final footstep positions $(X^{k}_{f},Y^{k}_{f})$ will be determined by the latter while the footstep orientations $\Theta^{k}_{f}$ and timing ${\cal T}_{s}^{k}$ will not be modified.

Some examples of candidate footsteps generation are shown in Fig. 3. Note that the orientation of the humanoid robot is tangent to the path for the circular walk, but is kept constant ( $\omega=0$ ) for the other two walks, which represent then proper examples of omnidirectional motion.

IV IS-MPC: Prediction Model and Constraints

The IS-MPC module uses the Linear Inverted Pendulum (LIP) as a prediction model. The constraints are of three kinds. The first concerns the position of the ZMP, which must be at all times within the support polygon defined by the footstep sequence and the associated timing. The second type of constraint ensures that the generated steps are compatible with the kinematic capabilities of the robot. The third is the new stability constraint guaranteeing that the CoM trajectory generated by our MPC scheme will be bounded with respect to the ZMP trajectory. The first two constraints must be verified throughout the control horizon, whereas the third is a single scalar condition on each coordinate.

In this section, we discuss in detail the prediction model and the constraints on ZMP and kinematic feasibility. The next section will be devoted to the stability constraint, which deserves a thorough discussion.

IV-A Prediction Model

The LIP is a popular choice for describing the motion of the CoM of a biped walking on flat horizontal floor when its height is kept constant and no rotational effects are present. From now on, we express motions in the robot frame, which has its origin at the center of the current support foot, the $x$ -axis (sagittal) aligned with the support foot, and the $y$ -axis (coronal) orthogonal to the $x$ -axis. In the LIP model, which applies to both point feet and finite-sized feet, the dynamics along the sagittal and coronal axes are governed by decoupled, identical linear differential equations.

Consider for illustration the motion along the $x$ axis (see Fig. 4), and let $x_{c}$ and $x_{z}$ be respectively the coordinate of the CoM and the ZMP. The LIP dynamics is

[TABLE]

where $\eta=\sqrt{g/h_{c}}$ , with $g$ the gravity acceleration and $h_{c}$ the constant height of the CoM. In this model, the ZMP position $x_{z}$ represents the input, whereas the CoM position $x_{c}$ is the output.

To obtain smoother trajectories, we take the ZMP velocity $\dot{x}_{z}$ as the actual control input. This leads to the following third-order prediction model (LIP $+$ dynamic extension)

[TABLE]

Our MPC scheme uses piecewise-constant control over the sampling intervals (see Fig. 5):

[TABLE]

In particular, a bound of the form $|\dot{x}_{z}^{i}|\leq\gamma$ , with $\gamma$ a positive constant, will be satisfied for all $i$ . In fact, the reference velocities $v_{x}$ , $v_{y}$ , $\omega$ will be bounded in any realistic gait generation problem. As shown by Fig. 2, the footstep generation module will then produce a sequence of footsteps along which the step duration is bounded below. This timing will be reflected in the associated ZMP constraints (see Sect. IV-B), which will in turn entail as solution a piecewise-continuous trajectory $x_{z}(t)$ with bounded derivative. Therefore, for $t\in[t_{i},t_{i+1}]$ it will be

[TABLE]

where we have used the notation $x_{z}^{i}=x_{z}(t_{i})$ .

The generic iteration of IS-MPC plans over the control horizon, i.e., from $t_{k}$ to $t_{k}+T_{c}=t_{k+C}$ . Since $T_{c}\leq T_{p}$ , a subset of the $F$ candidate footsteps produced by the footstep generation module fall inside the control horizon; denote their number by $F^{\prime}<F$ . The MPC iteration will then generate:

$\bullet$

the control variables, i.e., the input values $\dot{x}_{z}^{k+i}$ , $\dot{y}_{z}^{k+i}$ , for $i=0,\dots,C-1$ ;

$\bullet$

the other decision variables, i.e., the actual footstep positions $(x_{f}^{j},x_{f}^{j})$ , for $j\!=\!1,\ldots,F^{\prime}$ .

$\bullet$

as a byproduct, the output history $x_{c}(t)$ , $y_{c}(t)$ , for $t\in[t_{k},t_{k+C}]$ which will be ultimately used to drive the actual humanoid.

As already mentioned, the orientations of the footsteps are instead inherited from the generated sequence (more on this in Sect. IV-B).

Note that the footsteps do not appear in the prediction model, but will show up in the constraints, as discussed in the rest of this section.

IV-B ZMP Constraints

The first constraint guarantees dynamic balance by imposing that the ZMP lies inside the current support polygon at all time instants within the control horizon.

When the robot is in single support on the $j$ -th footstep, the admissible region for the ZMP is the interior of the footstep, which can be approximated as a rectangle of dimensions $d_{z,x},d_{z,y}$ , centered at $(x_{f}^{j},y_{f}^{j})$ , and oriented as $\theta^{j}$ . Using the fact that the ZMP profile is piecewise-linear as entailed by (5), the constraint can be expressed as222For compactness, we shall only write the right-hand side of bilateral inequality constraints. For example, constraint (6) should be completed by a left-hand side obtained by adding (rather than subtracting) the two terms that appear in the right-hand side.:

[TABLE]

If the above sampled-time ZMP constraint is satisfied, then the original continuous-time constraint is also satisfied thanks to the linearity of $x_{z}(t)$ within each sampling interval. Constraint (6), complete with the corresponding left-hand side, must be imposed throughout the control horizon ( $i=0,\dots,C-1$ ) and for all the associated footsteps ( $j=0,\dots,F^{\prime}$ ).

Note that constraint (6) is nonlinear in the footstep orientation $\theta^{j}$ , which however is not a decision variable, being simply inherited from the footstep generation module. The constraint is instead linear in $x_{f}^{j}$ , $y_{f}^{j}$ , as well as in the ZMP velocity inputs.

During double support, the support polygon would be the convex hull of the two footsteps, whose boundary is a nonlinear function of their relative position. To preserve linearity, we adopt an approach based on moving constraints [31]. In particular, the admissible region for the ZMP in double support has exactly the same shape and dimensions it has in single support, and it roto-translates (i.e., simultaneously rotates and translates) from one footstep to the other in such a way to always remain in the support polygon (see Fig. 6). This results in a slightly conservative constraint which is however linear in the decision variables.

IV-C Kinematic Constraints

The second type of constraint is introduced to ensure that all steps are compatible with the robot kinematic limits. Consider the $j$ -th step in $T_{c}$ , with the support foot centered at $(x_{f}^{j-1},y_{f}^{j-1})$ and oriented as $\theta^{j-1}$ . The admissible region for placing the footstep is defined as a rectangle having the same orientation $\theta^{j-1}$ and whose center is displaced from the support foot center by a distance $\ell$ in the coronal direction (see Fig. 7). Denoting by $d_{a,x}$ and $d_{a,y}$ the dimensions of the kinematically admissible region, the constraint can be written as

[TABLE]

with the sign alternating for the two feet. The above constraint, complete with the corresponding left-hand side, must be imposed for all footsteps in the control horizon ( $j=1,\dots,F^{\prime}$ ).

V IS-MPC: Enforcing Stability

The LIP dynamics (3) is inherently unstable. As a consequence, even when the ZMP lies at all times within the support polygon (gait balance) it may still happen that the CoM diverges exponentially with respect to the ZMP; in this case, the gait would obviously become unfeasible in practice, due to the kinematic limitations of the robot. The role of the stability constraint is then to guarantee that the CoM trajectory remains bounded with respect to the ZMP (internal stability).

In this section, we first describe the structure of the stability constraint and then discuss the possible tails for its implementation.

V-A Stability Constraint

Since we want to enforce boundedness of the CoM w.r.t. the ZMP, we can ignore the dynamic extension and focus directly on the LIP system.

By using the following change of coordinates

[TABLE]

the LIP part of system (3) is decomposed into a stable and an unstable subsystem:

[TABLE]

The unstable component $x_{u}$ is also known as divergent component of motion (DCM) [22] or capture point [32].

In spite of the LIP instability, for any input ZMP trajectory $x_{z}(t)$ of the form (5) there exists a special initialization of $x_{u}$ such that the resulting output CoM trajectory is bounded with respect to the input [33]. In particular, this is the (only) initial condition on $x_{u}$ for which the free evolution of (11) exactly cancels the component of the forced evolution that would diverge with respect to $x_{z}(t)$ . In the MPC context, where the initial condition at $t_{k}$ is denoted by $x_{u}(t_{k})=x_{u}^{k}$ , the special initialization is expressed as

[TABLE]

Note that this particular initialization depends on the future values of the LIP input, i.e., the ZMP coordinate $x_{z}$ . In the following, we refer to (12) as the stability condition.

The stability condition, which involves $x_{u}$ at the initial instant $t_{k}$ of the control horizon, can be propagated to its final instant $t_{k+C}$ by integrating (11) from $x_{u}^{k}$ in (12):

[TABLE]

Condition (12) — or equivalently, (13) — can be used to set up the corresponding constraint for the MPC problem. To this end, we use the piecewise-linear profile (5) of $x_{z}$ to obtain explicit forms.

Proposition 1

For the piecewise-linear $x_{z}$ in (5), condition (12) becomes

[TABLE]

while (13) takes the form

[TABLE]

Proof. Rewrite eq. (5) as

[TABLE]

where $\rho(t)=t\,\delta_{-1}(t)$ denotes the unit ramp and $\delta_{-1}(t)$ the unit step. Using Properties 1, 4 and 3 given in the Appendix, we get

[TABLE]

Plugging this expression in condition (12) and using Property 2 of the Appendix one obtains (14).

To prove (15), rewrite (16) as

[TABLE]

The contribution of the first two terms of $x_{z}$ to the integral in (13) is $x_{z}^{k+C}$ . Using Properties 1, 3 and 4 one verifies that the contribution of the third term is exactly the second term in the right hand side of (15). This completes the proof.

In (14), one should logically separate the values of $\dot{x}_{z}^{i}$ within the control horizon, i.e. the control variables $\dot{x}_{z}^{i}$ for $i=k,\dots,k+C-1$ , from the remaining values, i.e., from $k+C$ on. The infinite summation is then split in two parts and (14) can be rearranged as333Constraint (17) can be written as a function of the actual state variables of our prediction model ( $x_{c}$ , $\dot{x}_{c}$ and $x_{z}$ ) using the coordinate transformation (9). The same is true for all subsequent forms of the stability constraint as well as of the terminal constraint.

[TABLE]

Observe the inversion between (14), which expresses the stable initialization at $t_{k}$ for a given $x_{z}(t)$ , and (17), which constrains the control variables so that the associated stable initialization matches the current state at $t_{k}$ . In the following, we will refer to (17) as the stability constraint.

The control variables do not appear in condition (15), which involves only the value of the state variable $x_{u}^{k+C}$ at the end of the control horizon. In other terms, this condition represents what is called a terminal constraint in the MPC literature.

Both the stability and the terminal constraint contain an infinite summation which depends on $\dot{x}_{z}^{k+C}$ , $\dot{x}_{z}^{k+C+1},\dots$ , i.e., the ZMP velocities after the control horizon. These are obviously unknown, because they will be determined by future iterations of the MPC algorithm; as a consequence, including either of the constraints in the MPC formulation would lead to a non-causal (unrealizable) controller. However, by exploiting the preview information on $v_{x}$ , $v_{y}$ , $\omega$ , we can make an informed conjecture at $t_{k}$ about these ZMP velocities, which we will denote by $\dot{\tilde{x}}_{z}^{k+C}$ , $\dot{\tilde{x}}_{z}^{k+C+1},\dots$ and refer to collectively as the tail in the following. Correspondingly, the stability constraint (17) assumes the form

[TABLE]

while the terminal constraint (15) becomes

[TABLE]

Using either of these in the MPC formulation will lead to a causal (realizable) controller.

V-B Tails

We now discuss three possible options for the structure of the tail depending on the assumed behavior of the ZMP velocities after the control horizon. Basically, they correspond to (i) neglecting them (ii) assuming they are periodic (iii) anticipating a more general profile based on preview information. For each option, we shall explicitly compute the corresponding form of both the stability and the terminal constraint.

V-B1 Truncated Tail

The simplest option is to truncate the tail, by assuming that the corresponding ZMP velocities are all zero. This is a sensible choice if the preview information indicates that the robot is expected to stop at the end of the control horizon.

Proposition 2

Let (truncated tail)

[TABLE]

The stability constraint becomes

[TABLE]

while the terminal constraint becomes

[TABLE]

Proof. The above expressions are readily derived from the general constraints (18) and (19), respectively.

Interestingly, the terminal constraint (21) is equivalent to the capturability constraint, originally introduced in [18].

V-B2 Periodic Tail

The second option is to use a periodic tail obtained by infinite replication of the ZMP velocities within the control horizon. This assumption is justified when the reference velocities are themselves periodic (in particular, constant) in $T_{c}$ , which is typically chosen as the gait period (total duration of two consecutive steps) or a multiple of it. Formulas for a replication period different from the control horizon may be easily derived.

Proposition 3

Let (periodic tail)

[TABLE]

The stability constraint becomes

[TABLE]

while the terminal constraint becomes

[TABLE]

Proof. If the tail is periodic, the infinite summation in (18) can be rewritten as follows:

[TABLE]

which can be plugged in (18) and in (19), respectively, to obtain (22) and (23).

Note that, using (11), the terminal constraint (23) can be rewritten as

[TABLE]

V-B3 Anticipative Tail

In the general case, one can use the candidate footsteps produced by the footstep generation module beyond the control horizon to conjecture a tail in $[T_{c},T_{p}]$ . This is done in two phases: in the first, we generate in $[T_{c},T_{p}]$ a ZMP trajectory which belongs at all times to the admissible ZMP region defined by the footsteps $\{(\hat{x}^{F^{\prime}}_{f},\hat{y}^{F^{\prime}}_{f},\theta^{F^{\prime}}_{f}),\dots,(\hat{x}^{F}_{f},\hat{y}^{F}_{f},\theta^{F}_{f})\}$ . In the second phase, we sample the time derivative of this ZMP trajectory every $\delta$ seconds.

Denote the samples obtained by the above procedure by $\dot{x}_{z,\rm ant}^{k+i}$ , for $i=C,\dots,P-1$ . The anticipative tail is then obtained by:

•

setting $\dot{\tilde{x}}_{z}^{k+i}=\dot{x}_{z,\rm ant}^{k+i}$ for $i=C,\dots,P-1$ ;

•

using a truncated or periodic expression for the residual part of the tail located after the preview horizon, i.e., for $\dot{\tilde{x}}_{z}^{k+i}$ , $i=P,P+1,\dots\,$ .

The stability constraint (18) then becomes

[TABLE]

Once a form is chosen for the residual part of the tail, this formula leads to a closed-form expression of the stability constraint which consists of a finite number of terms, and is therefore still amenable to real-time implementation. Similarly, one can use (19) to derive the corresponding expression of the terminal constraint.

In the following, and specifically in the feasibility analysis of Sect. VII-B2, we will use a particular form of anticipative tail such that (i) the ZMP trajectory in $[T_{c},T_{p}]$ is always at the center of the ZMP admissible region, and (ii) the residual part of the tail is truncated.

VI IS-MPC: Algorithm

Each iteration of our IS-MPC algorithm solves a QP problem based on the prediction model and constraints described in Sect. IV, with the addition of the stability constraint discussed in the previous section.

VI-A Formulation of the QP Problem

Collect in vectors

[TABLE]

all the MPC decision variables.

At this point, the QP problem can be formulated as:

$\left{\begin{minipage}{433.62pt}

Bibliography39

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] S. Kajita, H. Hirukawa, K. Harada, and K. Yokoi, Introduction to Humanoid Robotics . Springer Publishing Company Inc., 2014.
2[2] K. Harada, S. Kajita, K. Kaneko, and H. Hirukawa, “An analytical method for real-time gait planning for humanoid robots,” International Journal of Humanoid Robotics , vol. 03, no. 01, pp. 1–19, 2006.
3[3] M. Morisawa, K. Harada, S. Kajita, K. Kaneko, F. Kanehiro, K. Fujiwara, S. Nakaoka, and H. Hirukawa, “A biped pattern generation allowing immediate modification of foot placement in real-time,” in 6th IEEE-RAS Int. Conf. on Humanoid Robots , 2006, pp. 581–586.
4[4] T. Buschmann, S. Lohmeier, M. Bachmayer, H. Ulbrich, and F. Pfeiffer, “A collocation method for real-time walking pattern generation,” in 7th IEEE-RAS Int. Conf. on Humanoid Robots , 2007, pp. 1–6.
5[5] S. Kajita, F. Kanehiro, K. Kaneko, K. Fujiwara, K. Harada, K. Yokoi, and H. Hirukawa, “Biped walking pattern generation by using preview control of zero-moment point,” in 2003 IEEE Int. Conf. on Robotics and Automation , 2003, pp. 1620–1626.
6[6] P.-B. Wieber, “Trajectory free linear model predictive control for stable walking in the presence of strong perturbations,” in 6th IEEE-RAS Int. Conf. on Humanoid Robots , 2006, pp. 137–142.
7[7] A. Herdt, H. Diedam, P.-B. Wieber, D. Dimitrov, K. Mombaur, and M. Diehl, “Online walking motion generation with automatic footstep placement,” Advanced Robotics , vol. 24, no. 5-6, pp. 719–737, 2010.
8[8] J. Alcaraz-Jiménez, D. Herrero-Pérez, and H. Martínez-Barberá, “Robust feedback control of ZMP-based gait for the humanoid robot Nao,” The International Journal of Robotics Research , vol. 32, no. 9-10, pp. 1074–1088, 2013.