Learning $Q$-function approximations for hybrid control problems

Sandeep Menta; Joseph Warrington; John Lygeros; Manfred Morari

arXiv:2105.13517·math.OC·June 9, 2021·IEEE Control. Syst. Lett.

Learning $Q$-function approximations for hybrid control problems

Sandeep Menta, Joseph Warrington, John Lygeros, Manfred Morari

PDF

TL;DR

This paper introduces a method to approximate $Q$-functions for hybrid control systems, enabling more efficient model predictive control without the need for empirically tuned terminal costs, and demonstrates its effectiveness on benchmark problems.

Contribution

The paper proposes a novel approach to approximate $Q$-functions for hybrid systems within MPC, reducing reliance on terminal costs and improving control performance.

Findings

01

Successfully learned $Q$-approximations for high-dimensional hybrid systems

02

Achieved comparable or better control performance than Hybrid MPC

03

Reduced computation time compared to traditional Hybrid MPC

Abstract

The main challenge in controlling hybrid systems arises from having to consider an exponential number of sequences of future modes to make good long-term decisions. Model predictive control (MPC) computes a control action through a finite-horizon optimisation problem. A key ingredient in this problem is a terminal cost, to account for the system's evolution beyond the chosen horizon. A good terminal cost can reduce the horizon length required for good control action and is often tuned empirically by observing performance. We build on the idea of using $N$ -step $Q$ -functions $(Q^{(N)})$ in the MPC objective to avoid having to choose a terminal cost. We present a formulation incorporating the system dynamics and constraints to approximate the optimal $Q^{(N)}$ -function and algorithms to train the approximation parameters through an exploration of the state space. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.