Learning $Q$-function approximations for hybrid control problems
Sandeep Menta, Joseph Warrington, John Lygeros, Manfred Morari

TL;DR
This paper introduces a method to approximate $Q$-functions for hybrid control systems, enabling more efficient model predictive control without the need for empirically tuned terminal costs, and demonstrates its effectiveness on benchmark problems.
Contribution
The paper proposes a novel approach to approximate $Q$-functions for hybrid systems within MPC, reducing reliance on terminal costs and improving control performance.
Findings
Successfully learned $Q$-approximations for high-dimensional hybrid systems
Achieved comparable or better control performance than Hybrid MPC
Reduced computation time compared to traditional Hybrid MPC
Abstract
The main challenge in controlling hybrid systems arises from having to consider an exponential number of sequences of future modes to make good long-term decisions. Model predictive control (MPC) computes a control action through a finite-horizon optimisation problem. A key ingredient in this problem is a terminal cost, to account for the system's evolution beyond the chosen horizon. A good terminal cost can reduce the horizon length required for good control action and is often tuned empirically by observing performance. We build on the idea of using -step -functions in the MPC objective to avoid having to choose a terminal cost. We present a formulation incorporating the system dynamics and constraints to approximate the optimal -function and algorithms to train the approximation parameters through an exploration of the state space. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
