Approximate Dynamic Programming for Constrained Linear Systems: A Piecewise Quadratic Approximation Approach
Kanghui He, Shengling Shi, Ton van den Boom, and Bart De Schutter

TL;DR
This paper presents a novel approximate dynamic programming method for constrained linear systems that combines model predictive control insights with a piecewise quadratic neural network to improve online computation and stability.
Contribution
It introduces a new ADP approach using a convex piecewise quadratic neural network to approximate the value function for constrained linear quadratic regulation problems.
Findings
The method achieves faster online computation.
It maintains stability with good value function approximation.
Comparative simulations show improved performance.
Abstract
Approximate dynamic programming (ADP) faces challenges in dealing with constraints in control problems. Model predictive control (MPC) is, in comparison, well-known for its accommodation of constraints and stability guarantees, although its computation is sometimes prohibitive. This paper introduces an approach combining the two methodologies to overcome their individual limitations. The predictive control law for constrained linear quadratic regulation (CLQR) problems has been proven to be piecewise affine (PWA) while the value function is piecewise quadratic. We exploit these formal results from MPC to design an ADP method for CLQR problems. A novel convex and piecewise quadratic neural network with a local-global architecture is proposed to provide an accurate approximation of the value function, which is used as the cost-to-go function in the online dynamic programming problem. An…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFuel Cells and Related Materials · Advanced Control Systems Optimization · Adaptive Dynamic Programming Control
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
