MPC-Guided Safe Reinforcement Learning and Lipschitz-Based Filtering for Structured Nonlinear Systems
Patrick Kostelac, Xuerui Wang, and Anahita Jamshidnejad

TL;DR
This paper introduces an integrated MPC-RL framework that combines the robustness of MPC with the adaptability of RL, ensuring safety and constraint satisfaction in nonlinear systems like aerospace and robotics.
Contribution
It proposes a novel combined MPC-RL approach with Lipschitz-based safety filtering, enabling real-time safe control without heavy online optimization.
Findings
Enhanced disturbance rejection in nonlinear aeroelastic systems
Reduced actuator effort while maintaining safety
Robust performance under turbulence conditions
Abstract
Modern engineering systems, such as autonomous vehicles, flexible robotics, and intelligent aerospace platforms, require controllers that are robust to uncertainties, adaptive to environmental changes, and safety-aware under real-time constraints. RL offers powerful data-driven adaptability for systems with nonlinear dynamics that interact with uncertain environments. RL, however, lacks built-in mechanisms for dynamic constraint satisfaction during exploration. MPC offers structured constraint handling and robustness, but its reliance on accurate models and computationally demanding online optimization may pose significant challenges. This paper proposes an integrated MPC-RL framework that combines stability and safety guarantees of MPC with the adaptability of RL. During training, MPC defines safe control bounds that guide the RL component and that enable constraint-aware policy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Aeroelasticity and Vibration Control · Adaptive Dynamic Programming Control
