MPC-Guided Safe Reinforcement Learning and Lipschitz-Based Filtering for Structured Nonlinear Systems

Patrick Kostelac; Xuerui Wang; and Anahita Jamshidnejad

arXiv:2512.12855·cs.RO·December 16, 2025

MPC-Guided Safe Reinforcement Learning and Lipschitz-Based Filtering for Structured Nonlinear Systems

Patrick Kostelac, Xuerui Wang, and Anahita Jamshidnejad

PDF

Open Access

TL;DR

This paper introduces an integrated MPC-RL framework that combines the robustness of MPC with the adaptability of RL, ensuring safety and constraint satisfaction in nonlinear systems like aerospace and robotics.

Contribution

It proposes a novel combined MPC-RL approach with Lipschitz-based safety filtering, enabling real-time safe control without heavy online optimization.

Findings

01

Enhanced disturbance rejection in nonlinear aeroelastic systems

02

Reduced actuator effort while maintaining safety

03

Robust performance under turbulence conditions

Abstract

Modern engineering systems, such as autonomous vehicles, flexible robotics, and intelligent aerospace platforms, require controllers that are robust to uncertainties, adaptive to environmental changes, and safety-aware under real-time constraints. RL offers powerful data-driven adaptability for systems with nonlinear dynamics that interact with uncertain environments. RL, however, lacks built-in mechanisms for dynamic constraint satisfaction during exploration. MPC offers structured constraint handling and robustness, but its reliance on accurate models and computationally demanding online optimization may pose significant challenges. This paper proposes an integrated MPC-RL framework that combines stability and safety guarantees of MPC with the adaptability of RL. During training, MPC defines safe control bounds that guide the RL component and that enable constraint-aware policy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Aeroelasticity and Vibration Control · Adaptive Dynamic Programming Control