Parameter-Robust MPPI for Safe Online Learning of Unknown Parameters
Matti Vahs, Jaeyoun Choi, Niklas Schmid, Jana Tumova, Chuchu Fan

TL;DR
This paper introduces PRMPPI, a control framework that combines online parameter learning with safety constraints, ensuring safe and efficient robot operation in uncertain, dynamic environments.
Contribution
It presents a novel integration of Stein Variational Gradient Descent and Conformal Prediction within a model predictive control framework for safety and learning.
Findings
Higher success rates in experiments
Lower tracking error compared to baselines
More accurate parameter estimates
Abstract
Robots deployed in dynamic environments must remain safe even when key physical parameters are uncertain or change over time. We propose Parameter-Robust Model Predictive Path Integral (PRMPPI) control, a framework that integrates online parameter learning with probabilistic safety constraints. PRMPPI maintains a particle-based belief over parameters via Stein Variational Gradient Descent, evaluates safety constraints using Conformal Prediction, and optimizes both a nominal performance-driven and a safety-focused backup trajectory in parallel. This yields a controller that is cautious at first, improves performance as parameters are learned, and ensures safety throughout. Simulation and hardware experiments demonstrate higher success rates, lower tracking error, and more accurate parameter estimates than baselines.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Advanced Control Systems Optimization
