Parameter-Robust MPPI for Safe Online Learning of Unknown Parameters

Matti Vahs; Jaeyoun Choi; Niklas Schmid; Jana Tumova; Chuchu Fan

arXiv:2601.02948·cs.RO·January 7, 2026

Parameter-Robust MPPI for Safe Online Learning of Unknown Parameters

Matti Vahs, Jaeyoun Choi, Niklas Schmid, Jana Tumova, Chuchu Fan

PDF

Open Access

TL;DR

This paper introduces PRMPPI, a control framework that combines online parameter learning with safety constraints, ensuring safe and efficient robot operation in uncertain, dynamic environments.

Contribution

It presents a novel integration of Stein Variational Gradient Descent and Conformal Prediction within a model predictive control framework for safety and learning.

Findings

01

Higher success rates in experiments

02

Lower tracking error compared to baselines

03

More accurate parameter estimates

Abstract

Robots deployed in dynamic environments must remain safe even when key physical parameters are uncertain or change over time. We propose Parameter-Robust Model Predictive Path Integral (PRMPPI) control, a framework that integrates online parameter learning with probabilistic safety constraints. PRMPPI maintains a particle-based belief over parameters via Stein Variational Gradient Descent, evaluates safety constraints using Conformal Prediction, and optimizes both a nominal performance-driven and a safety-focused backup trajectory in parallel. This yields a controller that is cautious at first, improves performance as parameters are learned, and ensures safety throughout. Simulation and hardware experiments demonstrate higher success rates, lower tracking error, and more accurate parameter estimates than baselines.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Advanced Control Systems Optimization