Q-Guided Stein Variational Model Predictive Control via RL-informed Policy Prior

Shizhe Cai; Zeya Yin; Jayadeep Jacob; Fabio Ramos

arXiv:2507.06625·cs.RO·March 5, 2026

Q-Guided Stein Variational Model Predictive Control via RL-informed Policy Prior

Shizhe Cai, Zeya Yin, Jayadeep Jacob, Fabio Ramos

PDF

Open Access

TL;DR

This paper introduces Q-SVMPC, a novel learning-based MPC method that uses Stein variational inference guided by learned Q-values to maintain solution diversity and improve robustness in complex control tasks.

Contribution

It presents a new Q-guided Stein variational MPC approach with an RL-informed policy prior, enhancing solution diversity and stability over existing methods.

Findings

01

Improved sample efficiency in navigation and manipulation tasks

02

Enhanced stability and robustness compared to baseline methods

03

Successful real-world application in fruit-picking task

Abstract

Model Predictive Control (MPC) enables reliable trajectory optimization under dynamics constraints, but often depends on accurate dynamics models and carefully hand-designed cost functions. Recent learning-based MPC methods aim to reduce these modeling and cost-design burdens by learning dynamics, priors, or value-related guidance signals. Yet many existing approaches still rely on deterministic gradient-based solvers (e.g., differentiable MPC) or parametric sampling-based updates (e.g., CEM/MPPI), which can lead to mode collapse and convergence to a single dominant solution. We propose Q-SVMPC, a Q-guided Stein variational MPC method with an RL-informed policy prior, which casts learning-based MPC as trajectory-level posterior inference and refines trajectory particles via SVGD under learned soft Q-value guidance to explicitly preserve diverse solutions. Experiments on navigation,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Model Reduction and Neural Networks · Robot Manipulation and Learning