Toward Single-Step MPPI via Differentiable Predictive Control
Viet-Anh Le, Renukanandan Tumu, and Rahul Mangharam

TL;DR
This paper introduces Step-MPPI, a neural network-based framework that enables efficient single-step MPPI for real-time control by learning an optimized sampling distribution, reducing computational cost and tuning complexity.
Contribution
The paper presents a novel neural network approach to learn sampling distributions for single-step MPPI, improving efficiency and scalability over traditional methods.
Findings
Step-MPPI achieves millisecond-level latency in complex control tasks.
It effectively handles high-dimensional and long-horizon control problems.
The learned distribution improves sampling efficiency compared to standard MPPI.
Abstract
Model predictive path integral (MPPI) is a sampling-based method for solving complex model predictive control (MPC) problems, but its real-time implementation faces two key challenges: the computational cost and sample requirements grow with the prediction horizon, and manually tuning the sampling covariance requires balancing exploration and noise. To address these issues, we propose Step-MPPI, a framework that learns a sampling distribution for efficient single-step lookahead MPPI implementation. Specifically, we use a neural network to parameterize the MPPI proposal distribution at each time step, and train it in a self-supervised manner over a long horizon using the MPC cost, constraint penalties, and a maximum-entropy regularization term. By embedding long-horizon objectives into training the neural distribution policy, Step-MPPI achieves the foresight of a multi-step optimizer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
