Transformer-based Model Predictive Control: Trajectory Optimization via Sequence Modeling
Davide Celestini, Daniele Gammelli, Tommaso Guffanti, Simone D'Amico,, Elisa Capello, Marco Pavone

TL;DR
This paper introduces a transformer-augmented MPC framework that enhances trajectory optimization efficiency and convergence, reducing computation time and solver iterations while maintaining high performance in robot control tasks.
Contribution
It proposes embedding transformer-based neural networks into MPC to provide near-optimal initial guesses, improving convergence and reducing computational costs.
Findings
Up to 75% improvement in trajectory generation performance.
Reduced solver iterations by up to 45%.
Achieved 7x faster MPC runtime without performance loss.
Abstract
Model predictive control (MPC) has established itself as the primary methodology for constrained control, enabling general-purpose robot autonomy in diverse real-world scenarios. However, for most problems of interest, MPC relies on the recursive solution of highly non-convex trajectory optimization problems, leading to high computational complexity and strong dependency on initialization. In this work, we present a unified framework to combine the main strengths of optimization-based and learning-based methods for MPC. Our approach entails embedding high-capacity, transformer-based neural network models within the optimization process for trajectory generation, whereby the transformer provides a near-optimal initial guess, or target plan, to a non-convex optimization problem. Our experiments, performed in simulation and the real world onboard a free flyer platform, demonstrate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
