Transformer-based Model Predictive Control: Trajectory Optimization via   Sequence Modeling

Davide Celestini; Daniele Gammelli; Tommaso Guffanti; Simone D'Amico,; Elisa Capello; Marco Pavone

arXiv:2410.23916·cs.RO·November 1, 2024

Transformer-based Model Predictive Control: Trajectory Optimization via Sequence Modeling

Davide Celestini, Daniele Gammelli, Tommaso Guffanti, Simone D'Amico,, Elisa Capello, Marco Pavone

PDF

TL;DR

This paper introduces a transformer-augmented MPC framework that enhances trajectory optimization efficiency and convergence, reducing computation time and solver iterations while maintaining high performance in robot control tasks.

Contribution

It proposes embedding transformer-based neural networks into MPC to provide near-optimal initial guesses, improving convergence and reducing computational costs.

Findings

01

Up to 75% improvement in trajectory generation performance.

02

Reduced solver iterations by up to 45%.

03

Achieved 7x faster MPC runtime without performance loss.

Abstract

Model predictive control (MPC) has established itself as the primary methodology for constrained control, enabling general-purpose robot autonomy in diverse real-world scenarios. However, for most problems of interest, MPC relies on the recursive solution of highly non-convex trajectory optimization problems, leading to high computational complexity and strong dependency on initialization. In this work, we present a unified framework to combine the main strengths of optimization-based and learning-based methods for MPC. Our approach entails embedding high-capacity, transformer-based neural network models within the optimization process for trajectory generation, whereby the transformer provides a near-optimal initial guess, or target plan, to a non-convex optimization problem. Our experiments, performed in simulation and the real world onboard a free flyer platform, demonstrate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.