Transformers As Generalizable Optimal Controllers

Turki Bin Mohaya; Maitham F. AL-Sunni; John M. Dolan; Peter Seiler

arXiv:2603.14910 · eess.SY · March 17, 2026

Open Access

TL;DR

This paper demonstrates that transformer-based policies can effectively learn near-optimal state-feedback controllers for a broad class of linear systems, generalizing across different system dimensions and parameters.

Contribution

It introduces a transformer-based approach to approximating optimal controllers for heterogeneous LTI systems. A single policy generalizes across systems and adapts to new ones without requiring plant matrices at inference time.

Findings

01. Achieves small sub-optimality compared to LQR

02. Remains stabilizing under moderate perturbations

03. Benefits from lightweight fine-tuning on unseen systems
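
To make "sub-optimality compared to LQR" concrete, here is a minimal sketch for a scalar system. It assumes a discrete-time plant x[t+1] = a*x[t] + b*u[t] with stage cost q*x^2 + r*u^2, and it measures sub-optimality as the relative cost gap J/J_LQR - 1. The listing does not say how the paper defines its metric, so the metric, the function names, and the numeric values are all illustrative assumptions.

```python
def scalar_lqr(a, b, q, r, iters=1000):
    """Return (gain k, optimal cost-to-go p) for x[t+1] = a*x[t] + b*u[t].

    Uses fixed-point iteration on the scalar discrete-time Riccati equation;
    the resulting feedback law is u = -k*x.
    """
    p = q
    for _ in range(iters):
        k = a * b * p / (r + b * b * p)
        p = q + a * p * (a - b * k)
    return k, p


def closed_loop_cost(a, b, q, r, k, x0=1.0):
    """Infinite-horizon cost of u = -k*x; infinite if the loop is unstable."""
    c = a - b * k  # closed-loop pole
    if abs(c) >= 1.0:
        return float("inf")
    # Geometric series: sum over t of (q + r*k^2) * (c^t * x0)^2
    return x0 * x0 * (q + r * k * k) / (1.0 - c * c)


def suboptimality(a, b, q, r, k):
    """Relative cost gap of gain k versus the LQR gain (assumed metric)."""
    _, p = scalar_lqr(a, b, q, r)
    # With x0 = 1 the optimal cost equals p.
    return closed_loop_cost(a, b, q, r, k) / p - 1.0


# Hypothetical unstable plant, chosen only for illustration.
a, b, q, r = 1.2, 1.0, 1.0, 1.0
k_opt, p = scalar_lqr(a, b, q, r)
gap_opt = suboptimality(a, b, q, r, k_opt)        # LQR gain itself
gap_off = suboptimality(a, b, q, r, 1.2 * k_opt)  # gain that is 20% too large
```

Here the gap is zero (up to rounding) for the LQR gain and positive for any other stabilizing gain. A learned policy would be scored the same way, using its measured closed-loop cost in place of `closed_loop_cost`.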

Abstract

We study whether optimal state-feedback laws for a family of heterogeneous Multiple-Input, Multiple-Output (MIMO) Linear Time-Invariant (LTI) systems can be captured by a single learned controller. We train one transformer policy on trajectories generated by the Linear Quadratic Regulator (LQR) for systems with different state and input dimensions, using a shared representation with standardization, padding, dimension encoding, and masked loss. The policy maps recent state history to control actions without requiring plant matrices at inference time. Across a broad set of systems, it achieves empirically small sub-optimality relative to LQR, remains stabilizing under moderate parameter perturbations, and benefits from lightweight fine-tuning on unseen systems. These results support transformer policies as practical approximators of near-optimal feedback laws over structured linear-system…
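
The data pipeline described in the abstract can be sketched roughly as follows. This is not the authors' implementation: it assumes discrete-time dynamics, computes the LQR gain by plain Riccati iteration, and covers only the padding and masked-loss steps (standardization and dimension encoding are omitted). The names `lqr_gain`, `rollout`, `pad`, `masked_mse`, and the sizes `MAX_N` and `MAX_M` are hypothetical.

```python
import numpy as np


def lqr_gain(A, B, Q, R, iters=500):
    """Discrete-time LQR gain K (u = -K x) via Riccati value iteration."""
    P = Q.copy()
    for _ in range(iters):
        K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
        P = Q + A.T @ P @ (A - B @ K)
    return K


def rollout(A, B, K, x0, steps):
    """Simulate x[t+1] = A x[t] + B u[t] under u[t] = -K x[t]."""
    xs, us = [], []
    x = x0
    for _ in range(steps):
        u = -K @ x
        xs.append(x)
        us.append(u)
        x = A @ x + B @ u
    return np.array(xs), np.array(us)


def pad(seq, width):
    """Zero-pad the feature axis to `width`; return (padded, 0/1 mask)."""
    steps, dim = seq.shape
    padded = np.zeros((steps, width))
    mask = np.zeros((steps, width))
    padded[:, :dim] = seq
    mask[:, :dim] = 1.0
    return padded, mask


def masked_mse(pred, target, mask):
    """Mean squared error over real (unpadded) entries only."""
    return float(np.sum(mask * (pred - target) ** 2) / np.sum(mask))


# Hypothetical shared widths for all systems in the training family.
MAX_N, MAX_M = 4, 2

# One example system: a double integrator with 2 states and 1 input.
A = np.array([[1.0, 0.1], [0.0, 1.0]])
B = np.array([[0.0], [0.1]])
K = lqr_gain(A, B, np.eye(2), np.eye(1))
xs, us = rollout(A, B, K, np.array([1.0, 0.0]), steps=20)

x_pad, x_mask = pad(xs, MAX_N)  # model input, shape (20, MAX_N)
u_pad, u_mask = pad(us, MAX_M)  # regression target, shape (20, MAX_M)

# Whatever a model emits in padded slots does not affect the loss.
pred = u_pad.copy()
pred[:, 1] = 5.0
loss = masked_mse(pred, u_pad, u_mask)
```

Padding lets systems of different sizes share one tensor shape, and the mask keeps the padded coordinates from contributing to the training signal.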

Taxonomy

Topics: Model Reduction and Neural Networks · Adaptive Dynamic Programming Control · Reinforcement Learning in Robotics