Temporal Attention for Adaptive Control of Euler-Lagrange Systems with Unobservable Memory
Giansalvo Cirrincione, Adriano Fagiolini

TL;DR
This paper introduces a novel meta-control architecture using temporal attention for adaptive control of Euler-Lagrange systems with unobservable friction states, enhancing tracking accuracy in certain regimes.
Contribution
It proposes a method to select attention heads via surrogate analysis and integrates this into reinforcement learning for improved adaptive control.
Findings
Outperforms Transformer baseline in short and matched memory regimes.
Achieves 12-19 percentage point reduction in tracking error.
Large effect sizes with statistical significance in tested regimes.
Abstract
Adaptive control of Euler-Lagrange systems is challenging when friction is governed by a finite-horizon internal state that is not directly observable from joint measurements. In this setting, the measured closed-loop state is no longer Markovian, and standard certainty-equivalence adaptive laws may lose their convergence guarantees. The paper proposes a meta-control architecture in which the gains of a computed-torque controller are generated by a self-attention block processing a short window of recent motion history. The number of attention heads is selected before policy training through a surrogate analysis of the autocovariance of the memory-state gradient along the temporal window. This surrogate is based on a temporal adaptation of an incremental rank-tracking framework previously developed by the authors. The selected head count is then fixed and used as an architectural…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
