Beyond Similarity: Temporal Operator Attention for Time Series Analysis

Jevon Twitty; Vinh Pham; Nitiwith Rotchanarak; Viresh Pati; Yubin Kim; Shihao Yang; Jiecheng Lu

arXiv:2605.11287·cs.LG·May 13, 2026

Beyond Similarity: Temporal Operator Attention for Time Series Analysis

Jevon Twitty, Vinh Pham, Nitiwith Rotchanarak, Viresh Pati, Yubin Kim, Shihao Yang, Jiecheng Lu

PDF

TL;DR

This paper introduces Temporal Operator Attention (TOA), a novel framework augmenting attention with learnable sequence operators to better capture temporal dynamics in time series, outperforming traditional models.

Contribution

The paper formalizes the limitations of standard attention in time series and proposes TOA with stochastic regularization, improving performance across various benchmarks.

Findings

01

TOA enhances forecasting, anomaly detection, and classification accuracy.

02

Explicit operator learning improves modeling of oscillatory and signed transformations.

03

TOA achieves strong gains especially in reconstruction-heavy tasks.

Abstract

A persistent paradox in time-series forecasting is that structurally simple MLP and linear models often outperform high-capacity Transformers. We argue that this gap arises from a mismatch in the sequence-modeling primitive: while many time-series dynamics are governed by global temporal operators (e.g., filtering and harmonic structure), standard attention forms each output as a convex combination of inputs. This restricts its ability to represent signed and oscillatory transformations that are fundamental to temporal signal processing. We formalize this limitation as a simplex-constrained mixing bottleneck in softmax attention, which becomes especially restrictive for operator-driven time-series tasks. To address this, we propose $Temporal Operator Attention (TOA)$ , a framework that augments attention with explicit, learnable sequence-space operators, enabling direct signed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.