Rough Transformers: Lightweight and Continuous Time Series Modelling   through Signature Patching

Fernando Moreno-Pino; \'Alvaro Arroyo; Harrison Waldon; Xiaowen Dong,; \'Alvaro Cartea

arXiv:2405.20799·stat.ML·January 14, 2025

Rough Transformers: Lightweight and Continuous Time Series Modelling through Signature Patching

Fernando Moreno-Pino, \'Alvaro Arroyo, Harrison Waldon, Xiaowen Dong,, \'Alvaro Cartea

PDF

Open Access 1 Repo

TL;DR

The paper introduces Rough Transformers, a lightweight continuous-time model for time-series data that captures multi-scale dependencies efficiently, outperforming traditional models in accuracy and computational cost.

Contribution

It presents a novel multi-view signature attention mechanism and a continuous-time Transformer variant that reduces computational costs while improving dependency modeling.

Findings

01

Outperforms vanilla Transformers on various time-series tasks

02

Reduces computational time and memory usage significantly

03

Captures local and global dependencies effectively

Abstract

Time-series data in real-world settings typically exhibit long-range dependencies and are observed at non-uniform intervals. In these settings, traditional sequence-based recurrent models struggle. To overcome this, researchers often replace recurrent architectures with Neural ODE-based models to account for irregularly sampled data and use Transformer-based architectures to account for long-range dependencies. Despite the success of these two approaches, both incur very high computational costs for input sequences of even moderate length. To address this challenge, we introduce the Rough Transformer, a variation of the Transformer model that operates on continuous-time representations of input sequences and incurs significantly lower computational costs. In particular, we propose multi-view signature attention, which uses path signatures to augment vanilla attention and to capture both…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

alvaroarroyo/rformer
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAlgorithms and Data Compression · Advanced Database Systems and Queries · Data Management and Algorithms

MethodsAttention Is All You Need · Linear Layer · Byte Pair Encoding · Label Smoothing · Adam · Residual Connection · Position-Wise Feed-Forward Layer · Multi-Head Attention · Dropout · Dense Connections