Fine-grained Attention in Hierarchical Transformers for Tabular Time-series
Raphael Azorin, Zied Ben Houidi, Massimo Gallo, Alessandro Finamore,, and Pietro Michiardi

TL;DR
This paper introduces Fieldy, a hierarchical transformer model with fine-grained attention at both row and column levels, improving performance on tabular time-series tasks by capturing detailed field-level patterns.
Contribution
The paper proposes a novel fine-grained hierarchical attention mechanism for transformers that enhances pattern learning in tabular time-series data at the field level.
Findings
Fieldy outperforms state-of-the-art models on regression tasks.
Combining row-wise and column-wise attention improves accuracy.
Model size remains comparable to existing approaches.
Abstract
Tabular data is ubiquitous in many real-life systems. In particular, time-dependent tabular data, where rows are chronologically related, is typically used for recording historical events, e.g., financial transactions, healthcare records, or stock history. Recently, hierarchical variants of the attention mechanism of transformer architectures have been used to model tabular time-series data. At first, rows (or columns) are encoded separately by computing attention between their fields. Subsequently, encoded rows (or columns) are attended to one another to model the entire tabular time-series. While efficient, this approach constrains the attention granularity and limits its ability to learn patterns at the field-level across separate rows, or columns. We take a first step to address this gap by proposing Fieldy, a fine-grained hierarchical model that contextualizes fields at both the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Time Series Analysis and Forecasting · Neural Networks and Reservoir Computing
MethodsSoftmax · Attention Is All You Need
