Comparing Prior and Learned Time Representations in Transformer Models   of Timeseries

Natalia Koliou; Tatiana Boura; Stasinos Konstantopoulos; George; Meramveliotakis; George Kosmadakis

arXiv:2411.12476·cs.LG·November 20, 2024

Comparing Prior and Learned Time Representations in Transformer Models of Timeseries

Natalia Koliou, Tatiana Boura, Stasinos Konstantopoulos, George, Meramveliotakis, George Kosmadakis

PDF

Open Access

TL;DR

This paper compares fixed and learned time representations in Transformer models for time series prediction, highlighting challenges in encoding prior knowledge and emphasizing the need for human-in-the-loop approaches.

Contribution

It introduces a comparison between fixed and learned time representations in Transformers for time series, revealing difficulties in encoding prior knowledge and suggesting future research directions.

Findings

01

Fixed representations perform well on known periodicities.

02

Learned representations face challenges in encoding prior knowledge.

03

Human-in-the-loop methods may enhance robustness.

Abstract

What sets timeseries analysis apart from other machine learning exercises is that time representation becomes a primary aspect of the experiment setup, as it must adequately represent the temporal relations that are relevant for the application at hand. In the work described here we study wo different variations of the Transformer architecture: one where we use the fixed time representation proposed in the literature and one where the time representation is learned from the data. Our experiments use data from predicting the energy output of solar panels, a task that exhibits known periodicities (daily and seasonal) that is straight-forward to encode in the fixed time representation. Our results indicate that even in an experiment where the phenomenon is well-understood, it is difficult to encode prior knowledge due to side-effects that are difficult to mitigate. We conclude that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPower Systems and Technologies

MethodsAttention Is All You Need · Dense Connections · Label Smoothing · Adam · Residual Connection · Byte Pair Encoding · Linear Layer · Softmax · Position-Wise Feed-Forward Layer · Layer Normalization