Self-attention with Functional Time Representation Learning
Da Xu, Chuanwei Ruan, Sushant Kumar, Evren Korpeoglu, Kannan Achan

TL;DR
This paper introduces a novel functional time representation learning approach for self-attention models, enabling them to effectively incorporate temporal information in event sequence modeling, which improves performance on real-world datasets.
Contribution
It proposes a new functional feature map and time kernel to embed time spans into high-dimensional spaces, bridging the gap between time-independent and time-dependent sequence modeling in self-attention.
Findings
Models outperform baselines in continuous-time event prediction tasks.
The proposed methods effectively capture time-event interactions.
Functional time representations improve interpretability and performance.
Abstract
Sequential modelling with self-attention has achieved cutting edge performances in natural language processing. With advantages in model flexibility, computation complexity and interpretability, self-attention is gradually becoming a key component in event sequence models. However, like most other sequence models, self-attention does not account for the time span between events and thus captures sequential signals rather than temporal patterns. Without relying on recurrent network structures, self-attention recognizes event orderings via positional encoding. To bridge the gap between modelling time-independent and time-dependent event sequence, we introduce a functional feature map that embeds time span into high-dimensional spaces. By constructing the associated translation-invariant time kernel function, we reveal the functional forms of the feature map under classic functional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Topic Modeling · Advanced Text Analysis Techniques
