Improving Transformers using Faithful Positional Encoding
Tsuyoshi Id\'e, Jokin Labaien, and Pin-Yu Chen

TL;DR
This paper introduces a mathematically grounded positional encoding for Transformers that preserves input order information and enhances performance in time-series classification tasks.
Contribution
The paper presents a novel positional encoding method for Transformers, ensuring order information is retained and improving prediction accuracy.
Findings
Systematic performance improvement in time-series classification
Mathematically guaranteed preservation of positional information
Outperforms standard sinusoidal encoding in experiments
Abstract
We propose a new positional encoding method for a neural network architecture called the Transformer. Unlike the standard sinusoidal positional encoding, our approach is based on solid mathematical grounds and has a guarantee of not losing information about the positional order of the input sequence. We show that the new encoding approach systematically improves the prediction performance in the time-series classification task.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization
MethodsAttention Is All You Need · Linear Layer · Multi-Head Attention · Dense Connections · Position-Wise Feed-Forward Layer · Dropout · Label Smoothing · Residual Connection · Absolute Position Encodings · Byte Pair Encoding
