Improving Transformers using Faithful Positional Encoding

Tsuyoshi Id\'e; Jokin Labaien; and Pin-Yu Chen

arXiv:2405.09061·cs.LG·May 17, 2024

Improving Transformers using Faithful Positional Encoding

Tsuyoshi Id\'e, Jokin Labaien, and Pin-Yu Chen

PDF

Open Access

TL;DR

This paper introduces a mathematically grounded positional encoding for Transformers that preserves input order information and enhances performance in time-series classification tasks.

Contribution

The paper presents a novel positional encoding method for Transformers, ensuring order information is retained and improving prediction accuracy.

Findings

01

Systematic performance improvement in time-series classification

02

Mathematically guaranteed preservation of positional information

03

Outperforms standard sinusoidal encoding in experiments

Abstract

We propose a new positional encoding method for a neural network architecture called the Transformer. Unlike the standard sinusoidal positional encoding, our approach is based on solid mathematical grounds and has a guarantee of not losing information about the positional order of the input sequence. We show that the new encoding approach systematically improves the prediction performance in the time-series classification task.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization

MethodsAttention Is All You Need · Linear Layer · Multi-Head Attention · Dense Connections · Position-Wise Feed-Forward Layer · Dropout · Label Smoothing · Residual Connection · Absolute Position Encodings · Byte Pair Encoding