Linearized Relative Positional Encoding
Zhen Qin, Weixuan Sun, Kaiyue Lu, Hui Deng, Dongxu Li, Xiaodong Han,, Yuchao Dai, Lingpeng Kong, Yiran Zhong

TL;DR
This paper introduces a new family of linear relative positional encoding algorithms, LRPE, that are compatible with linear transformers and achieve state-of-the-art results across multiple tasks.
Contribution
It develops a principled framework for designing linear relative positional encodings using unitary transformations, enabling effective and efficient encoding for linear transformers.
Findings
LRPE achieves state-of-the-art performance in language modeling.
LRPE outperforms existing methods in text and image classification.
The framework generalizes to various applications and preserves linear complexity.
Abstract
Relative positional encoding is widely used in vanilla and linear transformers to represent positional information. However, existing encoding methods of a vanilla transformer are not always directly applicable to a linear transformer, because the latter requires a decomposition of the query and key representations into separate kernel functions. Nevertheless, principles for designing encoding methods suitable for linear transformers remain understudied. In this work, we put together a variety of existing linear relative positional encoding approaches under a canonical form and further propose a family of linear relative positional encoding algorithms via unitary transformation. Our formulation leads to a principled framework that can be used to develop new relative positional encoding methods that preserve linear space-time complexity. Equipped with different models, the proposed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques · Text and Document Classification Technologies
