Rethinking Positional Encoding

Jianqiao Zheng; Sameera Ramasinghe; Simon Lucey

arXiv:2107.02561·cs.LG·October 13, 2021·30 cites

Rethinking Positional Encoding

Jianqiao Zheng, Sameera Ramasinghe, Simon Lucey

PDF

Open Access 1 Repo

TL;DR

This paper broadens the understanding of positional encoding by showing that non-Fourier embeddings can be effective, and introduces a general theory based on shifted basis functions that explains their performance.

Contribution

It demonstrates that alternative non-Fourier embeddings can be used for positional encoding and develops a general theoretical framework based on shifted basis functions.

Findings

01

Fourier features are a special case of a broader class of embeddings.

02

Performance depends on the trade-off between stable rank and distance preservation.

03

Theoretical predictions are empirically validated.

Abstract

It is well noted that coordinate based MLPs benefit -- in terms of preserving high-frequency information -- through the encoding of coordinate positions as an array of Fourier features. Hitherto, the rationale for the effectiveness of these positional encodings has been solely studied through a Fourier lens. In this paper, we strive to broaden this understanding by showing that alternative non-Fourier embedding functions can indeed be used for positional encoding. Moreover, we show that their performance is entirely determined by a trade-off between the stable rank of the embedded matrix and the distance preservation between embedded coordinates. We further establish that the now ubiquitous Fourier feature mapping of position is a special case that fulfills these conditions. Consequently, we present a more general theory to analyze positional encoding in terms of shifted basis…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

osiriszjq/Rethinking-positional-encoding
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Natural Language Processing Techniques · Speech and dialogue systems