Spherical Position Encoding for Transformers

Eren Unlu

arXiv:2310.04454·cs.CL·October 10, 2023·1 cites

Spherical Position Encoding for Transformers

Eren Unlu

PDF

Open Access

TL;DR

This paper introduces a spherical position encoding mechanism for transformers, called geotokens, which effectively captures geographical coordinates and their relative distances, improving modeling of spatial data.

Contribution

It proposes a novel spherical position encoding method based on RoPE, tailored for geospatial data, extending transformer capabilities beyond sequential language tasks.

Findings

01

Effective encoding of geographical coordinates in transformers

02

Maintains proportional distances on spherical surfaces

03

Enhances spatial data modeling in transformer architectures

Abstract

Position encoding is the primary mechanism which induces notion of sequential order for input tokens in transformer architectures. Even though this formulation in the original transformer paper has yielded plausible performance for general purpose language understanding and generation, several new frameworks such as Rotary Position Embedding (RoPE) are proposed for further enhancement. In this paper, we introduce the notion of "geotokens" which are input elements for transformer architectures, each representing an information related to a geological location. Unlike the natural language the sequential position is not important for the model but the geographical coordinates are. In order to induce the concept of relative position for such a setting and maintain the proportion between the physical distance and distance on embedding space, we formulate a position encoding mechanism based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSemantic Web and Ontologies · Constraint Satisfaction and Optimization · Speech and dialogue systems