Rotary Positional Embeddings as Phase Modulation: Theoretical Bounds on the RoPE Base for Long-Context Transformers
Feilong Liu

TL;DR
This paper reinterprets Rotary Positional Embeddings as phase modulation, deriving bounds on the RoPE base to ensure effective long-context encoding in transformers, validated through analysis of state-of-the-art models.
Contribution
It provides a theoretical framework for understanding RoPE behavior at long contexts, including bounds on the RoPE base and insights into model failures and scaling limits.
Findings
Derived lower bounds on RoPE base for long-context coherence
Identified a precision-dependent upper bound causing positional erasure
Validated bounds with state-of-the-art model case studies
Abstract
Rotary positional embeddings (RoPE) are widely used in large language models to encode token positions through multiplicative rotations, yet their behavior at long context lengths remains poorly characterized. In this work, we reinterpret RoPE as phase modulation applied to a bank of complex oscillators, enabling analysis through classical signal processing theory. Under this formulation, we derive principled lower bounds on the RoPE base parameter that are necessary to preserve positional coherence over a target context length. These include a fundamental aliasing bound, analogous to a Nyquist limit, and a DC-component stability bound that constrains phase drift in low-frequency positional modes. We further extend this analysis to deep transformers, showing that repeated rotary modulation across layers compounds angular misalignment, tightening the base requirement as depth…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Topic Modeling
