LaMPE: Length-aware Multi-grained Positional Encoding for Adaptive Long-context Scaling Without Training

Sikui Zhang; Guangze Gao; Ziyun Gan; Chunfeng Yuan; Zefeng Lin; Houwen Peng; Bing Li; Weiming Hu

arXiv:2508.02308·cs.CL·August 6, 2025

LaMPE: Length-aware Multi-grained Positional Encoding for Adaptive Long-context Scaling Without Training

Sikui Zhang, Guangze Gao, Ziyun Gan, Chunfeng Yuan, Zefeng Lin, Houwen Peng, Bing Li, Weiming Hu

PDF

TL;DR

LaMPE is a training-free, length-aware positional encoding method that adaptively allocates positional capacity in RoPE-based LLMs, significantly improving long-context performance without retraining.

Contribution

Introduces LaMPE, a novel dynamic, multi-grained positional encoding approach that enhances long-context scaling in LLMs without additional training.

Findings

01

Significant performance gains on long-context benchmarks.

02

Effective adaptation to varying input lengths.

03

Compatibility with existing RoPE-based LLMs.

Abstract

Large language models (LLMs) experience significant performance degradation when the input exceeds the pretraining context window, primarily due to the out-of-distribution (OOD) behavior of Rotary Position Embedding (RoPE). Recent studies mitigate this problem by remapping OOD positions into the in-distribution range with fixed mapping strategies, ignoring the dynamic relationship between input length and the model's effective context window. To this end, we propose Length-aware Multi-grained Positional Encoding (LaMPE), a training-free method that fully utilizes the model's effective context window for adaptive long-context scaling in LLMs. Motivated by the left-skewed frequency distribution of relative positions, LaMPE establishes a dynamic relationship between mapping length and input length through a parametric scaled sigmoid function to adaptively allocate positional capacity…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.