PSC: Extending Context Window of Large Language Models via Phase Shift Calibration

Wenqiao Zhu; Chao Xu; Lulu Wang; Jun Wu

arXiv:2505.12423·cs.CL·May 20, 2025

PSC: Extending Context Window of Large Language Models via Phase Shift Calibration

Wenqiao Zhu, Chao Xu, Lulu Wang, Jun Wu

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces PSC, a calibration module that enhances existing position encoding methods like RoPE, enabling large language models to effectively extend their context windows up to 64k tokens with improved performance.

Contribution

PSC provides a simple calibration approach that improves the effectiveness of existing frequency rescaling methods for expanding LLM context windows.

Findings

01

PSC improves perplexity reductions across larger context windows

02

The method is broadly applicable across models and tasks

03

Enhances existing position encoding techniques like PI, YaRN, and LongRoPE

Abstract

Rotary Position Embedding (RoPE) is an efficient position encoding approach and is widely utilized in numerous large language models (LLMs). Recently, a lot of methods have been put forward to further expand the context window based on RoPE. The core concept of those methods is to predefine or search for a set of factors to rescale the base frequencies of RoPE. Nevertheless, it is quite a challenge for existing methods to predefine an optimal factor due to the exponential search space. In view of this, we introduce PSC (Phase Shift Calibration), a small module for calibrating the frequencies predefined by existing methods. With the employment of PSC, we demonstrate that many existing methods can be further enhanced, like PI, YaRN, and LongRoPE. We conducted extensive experiments across multiple models and tasks. The results demonstrate that (1) when PSC is enabled, the comparative…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wnqzhu/psc
pytorchOfficial

Videos

PSC: Extending Context Window of Large Language Models via Phase Shift Calibration· underline

Taxonomy

TopicsMultimodal Machine Learning Applications · Topic Modeling · Natural Language Processing Techniques

MethodsSparse Evolutionary Training · Balanced Selection