Flexible Realignment of Language Models

Wenhong Zhu; Ruobing Xie; Weinan Zhang; Rui Wang

arXiv:2506.12704·cs.CL·January 13, 2026

Flexible Realignment of Language Models

Wenhong Zhu, Ruobing Xie, Weinan Zhang, Rui Wang

PDF

Open Access 7 Models

TL;DR

This paper introduces a flexible realignment framework for language models that allows for adjustable alignment during training and inference, improving efficiency and enabling deeper reasoning without performance loss.

Contribution

It presents a novel framework combining training-time and inference-time realignment techniques, including a controllable logit fusion method and a layer adapter for flexible model alignment.

Findings

01

Reduces token usage by 54.63% without performance loss

02

Outperforms previous methods in alignment efficiency

03

Enables deeper reasoning and flexible inference control

Abstract

Realignment becomes necessary when a language model (LM) fails to meet expected performance. We propose a flexible realignment framework that supports quantitative control of alignment degree during training and inference. This framework incorporates Training-time Realignment (TrRa), which efficiently realigns the reference model by leveraging the controllable fusion of logits from both the reference and already aligned models. For example, TrRa reduces token usage by 54.63% on DeepSeek-R1-Distill-Qwen-1.5B without any performance degradation, outperforming DeepScaleR-1.5B's 33.86%. To complement TrRa during inference, we introduce a layer adapter that enables smooth Inference-time Realignment (InRa). This adapter is initialized to perform an identity transformation at the bottom layer and is inserted preceding the original layers. During inference, input embeddings are simultaneously…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling