Flexible Realignment of Language Models
Wenhong Zhu, Ruobing Xie, Weinan Zhang, Rui Wang

TL;DR
This paper introduces a flexible realignment framework for language models that allows for adjustable alignment during training and inference, improving efficiency and enabling deeper reasoning without performance loss.
Contribution
It presents a novel framework combining training-time and inference-time realignment techniques, including a controllable logit fusion method and a layer adapter for flexible model alignment.
Findings
Reduces token usage by 54.63% without performance loss
Outperforms previous methods in alignment efficiency
Enables deeper reasoning and flexible inference control
Abstract
Realignment becomes necessary when a language model (LM) fails to meet expected performance. We propose a flexible realignment framework that supports quantitative control of alignment degree during training and inference. This framework incorporates Training-time Realignment (TrRa), which efficiently realigns the reference model by leveraging the controllable fusion of logits from both the reference and already aligned models. For example, TrRa reduces token usage by 54.63% on DeepSeek-R1-Distill-Qwen-1.5B without any performance degradation, outperforming DeepScaleR-1.5B's 33.86%. To complement TrRa during inference, we introduce a layer adapter that enables smooth Inference-time Realignment (InRa). This adapter is initialized to perform an identity transformation at the bottom layer and is inserted preceding the original layers. During inference, input embeddings are simultaneously…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗wh-zhu/DeepSeek-R1-TrRa-1.5B-lambda_2model
- 🤗wh-zhu/DeepSeek-R1-TrRa-1.5B-lambda_5model· 12 dl12 dl
- 🤗wh-zhu/DeepSeek-R1-TrRa-1.5B-lambda_10model
- 🤗wh-zhu/DeepSeek-R1-TrRa-iter2-1.5B-lambda_2model
- 🤗wh-zhu/DeepSeek-R1-TrRa-iter1-1.5B-lambda_2model
- 🤗wh-zhu/DeepSeek-R1-TrRa-1.5B_lambda_0.5model· 17 dl17 dl
- 🤗wh-zhu/DeepSeek-R1-TrRa-1.5B_lambda_1.5model
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
