Leveraging Importance Sampling to Detach Alignment Modules from Large Language Models

Yi Liu; Dianqing Liu; Mingye Zhu; Junbo Guo; Yongdong Zhang; Zhendong Mao

arXiv:2505.19700·cs.CL·December 16, 2025

Leveraging Importance Sampling to Detach Alignment Modules from Large Language Models

Yi Liu, Dianqing Liu, Mingye Zhu, Junbo Guo, Yongdong Zhang, Zhendong Mao

PDF

Open Access

TL;DR

This paper introduces a Residual Alignment Model (RAM) that uses importance sampling to detach alignment modules from large language models, enabling flexible and scalable alignment without retraining the entire model.

Contribution

The paper proposes a novel importance sampling framework for LLM alignment, allowing independent training and detachment of the alignment module from the base model.

Findings

01

Outperforms baseline models on instruction following tasks

02

Effective in domain adaptation and preference optimization

03

Improves flexibility and scalability of LLM alignment

Abstract

The widespread adoption of large language models (LLMs) across industries has increased the demand for high-quality and customizable outputs. However, traditional alignment methods often require retraining large pretrained models, making it difficult to quickly adapt and optimize LLMs for diverse applications. To address this limitation, we propose a novel \textit{Residual Alignment Model} (\textit{RAM}) that formalizes the alignment process as a type of importance sampling. In this framework, the unaligned upstream model serves as the proposal distribution, while the alignment process is framed as secondary sampling based on an autoregressive alignment module that acts as an estimator of the importance weights. This design enables a natural detachment of the alignment module from the target aligned model, improving flexibility and scalability. Based on this model, we derive an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques