Shadow-FT: Tuning Instruct Model via Training on Paired Base Model

Taiqiang Wu; Runming Yang; Jiayi Li; Pengfei Hu; Yik-Chung Wu; Ngai Wong; Yujiu Yang

arXiv:2505.12716·cs.CL·September 29, 2025

Shadow-FT: Tuning Instruct Model via Training on Paired Base Model

Taiqiang Wu, Runming Yang, Jiayi Li, Pengfei Hu, Yik-Chung Wu, Ngai Wong, Yujiu Yang

PDF

1 Repo 8 Models 3 Reviews

TL;DR

Shadow-FT is a novel fine-tuning method that leverages base models to improve instruction-tuned LLMs by directly grafting learned weight updates, leading to significant performance gains across diverse benchmarks.

Contribution

The paper introduces Shadow-FT, a simple yet effective framework for tuning instruct models by leveraging base models, avoiding additional parameters and outperforming traditional tuning methods.

Findings

01

Shadow-FT outperforms conventional tuning approaches across 19 benchmarks.

02

It can be applied to multimodal LLMs and combined with DPO.

03

The method significantly improves performance with no extra parameters.

Abstract

Large language models (LLMs) consistently benefit from further fine-tuning on various tasks. However, we observe that directly tuning the Instruct (i.e., instruction-tuned) models often leads to marginal improvements and even performance degeneration. Notably, paired Base models, the foundation for these Instruct variants, contain highly similar weight values (i.e., less than 2% on average for Llama 3.1 8B). The Base model tends to be a good learner yet a weak backbone without post-training. Therefore, we propose a novel Shadow-FT framework to tune the Instruct models by leveraging the corresponding Base models. The key insight is to fine-tune the Base model, and then \textit{directly} graft the learned weight updates to the Instruct model. Our proposed Shadow-FT introduces no additional parameters, is easy to implement, and significantly improves performance. We conduct extensive…

Peer Reviews

Decision·Submitted to ICLR 2026

Reviewer 01Rating 4Confidence 4

Strengths

1. **Simple method**: The method requires only weight arithmetic W_I + (W⁺_B - W_B) with zero additional parameters, no extra training cost, and easy implementation. This simplicity aids adoption compared to complex meta-learning or distillation approaches 2. **Mechanistic insight**: Figure 5's analysis showing BASE exhibits 22.6% lower initial loss, 3.25× smaller initial gradient, and more stable gradient decay (58% vs. 11% at step 61) provides the strongest evidence for why BASE is a better l

Weaknesses

1. **No direct comparisons**: The paper cites Chat Vector, RE-Adapt, and concurrent work but provides no empirical comparisons. It should evaluate Shadow-FT against these and standard PEFT baselines to clarify both performance and efficiency advantages. 2. **Unexplained inconsistencies**: Shadow-FT sometimes degrades performance substantially (e.g., –3.3 to –18.7 on MATH tasks), contradicting its claimed consistency. The paper offers no analysis or framework for predicting when it helps or harm

Reviewer 02Rating 4Confidence 3

Strengths

- Simplicity + breadth. The method is simple and straightforward and seems widely applicable according to experiments, with coverage across different benchmarks. - Paper writing is clear and easy to follow.

Weaknesses

- Novelty is limited. The idea is closely related to task vectors, chat vectors, RE‑Adapt, and proxy tuning. - More/stronger theoretical framing beyond weight similarity maybe needed. - Baselines such as RE‑Adapt could be added.

Reviewer 03Rating 4Confidence 4

Strengths

1. The paper identifies a useful observation — the BASE and INSTRUCT models share over largely weight similarity (σ < 0.03) — and exploits this to propose a minimal yet elegant transfer mechanism. The “learn-on-BASE, apply-to-INSTRUCT” paradigm is intuitive. 2. Experiments are conducted across several major model families (Qwen, Llama, Gemma, Falcon, Yi) and cover 19 benchmarks (Math-7, Code-3, Knowledge-9). The results are extensive and generally consistent, showing that Shadow-FT often outper

Weaknesses

1. Lack of theoretical grounding. The claim that INSTRUCT fine-tuning “degenerates due to preexisting priors” lacks a formal mechanism or controlled ablation. 2. Lack of recent related work. Several existing works already transfer knowledge between BASE/INSTRUCT or different model checkpoints, the authors identified these work but did not properly compare to them, such as RE-Adapt [1], Task/Chat Vectors [2, 3] and Proxy-Tuning [4] 3. Missing of some implementation details, such as (a) whether o

Code & Models

Repositories

wutaiqiang/shadow-ft
pytorchOfficial

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsBalanced Selection · LLaMA