ShadowPEFT: Shadow Network for Parameter-Efficient Fine-Tuning
Xianming Li, Zongxi Li, Tsz-fung Andrew Lee, Jing Li, Haoran Xie, Qing Li

TL;DR
ShadowPEFT introduces a layer-level refinement framework for parameter-efficient fine-tuning of large language models, offering a flexible alternative to traditional weight perturbation methods like LoRA.
Contribution
It proposes a centralized shadow module for layer-space adaptation, enabling reuse, pretraining, and deployment flexibility across various scenarios.
Findings
ShadowPEFT matches or outperforms LoRA and DoRA with similar parameter budgets.
The shadow module can be pretrained independently and reused across layers.
Experiments show advantages in inference latency and system-level deployment.
Abstract
Parameter-efficient fine-tuning (PEFT) reduces the training cost of full-parameter fine-tuning for large language models (LLMs) by training only a small set of task-specific parameters while freezing the pretrained backbone. However, existing approaches, such as Low-Rank Adaptation (LoRA), achieve adaptation by inserting independent low-rank perturbations directly to individual weights, resulting in a local parameterization of adaptation. We propose ShadowPEFT, a centralized PEFT framework that instead performs layer-level refinement through a depth-shared shadow module. At each transformer layer, ShadowPEFT maintains a parallel shadow state and evolves it repeatedly for progressively richer hidden states. This design shifts adaptation from distributed weight-space perturbations to a shared layer-space refinement process. Since the shadow module is decoupled from the backbone, it can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗shadow-llm/Qwen3-0.6B-H8Bmodel· 129 dl· ♡ 1129 dl♡ 1
- 🤗shadow-llm/Qwen3-4B-GSM8k-Shadowmodel
- 🤗shadow-llm/Qwen3-4B-SquadV2-Shadowmodel
- 🤗shadow-llm/Qwen3-4B-MMLU-Shadowmodel
- 🤗shadow-llm/Qwen3-4B-MMLU-LoRAmodel
- 🤗shadow-llm/Qwen3-8B-SquadV2-Shadowmodel
- 🤗shadow-llm/Qwen3-8B-GSM8k-Shadowmodel
- 🤗shadow-llm/Qwen3-0.6B-GSM8k-Shadowmodel
- 🤗shadow-llm/Qwen3-0.6B-MMLU-Shadowmodel
- 🤗shadow-llm/robot-dog-detached-shadowmodel· 5 dl5 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
