Freeze Deep, Train Shallow: Interpretable Layer Allocation for Continued Pre-Training
Yu-Hang Wu, Qin-Yuan Liu, Qiu-Yang Zhao, Bo Jiang, Jiang-Feng Yang, Qing-Wei Cong

TL;DR
This paper introduces LayerTracer, a diagnostic framework that interprets layer importance in LLMs, guiding efficient layer freezing strategies for cost-effective continued pre-training.
Contribution
It provides an interpretable method to identify critical layers, demonstrating that training shallow layers while freezing deep layers improves performance and resource efficiency.
Findings
Deep layers are critical for task execution and stable against updates.
Training shallow layers while freezing deep layers outperforms full fine-tuning.
Hybrid models with high-quality modules in deep layers preserve knowledge effectively.
Abstract
Selective layer-wise updates are essential for low-cost continued pre-training of Large Language Models (LLMs), yet determining which layers to freeze or train remains an empirical black-box problem due to the lack of interpretable guidance. To address this issue, we propose LayerTracer, an architecture-agnostic diagnostic framework that reveals the evolution patterns of layer-wise representations and stability by locating task execution positions and quantifying layer sensitivity. Analysis results reveal that deep layers act as critical regions for task execution and maintain high stability against disruptive updates. Guided by this finding, we conduct three controlled continued pre-training trials to compare diverse freeze-train strategies, demonstrating that training shallow layers while freezing deep layers consistently outperforms full-parameter fine-tuning and the opposite…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
