Not All Layers Need Tuning: Selective Layer Restoration Recovers Diversity
Bowen Zhang, Meiyi Wang, Harold Soh

TL;DR
This paper introduces Selective Layer Restoration (SLR), a post-training method that selectively restores certain layers of large language models to their pre-trained weights, improving output diversity without sacrificing quality.
Contribution
The paper proposes a novel, training-free approach called SLR that identifies and restores specific layers to recover diversity in LLM outputs, addressing mode collapse.
Findings
SLR increases diversity across multiple tasks and models.
Restoring certain layers balances diversity and quality effectively.
SLR incurs no additional inference cost.
Abstract
Post-training improves instruction-following and helpfulness of large language models (LLMs) but often reduces generation diversity, which leads to repetitive outputs in open-ended settings, a phenomenon known as mode collapse. Motivated by evidence that LLM layers play distinct functional roles, we hypothesize that mode collapse can be localized to specific layers and that restoring a carefully chosen range of layers to their pre-trained weights can recover diversity while maintaining high output quality. To validate this hypothesis and decide which layers to restore, we design a proxy task -- Constrained Random Character(CRC) -- with an explicit validity set and a natural diversity objective. Results on CRC reveal a clear diversity-validity trade-off across restoration ranges and identify configurations that increase diversity with minimal quality loss. Based on these findings, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
