Reinforcement Learning Finetunes Small Subnetworks in Large Language Models

Sagnik Mukherjee; Lifan Yuan; Dilek Hakkani-Tur; Hao Peng

arXiv:2505.11711·cs.LG·December 19, 2025

Reinforcement Learning Finetunes Small Subnetworks in Large Language Models

Sagnik Mukherjee, Lifan Yuan, Dilek Hakkani-Tur, Hao Peng

PDF

Open Access

TL;DR

Reinforcement learning fine-tunes only a small, sparse subnetwork within large language models, achieving comparable performance to full fine-tuning without explicit sparsity constraints.

Contribution

This study reveals that RL fine-tuning induces intrinsic parameter update sparsity across diverse models and algorithms, with small subnetworks capturing most performance gains.

Findings

01

RL fine-tuning updates only 5-30% of parameters.

02

Subnetwork fine-tuning recovers full test accuracy.

03

Update sparsity is consistent across models and algorithms.

Abstract

Reinforcement learning (RL) yields substantial improvements in large language models (LLMs) downstream task performance and alignment with human values. Surprisingly, such large gains result from updating only a small subnetwork comprising just 5 percent to 30 percent of the parameters, with the rest effectively unchanged. We refer to this phenomenon as parameter update sparsity induced by RL. It is observed across all 7 widely used RL algorithms (e.g., PPO, GRPO, DPO) and all 10 LLMs from different families in our experiments. This sparsity is intrinsic and occurs without any explicit sparsity promoting regularizations or architectural constraints. Finetuning the subnetwork alone recovers the test accuracy, and, remarkably, produces a model nearly identical to the one obtained via full finetuning. The subnetworks from different random seeds, training data, and even RL algorithms show…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Artificial Intelligence in Healthcare and Education

MethodsEntropy Regularization · Proximal Policy Optimization