Reinforcement Learning Fine-Tunes a Sparse Subnetwork in Large Language Models

Andrii Balashov

arXiv:2507.17107·cs.LG·July 30, 2025

Reinforcement Learning Fine-Tunes a Sparse Subnetwork in Large Language Models

Andrii Balashov

PDF

Open Access

TL;DR

This paper reveals that reinforcement learning fine-tuning of large language models primarily updates a small, sparse subnetwork, maintaining most parameters unchanged, which can be used to achieve full performance efficiently.

Contribution

It demonstrates that RL fine-tuning consistently modifies only a small subnetwork, revealing a new sparsity phenomenon and enabling efficient model adaptation.

Findings

01

RL fine-tuning updates only 5-30% of weights

02

Subnetwork overlap is significant across seeds and datasets

03

Fine-tuning the subnetwork recovers full model performance

Abstract

Reinforcement learning (RL) is a key post-pretraining step for aligning large language models (LLMs) with complex tasks and human preferences. While it is often assumed that RL fine-tuning requires updating most of a model's parameters, we challenge this assumption with a surprising finding: RL fine-tuning consistently modifies only a small subnetwork (typically 5-30% of weights), leaving most parameters unchanged. We call this phenomenon RL-induced parameter update sparsity. It arises naturally, without any sparsity constraints or parameter-efficient tuning, and appears across multiple RL algorithms (e.g., PPO, DPO, SimPO, PRIME) and model families (e.g., OpenAI, Meta, and open-source LLMs). Moreover, the subnetworks updated by RL show substantial overlap across different seeds, datasets, and algorithms-far exceeding chance-suggesting a partially transferable structure in the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling