Plasticity vs. Rigidity: The Impact of Low-Rank Adapters on Reasoning on a Micro-Budget
Zohaib Khan, Omer Tafveez, Zoha Hayat Bhatti

TL;DR
This paper explores how low-rank adapters influence reasoning in small language models under strict computational constraints, revealing that higher-rank adapters significantly improve reasoning abilities, but only in certain model types.
Contribution
It demonstrates that high-rank adapters enable reasoning improvements in small models trained with limited resources, highlighting the importance of adapter capacity and model initialization.
Findings
High-rank adapters (r=256) improve reasoning performance.
Instruction-tuned models benefit from increased adapter capacity.
Math-aligned models can suffer performance collapse under low-budget RL.
Abstract
Recent advances in mathematical reasoning typically rely on massive scale, yet the question remains: can strong reasoning capabilities be induced in small language models () under extreme constraints? We investigate this by training models on a single A40 GPU (48GB) for under 24 hours using Reinforcement Learning with Verifiable Rewards (RLVR) and Low-Rank Adaptation (LoRA). We find that the success of this ``micro-budget" regime depends critically on the interplay between adapter capacity and model initialization. While low-rank adapters () consistently fail to capture the complex optimization dynamics of reasoning, high-rank adapters () unlock significant plasticity in standard instruction-tuned models. Our best result achieved an impressive 40.0\% Pass@1 on AIME 24 (an 11.1\% absolute improvement over baseline) and pushed Pass@16 to 70.0\%, demonstrating…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMultimodal Machine Learning Applications · Reinforcement Learning in Robotics · Ferroelectric and Negative Capacitance Devices
