Constructive Circuit Amplification: Improving Math Reasoning in LLMs via Targeted Sub-Network Updates
Nikhil Prakash, Donghao Ren, Dominik Moritz, Yannick Assogba

TL;DR
This paper introduces Constructive Circuit Amplification, a method that enhances mathematical reasoning in large language models by selectively updating key circuit components, leading to significant accuracy improvements with minimal model modifications.
Contribution
The paper presents a novel targeted update technique that identifies and modifies crucial model circuits to improve specific reasoning tasks in LLMs.
Findings
Up to +11.4% accuracy improvement in math reasoning
Modifies only 1.59% of model components
Minimal impact on other abilities like MMLU, TriviaQA, and TruthfulQA
Abstract
Prior studies investigating the internal workings of LLMs have uncovered sparse subnetworks, often referred to as circuits, that are responsible for performing specific tasks. Additionally, it has been shown that model performance improvement through fine-tuning often results from the strengthening of existing circuits in the model. Taken together, these findings suggest the possibility of intervening directly on such circuits to make precise, task-targeted updates. Motivated by these findings, we propose a novel method called Constructive Circuit Amplification which identifies pivotal tokens from model reasoning traces as well as model components responsible for the desired task, and updates only those components. Applied to mathematical reasoning, it improves accuracy by up to +11.4% across multiple models while modifying as little as 1.59% of model components, with minimal impact on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Model Reduction and Neural Networks · Ferroelectric and Negative Capacitance Devices
