Math Neurosurgery: Isolating Language Models' Math Reasoning Abilities Using Only Forward Passes
Bryan R. Christ, Zack Gottesman, Jonathan Kropko, Thomas Hartvigsen

TL;DR
MathNeuro is a method to isolate and modify math reasoning capabilities in large language models by identifying math-specific parameters, enabling targeted improvements without affecting general language skills.
Contribution
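The paper introduces MathNeuro, a computationally efficient method that uses only forward passes to isolate math-specific parameters in LLMs: it scores parameters by importance using weights and activations, filters out those that are also important for general language tasks, and then prunes or scales what remains.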
Findings
Scaling the identified math-specific parameters by a small constant improves math performance by up to 35%, without altering non-math behavior.
MathNeuro retains most of its effectiveness when math-specific parameters are identified from a single sample.
Pruning the identified parameters removes math reasoning ability without significantly harming general language ability.
Abstract
Math reasoning is an active area of Large Language Model (LLM) research because it is a hallmark of artificial intelligence and has implications in several domains, including math education. However, few works have explored how math reasoning is encoded within LLM parameters and if it is a skill that can be isolated within models. Doing so could allow targeted intervention to improve math performance without altering non-math behavior and foster understanding of how models encode math reasoning. We introduce Math Neurosurgery (MathNeuro), a computationally efficient method we use to isolate math-specific parameters in LLMs using only forward passes. MathNeuro builds on existing work by using weights and activations to calculate parameter importance, but isolates math-specific parameters by filtering out those important for general language tasks. Through pruning parameters MathNeuro…
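The identification step described in the abstract can be sketched roughly as follows. This is a minimal illustration on a single weight matrix, not the authors' implementation. The scoring rule (absolute weight times the L2 norm of the matching input activation), the top-fraction threshold `keep_frac`, the scale `factor`, and all function names are assumptions made for this example.

```python
import numpy as np

def importance(weight, activations):
    """Assumed score: |w_ij| * ||x_j||_2, computed from forward-pass activations."""
    # weight: (out_features, in_features); activations: (n_tokens, in_features)
    act_norm = np.linalg.norm(activations, axis=0)  # one norm per input feature
    return np.abs(weight) * act_norm                # broadcasts across rows

def top_mask(scores, keep_frac):
    """Boolean mask marking (at least) the top keep_frac fraction of scores."""
    k = max(1, int(round(keep_frac * scores.size)))
    threshold = np.partition(scores.ravel(), -k)[-k]  # k-th largest score
    return scores >= threshold

def math_specific_mask(weight, math_acts, general_acts, keep_frac=0.1):
    """Parameters important for math inputs but not for general-language inputs."""
    math_top = top_mask(importance(weight, math_acts), keep_frac)
    general_top = top_mask(importance(weight, general_acts), keep_frac)
    return math_top & ~general_top

def prune(weight, mask):
    """Zero out the masked parameters."""
    return np.where(mask, 0.0, weight)

def scale(weight, mask, factor=1.1):
    """Multiply the masked parameters by a small constant (value is illustrative)."""
    return np.where(mask, weight * factor, weight)

# Toy data standing in for one layer and two batches of activations.
rng = np.random.default_rng(0)
W = rng.normal(size=(8, 16))
math_acts = rng.normal(size=(32, 16))
general_acts = rng.normal(size=(32, 16))

mask = math_specific_mask(W, math_acts, general_acts, keep_frac=0.1)
W_pruned = prune(W, mask)
W_scaled = scale(W, mask, factor=1.1)
```

In a real model the same procedure would presumably be applied layer by layer, with activations collected from forward passes over math and non-math inputs; no gradients are needed at any point, which is what keeps the approach cheap.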
Taxonomy
Topics: Intelligent Tutoring Systems and Adaptive Learning · Neurobiology of Language and Bilingualism · Text Readability and Simplification
Methods: Pruning
