Bias-Restrained Prefix Representation Finetuning for Mathematical Reasoning

Sirui Liang; Pengfei Cao; Jian Zhao; Cong Huang; Jun Zhao; Kang Liu

arXiv:2511.10707·cs.LG·November 17, 2025

Bias-Restrained Prefix Representation Finetuning for Mathematical Reasoning

Sirui Liang, Pengfei Cao, Jian Zhao, Cong Huang, Jun Zhao, Kang Liu

PDF

Open Access

TL;DR

This paper introduces BREP ReFT, a novel fine-tuning method that improves mathematical reasoning in language models by optimizing early inference stages and constraining interventions, outperforming existing PEFT and ReFT methods.

Contribution

The paper proposes BREP ReFT, a new representation fine-tuning approach that enhances mathematical reasoning by focusing on early inference optimization and intervention constraints.

Findings

01

BREP ReFT outperforms standard ReFT and PEFT on mathematical reasoning tasks.

02

It demonstrates superior effectiveness, efficiency, and generalization across various models.

03

Extensive experiments validate the robustness of the proposed method.

Abstract

Parameter-Efficient finetuning (PEFT) enhances model performance on downstream tasks by updating a minimal subset of parameters. Representation finetuning (ReFT) methods further improve efficiency by freezing model weights and optimizing internal representations with fewer parameters than PEFT, outperforming PEFT on several tasks. However, ReFT exhibits a significant performance decline on mathematical reasoning tasks. To address this problem, the paper demonstrates that ReFT's poor performance on mathematical tasks primarily stems from its struggle to generate effective reasoning prefixes during the early inference phase. Moreover, ReFT disturbs the numerical encoding and the error accumulats during the CoT stage. Based on these observations, this paper proposes Bias-REstrained Prefix Representation FineTuning (BREP ReFT), which enhances ReFT's mathematical reasoning capability by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Constraint Satisfaction and Optimization · Topic Modeling