BARD: budget-aware reasoning distillation

Lujie Niu; Lei Shen; Yi Jiang; Caixia Yuan; Xiaojie Wang; Wenbo Su; Bo zheng

arXiv:2511.01470·cs.CL·December 29, 2025

BARD: budget-aware reasoning distillation

Lujie Niu, Lei Shen, Yi Jiang, Caixia Yuan, Xiaojie Wang, Wenbo Su, Bo zheng

PDF

Open Access

TL;DR

BARD introduces a framework that enables smaller language models to perform reasoning tasks efficiently by controlling reasoning length and resource usage through a two-phase training process involving supervised fine-tuning and reinforcement learning.

Contribution

It presents a novel budget-aware reasoning distillation method that allows dynamic control of reasoning length in language models, improving efficiency and performance.

Findings

01

Achieves strong reasoning performance on benchmarks like AIME24, AIME25, GPQA.

02

Provides precise, adaptive control over reasoning length across various budgets.

03

Empowers an 8B model to balance reasoning quality and computational efficiency.

Abstract

While long Chain-of-Thought (CoT) distillation effectively transfers reasoning capability to smaller language models, the reasoning process often remains redundant and computational budget uncontrollable, leading to inefficient resource usage. To address this limitation, we propose \textbf{Budget-Aware Reasoning Distillation (BARD)}, a novel framework that simultaneously distills reasoning capability and enables fine-grained control over the reasoning length. BARD uses the thinking budget as a user-specified control signal, allowing the model to dynamically balance reasoning performance and computational efficiency. To achieve this concept, BARD introduces a two-phase training regimen. The first phase, Supervised Fine-Tuning (SFT) on teacher-generated long CoT data compressed to various budget levels, bootstrapping the model's understanding of budget constraints. The second phase…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Topic Modeling · Multimodal Machine Learning Applications