QuestA: Expanding Reasoning Capacity in LLMs via Question Augmentation
Jiazheng Li, Hongzhou Lin, Hong Lu, Kaiyue Wen, Zaiwen Yang, Jiaxuan Gao, Yi Wu, Jingzhao Zhang

TL;DR
QuestA introduces question augmentation during RL training to enhance reasoning in large language models, leading to state-of-the-art results on math benchmarks with 1.5B models.
Contribution
The paper proposes a simple question augmentation strategy during RL training that improves reasoning performance of LLMs on math tasks.
Findings
Achieved new state-of-the-art results on math benchmarks.
Improved pass@1 and pass@k metrics on challenging problems.
Enhanced reasoning capabilities of open-source models.
Abstract
Reinforcement learning (RL) has emerged as a central paradigm for training large language models (LLMs) in reasoning tasks. Yet recent studies question RL's ability to incentivize reasoning capacity beyond the base model. This raises a key challenge: how can RL be adapted to solve harder reasoning problems more effectively? To address this challenge, we propose a simple yet effective strategy via Question Augmentation: introduce partial solutions during training to reduce problem difficulty and provide more informative learning signals. Our method, QuestA, when applied during RL training on math reasoning tasks, not only improves pass@1 but also pass@k-particularly on problems where standard RL struggles to make progress. This enables continual improvement over strong open-source models such as DeepScaleR and OpenMath Nemotron, further enhancing their reasoning capabilities. We achieve…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
