QuestA: Expanding Reasoning Capacity in LLMs via Question Augmentation

Jiazheng Li; Hongzhou Lin; Hong Lu; Kaiyue Wen; Zaiwen Yang; Jiaxuan Gao; Yi Wu; Jingzhao Zhang

arXiv:2507.13266·cs.CL·April 1, 2026

QuestA: Expanding Reasoning Capacity in LLMs via Question Augmentation

Jiazheng Li, Hongzhou Lin, Hong Lu, Kaiyue Wen, Zaiwen Yang, Jiaxuan Gao, Yi Wu, Jingzhao Zhang

PDF

1 Repo 1 Models 1 Datasets 1 Video

TL;DR

QuestA introduces question augmentation during RL training to enhance reasoning in large language models, leading to state-of-the-art results on math benchmarks with 1.5B models.

Contribution

The paper proposes a simple question augmentation strategy during RL training that improves reasoning performance of LLMs on math tasks.

Findings

01

Achieved new state-of-the-art results on math benchmarks.

02

Improved pass@1 and pass@k metrics on challenging problems.

03

Enhanced reasoning capabilities of open-source models.

Abstract

Reinforcement learning (RL) has emerged as a central paradigm for training large language models (LLMs) in reasoning tasks. Yet recent studies question RL's ability to incentivize reasoning capacity beyond the base model. This raises a key challenge: how can RL be adapted to solve harder reasoning problems more effectively? To address this challenge, we propose a simple yet effective strategy via Question Augmentation: introduce partial solutions during training to reduce problem difficulty and provide more informative learning signals. Our method, QuestA, when applied during RL training on math reasoning tasks, not only improves pass@1 but also pass@k-particularly on problems where standard RL struggles to make progress. This enables continual improvement over strong open-source models such as DeepScaleR and OpenMath Nemotron, further enhancing their reasoning capabilities. We achieve…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

foreverlasting1202/QuestA
github

Models

🤗
foreverlasting1202/QuestA-Nemotron-1.5B
model· 980 dl· ♡ 17
980 dl♡ 17

Datasets

foreverlasting1202/QuestA
dataset· 62 dl
62 dl

Videos

QuestA: Expanding Reasoning Capacity in LLMs via Question Augmentation· slideslive