Asking What Matters: Reward-Driven Clarification for Software Engineering Tasks

Sanidhya Vijayvargiya; Vijay Viswanathan; Graham Neubig

arXiv:2604.14624·cs.SE·April 17, 2026

TL;DR

This paper introduces CLARITI, a clarification module for software engineering tasks trained with reinforcement learning. Its rewards target two properties of a useful question, task relevance and user answerability, so the module asks fewer unnecessary questions.

Contribution

The paper grounds its reinforcement learning rewards in an empirical analysis of which information affects task success and which questions users can realistically answer, with the aim of making clarification more efficient.

Findings

01

CLARITI matches GPT-5's resolution rate on underspecified issues.

02

It generates 41% fewer questions than baseline methods.

03

Grounding rewards in empirical analysis enhances clarification effectiveness.

Abstract

Humans often specify tasks incompletely, so assistants must know when and how to ask clarifying questions. However, effective clarification remains challenging in software engineering tasks because not all missing information is equally valuable, and questions must target information users can realistically provide. We study clarification in real software engineering tasks by quantifying which types of information most affect task success and which questions elicit useful responses from simulated users. Using Shapley attribution and distributional comparisons, we identify two key properties of effective clarification: task relevance (which information predicts success) and user answerability (what users can realistically provide). We operationalize these properties as multi-stage reinforcement learning rewards to train CLARITI, an 8B-parameter clarification module that matches GPT-5's…
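The abstract names Shapley attribution as the tool for measuring which information types matter for task success. The sketch below shows the general technique only: exact Shapley values over a small set of information types. The type names, the `toy_success_rate` function, and every number in it are invented for illustration and are not taken from the paper, whose actual attribution procedure is not described in this excerpt.

```python
from itertools import combinations
from math import factorial


def shapley_values(players, value):
    """Exact Shapley values for a coalition value function.

    players: list of hashable items (here, information types).
    value:   function mapping a frozenset of players to a number.
    """
    n = len(players)
    phi = {p: 0.0 for p in players}
    for p in players:
        others = [q for q in players if q != p]
        for k in range(n):
            # Weight of a coalition of size k that does not contain p.
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for subset in combinations(others, k):
                s = frozenset(subset)
                # Marginal contribution of p when added to coalition s.
                phi[p] += weight * (value(s | {p}) - value(s))
    return phi


# Hypothetical information types (assumption, not from the paper).
INFO_TYPES = ["repro_steps", "expected_behavior", "file_location"]


def toy_success_rate(coalition):
    """Made-up resolution rate when only `coalition` is present in the issue."""
    base = 0.20
    gains = {"repro_steps": 0.25, "expected_behavior": 0.10, "file_location": 0.15}
    rate = base + sum(gains[i] for i in coalition)
    # Made-up interaction: repro steps help more once the file is known.
    if {"repro_steps", "file_location"} <= coalition:
        rate += 0.10
    return rate


phi = shapley_values(INFO_TYPES, toy_success_rate)
for info_type, contribution in sorted(phi.items(), key=lambda kv: -kv[1]):
    print(f"{info_type}: {contribution:.3f}")
```

In this toy setup the attributions sum to the gap between the fully specified and fully unspecified success rates, and the interaction bonus is split evenly between the two types that produce it. Exact enumeration is exponential in the number of information types, so a larger taxonomy would need a sampling-based approximation.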
