Ask or Assume? Uncertainty-Aware Clarification-Seeking in Coding Agents

Nicholas Edwards; Sebastian Schuster

arXiv:2603.26233·cs.CL·March 30, 2026

Ask or Assume? Uncertainty-Aware Clarification-Seeking in Coding Agents

Nicholas Edwards, Sebastian Schuster

PDF

1 Repo

TL;DR

This paper introduces an uncertainty-aware multi-agent system for coding agents that effectively detects underspecification and asks clarifying questions, significantly improving task resolution rates in software engineering tasks.

Contribution

It presents a novel multi-agent scaffold that separates underspecification detection from code execution, enhancing clarification-seeking capabilities of LLM agents.

Findings

01

Multi-agent system achieves 69.40% task resolve rate, outperforming single-agent systems.

02

The system exhibits well-calibrated uncertainty, balancing queries based on task complexity.

03

Proactive clarification improves agent collaboration in underspecified tasks.

Abstract

As Large Language Model (LLM) agents are increasingly deployed in open-ended domains like software engineering, they frequently encounter underspecified instructions that lack crucial context. While human developers naturally resolve underspecification by asking clarifying questions, current agents are largely optimized for autonomous execution. In this work, we systematically evaluate the clarification-seeking abilities of LLM agents on an underspecified variant of SWE-bench Verified. We propose an uncertainty-aware multi-agent scaffold that explicitly decouples underspecification detection from code execution. Our results demonstrate that this multi-agent system using OpenHands + Claude Sonnet 4.5 achieves a 69.40% task resolve rate, significantly outperforming a standard single-agent setup (61.20%) and closing the performance gap with agents operating on fully specified instructions.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nedwards99/ask-or-assume
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.