ExComm: Exploration-Stage Communication for Error-Resilient Agentic Test-Time Scaling
Woomin Song, Beomjun Kim, Daewon Choi, Sai Muralidhar Jayanthi, Saket Dingliwal, Jinwoo Shin, Aram Galstyan

TL;DR
ExComm is a communication protocol designed to detect and correct errors during agentic reasoning, improving the robustness and diversity of long-horizon test-time scaling in AI agents.
Contribution
It introduces a novel communication framework that audits, verifies, and updates agent beliefs to mitigate error propagation and maintain trajectory diversity.
Findings
ExComm outperforms baseline methods with 5.7% and 5.0% performance gains.
It improves error recovery and scaling behavior.
It maintains higher trajectory diversity than other communication methods.
Abstract
A common failure mode in long-horizon agentic test-time scaling is error propagation, where factual errors or invalid deductions introduced at intermediate steps persist in the agent's belief state and contaminate later reasoning. Existing test-time scaling methods provide limited control over this process, as they often rely on agents to detect their own mistakes, select among flawed trajectories, or refine solutions only after errors have already shaped the reasoning path. We propose ExComm, a communication protocol for exploration-stage agentic test-time scaling. ExComm is motivated by the empirical observation that the majority of intermediate errors in parallel agentic reasoning produce detectable cross-agent factual conflicts. Leveraging the iterative structure of agentic workflows, ExComm periodically audits agent belief states to detect such conflicts, resolves them through a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
