Effective Strategies for Asynchronous Software Engineering Agents
Jiayi Geng, Graham Neubig

TL;DR
This paper introduces CAID, a structured multi-agent coordination framework for asynchronous software engineering tasks, improving accuracy and reliability by leveraging centralized task management and isolated workspaces.
Contribution
The paper proposes CAID, a novel multi-agent coordination paradigm inspired by human collaboration primitives, specifically designed for asynchronous software engineering tasks.
Findings
CAID improves accuracy by 26.7% on PaperBench.
CAID enhances accuracy by 14.3% on Commit0.
Branch-and-merge is identified as a key coordination mechanism.
Abstract
AI agents have become increasingly capable at isolated software engineering (SWE) tasks such as resolving issues on Github. Yet long-horizon tasks involving multiple interdependent subtasks still pose challenges both with respect to accuracy, and with respect to timely completion. A natural approach to solving these long-horizon tasks in a timely manner is asynchronous multi-agent collaboration, where multiple agents work on different parts of the task at the same time. But effective application of multi-agent systems has proven surprisingly difficult: concurrent edits by multiple agents interfere with each other, dependencies are difficult to synchronize, and combining partial progress into a coherent whole is challenging. On the other hand, human developers have long relied on mature collaboration infrastructure to manage these challenges in large software projects. Inspired by these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Agent Systems and Negotiation · Scientific Computing and Data Management · Mobile Crowdsensing and Crowdsourcing
