Unified Software Engineering Agent as AI Software Engineer
Leonhard Applis, Yuntong Zhang, Shanchao Liang, Nan Jiang, Lin Tan, Abhik Roychoudhury

TL;DR
This paper introduces USEagent, a unified AI software engineering agent capable of handling multiple software development tasks, and evaluates its performance on a comprehensive benchmark, showing improvements over existing agents.
Contribution
The paper presents the first unified AI software engineering agent, USEagent, capable of orchestrating multiple tasks, and introduces USEbench for comprehensive evaluation.
Findings
USEagent outperforms existing general agents on the USEbench.
Gaps identified in USEagent's capabilities suggest directions for future development.
USEagent demonstrates potential as a team member in future AI-human software development teams.
Abstract
The growth of Large Language Model (LLM) technology has raised expectations for automated coding. However, software engineering is more than coding and is concerned with activities including maintenance and evolution of a project. In this context, the concept of LLM agents has gained traction, which utilize LLMs as reasoning engines to invoke external tools autonomously. But is an LLM agent the same as an AI software engineer? In this paper, we seek to understand this question by developing a Unified Software Engineering agent or USEagent. Unlike existing work which builds specialized agents for specific software tasks such as testing, debugging, and repair, our goal is to build a unified agent which can orchestrate and handle multiple capabilities. This gives the agent the promise of handling complex scenarios in software development such as fixing an incomplete patch, adding new…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBusiness Process Modeling and Analysis · Multi-Agent Systems and Negotiation · Scientific Computing and Data Management
