SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering
John Yang, Carlos E. Jimenez, Alexander Wettig, Kilian Lieret, Shunyu, Yao, Karthik Narasimhan, Ofir Press

TL;DR
This paper introduces SWE-agent, a specialized interface system that enables language model agents to autonomously perform complex software engineering tasks, significantly improving their effectiveness and achieving state-of-the-art results.
Contribution
The paper presents SWE-agent, a novel agent-computer interface that enhances LM agent capabilities in software engineering, with empirical evaluation showing substantial performance improvements.
Findings
SWE-agent achieves state-of-the-art pass@1 rates of 12.5% on SWE-bench.
SWE-agent attains 87.7% pass@1 on HumanEvalFix.
Interface design significantly impacts agent performance.
Abstract
Language model (LM) agents are increasingly being used to automate complicated tasks in digital environments. Just as humans benefit from powerful software applications, such as integrated development environments, for complex tasks like software engineering, we posit that LM agents represent a new category of end users with their own needs and abilities, and would benefit from specially-built interfaces to the software they use. We investigate how interface design affects the performance of language model agents. As a result of this exploration, we introduce SWE-agent: a system that facilitates LM agents to autonomously use computers to solve software engineering tasks. SWE-agent's custom agent-computer interface (ACI) significantly enhances an agent's ability to create and edit code files, navigate entire repositories, and execute tests and other programs. We evaluate SWE-agent on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Agent Systems and Negotiation · Scheduling and Optimization Algorithms · Business Process Modeling and Analysis
