SWE-chat: Coding Agent Interactions From Real Users in the Wild

Joachim Baumann; Vishakh Padmakumar; Xiang Li; John Yang; Diyi Yang; Sanmi Koyejo

arXiv:2604.20779 · cs.AI · April 23, 2026

TL;DR

SWE-chat is a large-scale, real-world dataset of coding agent interactions revealing usage patterns, failure modes, and user behaviors in natural developer workflows.

Contribution

The paper introduces SWE-chat, the first large-scale dataset of real coding agent sessions collected from open-source developers, enabling empirical analysis of how AI coding agents are used in practice.

Findings

1. 41% of sessions involve agents authoring almost all code
2. Only 44% of agent-generated code is committed by users
3. Agent-written code has more security vulnerabilities than human code

Abstract

AI coding agents are being adopted at scale, yet we lack empirical evidence on how people actually use them and how much of their output is useful in practice. We present SWE-chat, the first large-scale dataset of real coding agent sessions collected from open-source developers in the wild. The dataset currently contains 6,000 sessions, comprising more than 63,000 user prompts and 355,000 agent tool calls. SWE-chat is a living dataset; our collection pipeline automatically and continually discovers and processes sessions from public repositories. Leveraging SWE-chat, we provide an initial empirical characterization of real-world coding agent usage and failure modes. We find that coding patterns are bimodal: in 41% of sessions, agents author virtually all committed code ("vibe coding"), while in 23%, humans write all code themselves. Despite rapidly improving capabilities, coding agents…

Code & Models

Models

suhas9545/Qwen2.5-3B-SWE-Agent-QLoRA (Hugging Face model)
