Can Coding Agents Be General Agents?

Maksim Ivanov; Abhijay Rana; Gokul Prabhakaran

arXiv:2604.13107·cs.SE·April 16, 2026

Can Coding Agents Be General Agents?

Maksim Ivanov, Abhijay Rana, Gokul Prabhakaran

PDF

TL;DR

This paper investigates whether coding agents can be generalized to automate complex business processes, identifying current limitations and challenges in domain logic integration.

Contribution

The study evaluates a coding agent's performance on practical business tasks, highlighting key bottlenecks in achieving true generalization.

Findings

01

Agent reliably completes simple tasks

02

Exhibits failures on complex tasks

03

Bridging domain logic and code execution is a key challenge

Abstract

As coding agents have seen rapid capability and adoption gains, users are applying them to general tasks beyond software engineering. In this post, we investigate whether coding agents can successfully generalize to end-to-end business process automation. We identify gaps in current evaluations, and conduct a case study to evaluate a coding agent on practical business tasks in an open-core Enterprise Resource Planning system. We find that the agent reliably completes simple tasks but exhibits characteristic failures on complex tasks, suggesting that bridging domain logic and code execution is a key bottleneck to generalizability.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.