Inside the Scaffold: A Source-Code Taxonomy of Coding Agent Architectures
Benjamin Rombaut

TL;DR
This paper develops a detailed source-code taxonomy of coding agent architectures by analyzing 13 open-source scaffolds, revealing diverse control strategies, tool integrations, and resource management practices.
Contribution
It introduces a novel source-code-level taxonomy of coding agent scaffolds, characterizing architectural dimensions across multiple layers and primitives.
Findings
Control strategies range from fixed pipelines to Monte Carlo Tree Search.
Agents incorporate 0 to 37 tools, showing diverse tool usage.
Multiple loop primitives are used in various combinations across agents.
Abstract
LLM-based coding agents can localize bugs, generate patches, and run tests with diminishing human oversight, yet the scaffolding code that surrounds the language model (the control loop, tool definitions, state management, and context strategy) remains poorly understood. Existing surveys classify agents by abstract capabilities (tool use, planning, reflection) that cannot distinguish between architecturally distinct systems, and trajectory studies observe what agents do without examining the scaffold code that determines why. This paper presents a source-code-level architectural taxonomy derived from analysis of 13 open-source coding agent scaffolds at pinned commit hashes. Each agent is characterized across 12 dimensions organized into three layers: control architecture, tool and environment interface, and resource management. The analysis reveals that scaffold architectures resist…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
