Orchestrating Human-AI Software Delivery: A Retrospective Longitudinal Field Study of Three Software Modernization Programs
Maximiliano Armesto, Christophe Kolb

TL;DR
This study demonstrates that embedding AI into an orchestrated workflow significantly improves software modernization efforts in terms of speed, coverage, and quality, based on a longitudinal analysis of three real-world projects.
Contribution
It provides empirical evidence that integrated AI workflows outperform isolated AI tools in large-scale software modernization programs.
Findings
Efficient AI integration reduces project effort from 1080 to 232.5 person-days.
Validation issues decrease from 8.03 to 2.09 per 100 tasks.
First-release coverage improves from 77% to 90.5%.
Abstract
Evidence on AI in software engineering still leans heavily toward individual task completion, while evidence on team-level delivery remains scarce. We report a retrospective longitudinal field study of Chiron, an industrial platform that coordinates humans and AI agents across four delivery stages: analysis, planning, implementation, and validation. The study covers three real software modernization programs -- a COBOL banking migration (~30k LOC), a large accounting modernization (~400k LOC), and a .NET/Angular mortgage modernization (~30k LOC) -- observed across five delivery configurations: a traditional baseline and four successive platform versions (V1--V4). The benchmark separates observed outcomes (stage durations, task volumes, validation-stage issues, first-release coverage) from modeled outcomes (person-days and senior-equivalent effort under explicit staffing scenarios).…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Techniques and Practices · Software Engineering Research · Personal Information Management and User Behavior
