A Large-Scale Study on the Development and Issues of Multi-Agent AI Systems
Daniel Liu, Krishna Upadhyay, Vinaik Chhetri, A.B. Siddique, Umar Farooq

TL;DR
This large-scale empirical study analyzes the development, maintenance, and issues of open-source multi-agent AI systems, revealing diverse development profiles, common challenges, and the ecosystem's rapid growth and fragility.
Contribution
It provides the first comprehensive analysis of open-source MAS, identifying development patterns, common issues, and highlighting areas for improving ecosystem sustainability.
Findings
Three development profiles: sustained, steady, burst-driven
40.8% of commits are feature enhancements
Issue reporting increased sharply in 2023
Abstract
The rapid emergence of multi-agent AI systems (MAS), including LangChain, CrewAI, and AutoGen, has shaped how large language model (LLM) applications are developed and orchestrated. However, little is known about how these systems evolve and are maintained in practice. This paper presents the first large-scale empirical study of open-source MAS, analyzing over 42K unique commits and over 4.7K resolved issues across eight leading systems. Our analysis identifies three distinct development profiles: sustained, steady, and burst-driven. These profiles reflect substantial variation in ecosystem maturity. Perfective commits constitute 40.8% of all changes, suggesting that feature enhancement is prioritized over corrective maintenance (27.4%) and adaptive updates (24.3%). Data about issues shows that the most frequent concerns involve bugs (22%), infrastructure (14%), and agent coordination…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Explainable Artificial Intelligence (XAI) · Topic Modeling
