EvoConfig: Self-Evolving Multi-Agent Systems for Efficient Autonomous Environment Configuration
Xinshuai Guo, Jiayi Kuang, Linyue Pan, Yinghui Li, Yangning Li, Hai-Tao Zheng, Ying Shen, Di Yin, Xing Sun

TL;DR
EvoConfig is a self-evolving multi-agent framework that enhances environment configuration efficiency and accuracy for large language models by integrating expert diagnosis and dynamic feedback, significantly improving success rates and debugging capabilities.
Contribution
The paper introduces EvoConfig, a novel self-evolving multi-agent system with expert diagnosis and feedback mechanisms for improved environment configuration.
Findings
EvoConfig matches state-of-the-art on Repo2Run datasets.
Achieves 78.1% success rate on Envbench, outperforming previous methods.
Demonstrates superior debugging and error correction capabilities.
Abstract
A reliable executable environment is the foundation for ensuring that large language models solve software engineering tasks. Due to the complex and tedious construction process, large-scale configuration is relatively inefficient. However, most methods always overlook fine-grained analysis of the actions performed by the agent, making it difficult to handle complex errors and resulting in configuration failures. To address this bottleneck, we propose EvoConfig, an efficient environment configuration framework that optimizes multi-agent collaboration to build correct runtime environments. EvoConfig features an expert diagnosis module for fine-grained post-execution analysis, and a self-evolving mechanism that lets expert agents self-feedback and dynamically adjust error-fixing priorities in real time. Empirically, EvoConfig matches the previous state-of-the-art Repo2Run on Repo2Run's…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware System Performance and Reliability · Software Engineering Research · Software Testing and Debugging Techniques
