KAPSO: A Knowledge-grounded framework for Autonomous Program Synthesis and Optimization
Alireza Nadafian, Alireza Mohammadshahi, Majid Yazdani

TL;DR
KAPSO is a modular framework that combines knowledge integration, experiment management, and iterative learning to improve autonomous program synthesis and optimization over long horizons, addressing common failures in coding agents.
Contribution
KAPSO introduces a novel integrated system that enhances long-horizon program synthesis by combining a git-based experimentation engine, a structured knowledge system, and a cognitive memory layer.
Findings
Achieved improved performance on MLE-Bench and ALE-Bench tasks.
Demonstrated effective reuse of knowledge and lessons to accelerate convergence.
Reduced repeated errors and enhanced reproducibility in program synthesis.
Abstract
We introduce KAPSO, a modular framework for autonomous program synthesis and optimization. Given a natural language goal and an evaluation method, KAPSO iteratively performs ideation, code synthesis and editing, execution, evaluation, and learning to improve a runnable artifact toward measurable objectives. Rather than treating synthesis as the endpoint, KAPSO uses synthesis as an operator within a long-horizon optimization loop, where progress is defined by evaluator outcomes. KAPSO targets long-horizon failures common in coding agents, including lost experimental state, brittle debugging, and weak reuse of domain expertise, by integrating three tightly coupled components. First, a git-native experimentation engine isolates each attempt as a branch, producing reproducible artifacts and preserving provenance across iterations. Second, a knowledge system ingests heterogeneous sources,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Machine Learning in Materials Science · Software Engineering Research
