Competing with AI Scientists: Agent-Driven Approach to Astrophysics Research
Thomas Borrett, Licong Xu, Andy Nilipour, Boris Bolliet, Sebastien Pierre, Erwan Allys, Celia Lecat, Biwei Dai, Po-Wen Chang, Wahid Bhimji

TL;DR
This paper introduces an agent-driven system for constructing scientific data analysis pipelines, demonstrating its effectiveness in a cosmology challenge and showing potential to compete with expert solutions.
Contribution
It presents a multi-agent framework that collaborates to generate, execute, and refine research pipelines, integrating human intervention to achieve top results in a cosmological inference challenge.
Findings
Semi-autonomous agents achieved first place in a cosmology challenge.
The workflow combines neural networks, likelihood calibration, and regularization techniques.
Agent-driven workflows can rapidly explore and build inference pipelines.
Abstract
We present an agent-driven approach to the construction of parameter inference pipelines for scientific data analysis. Our method leverages a multi-agent system, Cmbagent (the analysis system of the AI scientist Denario), in which specialized agents collaborate to generate research ideas, write and execute code, evaluate results, and iteratively refine the overall pipeline. As a case study, we apply this approach to the FAIR Universe Weak Lensing Uncertainty Challenge, a competition under time constraints focused on robust cosmological parameter inference with realistic observational uncertainties. While the fully autonomous exploration initially did not reach expert-level performance, the integration of human intervention enabled our agent-driven workflow to achieve a first-place result in the challenge. This demonstrates that semi-autonomous agentic systems can compete with, and in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
