Rethinking the AI Scientist: Interactive Multi-Agent Workflows for Scientific Discovery
Lukas Weidener, Marko Brki\'c, Mihailo Jovanovi\'c, Ritvik Singh, Chiara Baccin, Emre Ulgac, Alex Dobrin, Aakaash Meduri

TL;DR
This paper presents Deep Research, a multi-agent system that enables interactive, real-time AI-assisted scientific discovery with specialized agents and two operational modes, significantly improving turnaround times and performance on biological benchmarks.
Contribution
The paper introduces a novel multi-agent architecture for interactive scientific research, supporting real-time workflows and demonstrating state-of-the-art results on biological benchmarks.
Findings
Achieved 48.8% accuracy on open response tasks
Achieved 64.4% accuracy on multiple-choice tasks
Outperformed existing baselines by 14 to 26 percentage points
Abstract
Artificial intelligence systems for scientific discovery have demonstrated remarkable potential, yet existing approaches remain largely proprietary and operate in batch-processing modes requiring hours per research cycle, precluding real-time researcher guidance. This paper introduces Deep Research, a multi-agent system enabling interactive scientific investigation with turnaround times measured in minutes. The architecture comprises specialized agents for planning, data analysis, literature search, and novelty detection, unified through a persistent world state that maintains context across iterative research cycles. Two operational modes support different workflows: semi-autonomous mode with selective human checkpoints, and fully autonomous mode for extended investigations. Evaluation on the BixBench computational biology benchmark demonstrated state-of-the-art performance, achieving…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Machine Learning in Materials Science · Cell Image Analysis Techniques
