AI-Researcher: Autonomous Scientific Innovation
Jiabin Tang, Lianghao Xia, Zhonghang Li, Chao Huang

TL;DR
This paper introduces AI-Researcher, an autonomous system that manages the entire scientific research process using LLMs, and evaluates its effectiveness with a new benchmark, demonstrating near-human research quality.
Contribution
It presents AI-Researcher, the first fully autonomous research system capable of conducting end-to-end scientific discovery with minimal human input.
Findings
Achieves high success rates in implementing research tasks.
Produces research papers approaching human-level quality.
Establishes Scientist-Bench, a new benchmark for autonomous research evaluation.
Abstract
The powerful reasoning capabilities of Large Language Models (LLMs) in mathematics and coding, combined with their ability to automate complex tasks through agentic frameworks, present unprecedented opportunities for accelerating scientific innovation. In this paper, we introduce AI-Researcher, a fully autonomous research system that transforms how AI-driven scientific discovery is conducted and evaluated. Our framework seamlessly orchestrates the complete research pipeline--from literature review and hypothesis generation to algorithm implementation and publication-ready manuscript preparation--with minimal human intervention. To rigorously assess autonomous research capabilities, we develop Scientist-Bench, a comprehensive benchmark comprising state-of-the-art papers across diverse AI research domains, featuring both guided innovation and open-ended exploration tasks. Through…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBig Data and Business Intelligence
