AI-Researcher: Autonomous Scientific Innovation

Jiabin Tang; Lianghao Xia; Zhonghang Li; Chao Huang

arXiv:2505.18705·cs.AI·May 27, 2025

AI-Researcher: Autonomous Scientific Innovation

Jiabin Tang, Lianghao Xia, Zhonghang Li, Chao Huang

PDF

Open Access 1 Repo

TL;DR

This paper introduces AI-Researcher, an autonomous system that manages the entire scientific research process using LLMs, and evaluates its effectiveness with a new benchmark, demonstrating near-human research quality.

Contribution

It presents AI-Researcher, the first fully autonomous research system capable of conducting end-to-end scientific discovery with minimal human input.

Findings

01

Achieves high success rates in implementing research tasks.

02

Produces research papers approaching human-level quality.

03

Establishes Scientist-Bench, a new benchmark for autonomous research evaluation.

Abstract

The powerful reasoning capabilities of Large Language Models (LLMs) in mathematics and coding, combined with their ability to automate complex tasks through agentic frameworks, present unprecedented opportunities for accelerating scientific innovation. In this paper, we introduce AI-Researcher, a fully autonomous research system that transforms how AI-driven scientific discovery is conducted and evaluated. Our framework seamlessly orchestrates the complete research pipeline--from literature review and hypothesis generation to algorithm implementation and publication-ready manuscript preparation--with minimal human intervention. To rigorously assess autonomous research capabilities, we develop Scientist-Bench, a comprehensive benchmark comprising state-of-the-art papers across diverse AI research domains, featuring both guided innovation and open-ended exploration tasks. Through…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hkuds/ai-researcher
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBig Data and Business Intelligence