Evolving Roles of LLMs in Scientific Innovation: Assistant, Collaborator, Scientist, and Evaluator

Haoxuan Zhang; Ruochi Li; Yang Zhang; Ting Xiao; Jiangping Chen; Junhua Ding; Haihua Chen

arXiv:2507.11810·cs.DL·May 13, 2026

Evolving Roles of LLMs in Scientific Innovation: Assistant, Collaborator, Scientist, and Evaluator

Haoxuan Zhang, Ruochi Li, Yang Zhang, Ting Xiao, Jiangping Chen, Junhua Ding, Haihua Chen

PDF

1 Repo

TL;DR

This survey introduces a four-role framework for understanding LLMs in scientific innovation, analyzing their capabilities, limitations, and human oversight needs across roles like Assistant, Collaborator, Scientist, and Evaluator.

Contribution

It proposes a novel four-role framework integrating autonomy, cognition, and innovation dimensions to distinguish research support from discovery in LLM applications.

Findings

01

Assistant systems are mature in retrieval and synthesis but unreliable in open-ended tasks.

02

Collaborator systems expand hypothesis space but face novelty-grounding challenges.

03

Scientist systems automate workflows but encounter reliability and safety issues.

Abstract

Large language models (LLMs) are increasingly used in scientific research and discovery, supporting tasks ranging from literature retrieval and synthesis to hypothesis generation, autonomous experimentation, and research evaluation. Existing surveys often conflate scientific research with scientific discovery and typically organize systems by domain, task, or autonomy level alone. In this survey, we propose a four-role framework for understanding LLMs in scientific innovation: Assistant, Collaborator, Scientist, and Evaluator. The framework integrates three complementary dimensions: autonomy level, cognitive function, and scientific innovation, to distinguish research-oriented support from frontier-oriented discovery. We review representative methods, benchmarks, and evaluation practices for each role, examining their capabilities, limitations, and human oversight requirements. Across…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

haoxuan-unt2024/llm4innovation
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.