Emergent autonomous scientific research capabilities of large language models
Daniil A. Boiko, Robert MacKnight, Gabe Gomes

TL;DR
This paper demonstrates how large language models can autonomously conduct scientific research, including designing and executing experiments, exemplified by catalyzed cross-coupling reactions, and discusses safety considerations.
Contribution
It introduces an autonomous agent system that combines multiple large language models for scientific research tasks, showcasing novel capabilities in experiment design and execution.
Findings
Successful autonomous execution of catalyzed cross-coupling reactions
Demonstration of scientific research capabilities in multiple domains
Discussion of safety measures for autonomous AI systems
Abstract
Transformer-based large language models are rapidly advancing in the field of machine learning research, with applications spanning natural language, biology, chemistry, and computer programming. Extreme scaling and reinforcement learning from human feedback have significantly improved the quality of generated text, enabling these models to perform various tasks and reason about their choices. In this paper, we present an Intelligent Agent system that combines multiple large language models for autonomous design, planning, and execution of scientific experiments. We showcase the Agent's scientific research capabilities with three distinct examples, with the most complex being the successful performance of catalyzed cross-coupling reactions. Finally, we discuss the safety implications of such systems and propose measures to prevent their misuse.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
