Toward Automated Simulation Research Workflow through LLM Prompt Engineering Design
Zhihan Liu, Yubo Chai, Jianfeng Li

TL;DR
This paper demonstrates the potential of Large Language Models, especially GPT-4o, to automate entire simulation research workflows through prompt engineering, enabling long-term autonomous research with minimal human intervention.
Contribution
It introduces an autonomous simulation agent (ASA) powered by LLMs that can perform end-to-end simulation research tasks based on human research plans, showcasing near-flawless long-task execution.
Findings
ASA-GPT-4o achieved near-flawless execution on research missions.
The automation process can be iterated for up to 20 cycles without human input.
The study discusses self-validation and task management traits of ASA.
Abstract
The advent of Large Language Models (LLMs) has created new opportunities for the automation of scientific research spanning both experimental processes and computational simulations. This study explores the feasibility of constructing an autonomous simulation agent (ASA) powered by LLMs through prompt engineering and automated program design to automate the entire simulation research process according to a human-provided research plan. This process includes experimental design, remote upload and simulation execution, data analysis, and report compilation. Using a well-studied simulation problem of polymer chain conformations as a test case, we assessed the long-task completion and reliability of ASAs powered by different LLMs, including GPT-4o, Claude-3.5, etc. Our findings revealed that ASA-GPT-4o achieved near-flawless execution on designated research missions, underscoring the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management
MethodsSoftmax · Attention Is All You Need
