Autonomous Microscopy Experiments through Large Language Model Agents
Indrajeet Mandal, Jitendra Soni, Mohd Zaki, Morten M. Smedskjaer, Katrin Wondraczek, Lothar Wondraczek, Nitya Nand Gosvami, N. M. Anoop Krishnan

TL;DR
This paper introduces AILA, an AI framework for automating atomic force microscopy experiments using large language models, highlighting challenges in model coordination, safety, and prompt robustness in scientific workflows.
Contribution
The paper presents AILA, a novel LLM-driven agent framework for microscopy automation, and develops AFMBench for comprehensive evaluation of AI agents in scientific experiments.
Findings
State-of-the-art models struggle with basic experimental tasks.
Multi-agent frameworks outperform single-agent architectures.
Prompt fragility significantly affects model performance.
Abstract
Large language models (LLMs) are revolutionizing self driving laboratories (SDLs) for materials research, promising unprecedented acceleration of scientific discovery. However, current SDL implementations rely on rigid protocols that fail to capture the adaptability and intuition of expert scientists in dynamic experimental settings. We introduce Artificially Intelligent Lab Assistant (AILA), a framework automating atomic force microscopy through LLM driven agents. Further, we develop AFMBench a comprehensive evaluation suite challenging AI agents across the complete scientific workflow from experimental design to results analysis. We find that state of the art models struggle with basic tasks and coordination scenarios. Notably, Claude 3.5 sonnet performs unexpectedly poorly despite excelling in materials domain question answering (QA) benchmarks, revealing that domain specific QA…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Weight Decay · Multi-Head Attention · {Dispute@FaQ-s}How to file a dispute with Expedia? · Layer Normalization · Cosine Annealing · Dense Connections · 15 Ways to Contact How can i speak to someone at Delta Airlines
