A Critical Examination of Active Learning Workflows in Materials Science

Akhil S. Nair; Lucas Foppa

arXiv:2601.05946·cond-mat.mtrl-sci·January 12, 2026

A Critical Examination of Active Learning Workflows in Materials Science

Akhil S. Nair, Lucas Foppa

PDF

Open Access

TL;DR

This paper critically examines active learning workflows in materials science, highlighting assumptions, pitfalls, and providing guidance to improve their reliability and effectiveness in applications like interatomic potentials and self-driving labs.

Contribution

It systematically assesses key design choices in AL workflows, identifying common pitfalls and offering practical strategies for better implementation in materials science.

Findings

01

Identifies common pitfalls in AL workflows

02

Provides mitigation strategies for design assumptions

03

Guides practitioners in AL workflow assessment

Abstract

Active learning (AL) plays a critical role in materials science, enabling applications such as the construction of machine-learning interatomic potentials for atomistic simulations and the operation of self-driving laboratories. Despite its widespread use, the reliability and effectiveness of AL workflows depend on implicit design assumptions that are rarely examined systematically. Here, we critically assess AL workflows deployed in materials science and investigate how key design choices, such as surrogate models, sampling strategies, uncertainty quantification and evaluation metrics, relate to their performance. By identifying common pitfalls and discussing practical mitigation strategies, we provide guidance to practitioners for the efficient design, assessment, and interpretation of AL workflows in materials science.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Materials Science · Machine Learning and Algorithms · Scientific Computing and Data Management