Perspective: Towards sustainable exploration of chemical spaces with machine learning

Leonardo Medrano Sandonas; David Balcells; Anton Bochkarev; Jacqueline M. Cole; Volker L. Deringer; Werner Dobrautz; Adrian Ehrenhofer; Thorben Frank; Pascal Friederich; Rico Friedrich; Janine George; Luca Ghiringhelli; Alejandra Hinostroza Caldas; Veronika Juraskova; Hannes Kneiding; Yury Lysogorskiy; Johannes T. Margraf; Hanna T\"urk; Anatole von Lilienfeld; Milica Todorovi\'c; Alexandre Tkatchenko; Mariana Rossi; and Gianaurelio Cuniberti

arXiv:2604.00069·cs.LG·April 2, 2026

Perspective: Towards sustainable exploration of chemical spaces with machine learning

Leonardo Medrano Sandonas, David Balcells, Anton Bochkarev, Jacqueline M. Cole, Volker L. Deringer, Werner Dobrautz, Adrian Ehrenhofer, Thorben Frank, Pascal Friederich, Rico Friedrich, Janine George, Luca Ghiringhelli, Alejandra Hinostroza Caldas, Veronika Juraskova

PDF

TL;DR

This paper discusses the sustainability challenges in AI-driven molecular discovery, emphasizing resource-efficient strategies like multi-fidelity models, physics constraints, and open data to promote responsible scientific progress.

Contribution

It highlights emerging methods and workflows that improve efficiency and sustainability in AI-based chemical space exploration, integrating physics and practical considerations.

Findings

01

Large quantum datasets enable benchmarking but increase energy costs.

02

Strategies like multi-fidelity models and active learning improve efficiency.

03

Hierarchical workflows with physics constraints optimize resource use.

Abstract

Artificial intelligence is transforming molecular and materials science, but its growing computational and data demands raise critical sustainability challenges. In this Perspective, we examine resource considerations across the AI-driven discovery pipeline--from quantum-mechanical (QM) data generation and model training to automated, self-driving research workflows--building on discussions from the ``SusML workshop: Towards sustainable exploration of chemical spaces with machine learning'' held in Dresden, Germany. In this context, the availability of large quantum datasets has enabled rigorous benchmarking and rapid methodological progress, while also incurring substantial energy and infrastructure costs. We highlight emerging strategies to enhance efficiency, including general-purpose machine learning (ML) models, multi-fidelity approaches, model distillation, and active learning.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.