Teaching a Transformer to Think Like a Chemist: Predicting Nanocluster Stability
Jo\~ao Marcos T. Palheta, Octavio Rodrigues Filho, Mohammad Soleymanibrojeni, Alexandre Cavalheiro Dias, Diego Guedes-Sobrinho, Wolfgang Wenzel, Roland Aydin, Celso R. C. R\^ego, Maur\'icio Jeomar Piotrowski

TL;DR
This paper integrates density functional theory and AI to predict the stability of bimetallic nanoclusters, enabling rapid and interpretable screening of new chemistries for catalysis.
Contribution
It introduces a transformer-based model trained on DFT data to accurately predict nanocluster stability and preferences, with effective transfer learning to new host metals.
Findings
Transformer achieves ~0.6-0.7 eV MAE in formation energy predictions.
Model adapts quickly to unseen Fe-host nanoclusters with minimal data.
Attention and Shapley analyses identify key stability descriptors.
Abstract
Atomically precise metal nanoclusters bridge the molecular and bulk regimes, but designing bimetallic motifs with targeted stability and reactivity remains challenging. Here we combine density functional theory (DFT) and physics-grounded predictive artificial intelligence to map the configurational landscape of 13-atom icosahedral nanoclusters XTM, with hosts X = (Ti, Zr, Hf), and Fe and a single transition--metal dopant spanning the 3-5 series. Spin-polarized DFT calculations on 240 bimetallic clusters reveal systematic trends in binding and formation energies, distortion penalties, effective coordination number, d-band centre, and HOMO-LUMO gap that govern the competition between core-shell (in) and surface-segregated (out) arrangements. We then pretrain a transformer architecture on a curated set of 2968 unary clusters from the Quantum Cluster Database and fine-tune it…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Nanocluster Synthesis and Applications · Electrocatalysts for Energy Conversion
