Representations and Strategies for Transferable Machine Learning Models in Chemical Discovery
Daniel R. Harper, Aditya Nandy, Naveen Arunachalam, Chenru Duan, Jon, Paul Janet, and Heather J. Kulik

TL;DR
This paper develops transferable machine learning models for chemical discovery, utilizing enhanced representations and transfer learning to predict properties of transition-metal complexes across different periodic table rows, even with limited data.
Contribution
It introduces an extended graph-based representation (eRAC) and a transfer learning approach that improve model transferability across chemical spaces with scarce data.
Findings
eRAC representation improves property prediction accuracy.
Transfer learning enhances model performance with limited data.
Models reorder complex similarities to align with periodic table trends.
Abstract
Strategies for machine-learning(ML)-accelerated discovery that are general across materials composition spaces are essential, but demonstrations of ML have been primarily limited to narrow composition variations. By addressing the scarcity of data in promising regions of chemical space for challenging targets like open-shell transition-metal complexes, general representations and transferable ML models that leverage known relationships in existing data will accelerate discovery. Over a large set (ca. 1000) of isovalent transition-metal complexes, we quantify evident relationships for different properties (i.e., spin-splitting and ligand dissociation) between rows of the periodic table (i.e., 3d/4d metals and 2p/3p ligands). We demonstrate an extension to graph-based revised autocorrelation (RAC) representation (i.e., eRAC) that incorporates the effective nuclear charge alongside the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · History and advancements in chemistry
