Constructing and explaining machine learning models for chemistry: example of the exploration and design of boron-based Lewis acids
Juliette Fenogli, Laurence Grimaud, Rodolphe Vuilleumier (CPCV,, D\'epartement de chimie, \'Ecole Normale Sup\'erieure, PSL University,, Sorbonne Universit\'e, CNRS, Paris, France)

TL;DR
This paper develops interpretable machine learning models to predict and understand the Lewis acidity of boron-based compounds, aiding rational molecular design in chemistry.
Contribution
It introduces explainable AI models using chemically meaningful descriptors to predict Lewis acidity, surpassing black-box models especially in low-data scenarios.
Findings
Highly accurate predictions with MAE < 6 kJ/mol
Models reveal key factors influencing Lewis acidity
Guidelines for molecular modifications to tune acidity
Abstract
The integration of machine learning (ML) into chemistry offers transformative potential in the design of molecules with targeted properties. However, the focus has often been on creating highly efficient predictive models, sometimes at the expense of interpretability. In this study, we leverage explainable AI techniques to explore the rational design of boron-based Lewis acids, which play a pivotal role in organic reactions due to their electron-ccepting properties. Using Fluoride Ion Affinity as a proxy for Lewis acidity, we developed interpretable ML models based on chemically meaningful descriptors, including ab initio computed features and substituent-based parameters derived from the Hammett linear free-energy relationship. By constraining the chemical space to well-defined molecular scaffolds, we achieved highly accurate predictions (mean absolute error < 6 kJ/mol), surpassing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVarious Chemistry Research Topics · Machine Learning in Materials Science · Computational Drug Discovery Methods
MethodsFocus
