BOOM: Benchmarking Out-Of-distribution Molecular Property Predictions of Machine Learning Models
Evan R. Antoniuk, Shehtab Zaman, Tal Ben-Nun, Peggy Li, James Diffenderfer, Busra Sahin, Obadiah Smolenski, Tim Hsu, Anna M. Hiszpanski, Kenneth Chiu, Bhavya Kailkhura, Brian Van Essen

TL;DR
BOOM introduces a comprehensive benchmark for evaluating the out-of-distribution molecular property prediction capabilities of machine learning models, revealing current limitations and guiding future improvements.
Contribution
This work provides the first systematic, chemically-informed benchmark for OOD molecular property prediction, evaluating over 150 models and analyzing factors affecting OOD generalization.
Findings
No model excels across all OOD tasks
Top models have 3x higher error OOD than in-distribution
Models with high inductive bias perform better on simple properties
Abstract
Data-driven molecular discovery leverages artificial intelligence/machine learning (AI/ML) and generative modeling to filter and design novel molecules. Discovering novel molecules requires accurate out-of-distribution (OOD) predictions, but ML models struggle to generalize OOD. Currently, no systematic benchmarks exist for molecular OOD prediction tasks. We present , enchmarks for ut-f-distribution olecular property predictions: a chemically-informed benchmark for OOD performance on common molecular property prediction tasks. We evaluate over 150 model-task combinations to benchmark deep learning models on OOD performance. Overall, we find that no existing model achieves strong generalization across all tasks: even the top-performing model exhibited an average OOD error 3x higher than in-distribution. Current chemical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods
