A Systematic Survey and Benchmark of Deep Learning for Molecular Property Prediction in the Foundation Model Era
Zongru Li, Xingsheng Chen, Honggang Wen, Regina Qianru Zhang, Ming Li, Xiaojin Zhang, Hongzhi Yin, Qiang Yang, Kwok-Yan Lam, Pietro Lio, Siu-Ming Yiu

TL;DR
This paper provides a comprehensive survey and benchmark of deep learning methods for molecular property prediction, emphasizing recent foundation models and proposing future research directions.
Contribution
It introduces a unified taxonomy, analyzes current benchmarks, discusses challenges, and suggests new directions like physics-aware learning and multimodal datasets.
Findings
Benchmark analyses reveal inconsistencies in data curation and evaluation.
Current standards face reproducibility issues due to heterogeneous data sources.
Proposed future directions aim to improve model trustworthiness and realism.
Abstract
Molecular property prediction integrates quantum chemistry, cheminformatics, and deep learning to connect molecular structure with physicochemical and biological behavior. This survey traces four complementary paradigms, including Quantum, Descriptor Machine Learning, Geometric Deep Learning, and Foundation Models, and outlines a unified taxonomy linking molecular representations, model architectures, and interdisciplinary applications. Benchmark analyses integrate evidence from both widely used datasets and datasets reflecting industry perspectives, encompassing quantum, physicochemical, physiological, and biophysical domains. The survey examines current standards in data curation, splitting strategies, and evaluation protocols, highlighting challenges including inconsistent stereochemistry, heterogeneous assay sources, and reproducibility limitations under random or poorly defined…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
