Identifying Semantic Component for Robust Molecular Property Prediction
Zijian Li, Zunhong Xu, Ruichu Cai, Zhenhui Yang, Yuguang Yan, Zhifeng, Hao, Guangyi Chen, Kun Zhang

TL;DR
This paper introduces SCI, a generative model that explicitly identifies semantic components in molecular data, enhancing out-of-distribution generalization and achieving state-of-the-art results across multiple benchmarks.
Contribution
The paper proposes a novel generative model with semantic-component identifiability for molecular property prediction, improving OOD generalization and interpretability.
Findings
Achieves state-of-the-art performance on 21 datasets.
Demonstrates improved OOD generalization over existing methods.
Provides insightful visualizations and explanations for predictions.
Abstract
Although graph neural networks have achieved great success in the task of molecular property prediction in recent years, their generalization ability under out-of-distribution (OOD) settings is still under-explored. Different from existing methods that learn discriminative representations for prediction, we propose a generative model with semantic-components identifiability, named SCI. We demonstrate that the latent variables in this generative model can be explicitly identified into semantic-relevant (SR) and semantic-irrelevant (SI) components, which contributes to better OOD generalization by involving minimal change properties of causal mechanisms. Specifically, we first formulate the data generation process from the atom level to the molecular level, where the latent space is split into SI substructures, SR substructures, and SR atom variables. Sequentially, to reduce…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · Advanced Graph Neural Networks
