Pre-training Graph Neural Networks on 2D and 3D Molecular Structures by using Multi-View Conditional Information Bottleneck
Van Thuy Hoang, O-Joun Lee

TL;DR
This paper introduces MVCIB, a novel self-supervised pre-training framework for molecular graph neural networks that leverages multi-view (2D and 3D) structures to improve representation quality, interpretability, and expressiveness, especially for complex molecular geometries.
Contribution
The paper proposes MVCIB, a multi-view conditional information bottleneck method that effectively discovers shared information and aligns important substructures across 2D and 3D molecular views, enhancing GNN pre-training.
Findings
MVCIB outperforms baselines in predictive tasks across four molecular domains.
MVCIB achieves 3D Weisfeiler-Lehman expressiveness, distinguishing isomers with identical 2D structures.
Incorporating substructure alignment improves interpretability and cross-view consistency.
Abstract
Recent pre-training strategies for molecular graphs have attempted to use 2D and 3D molecular views as both inputs and self-supervised signals, primarily aligning graph-level representations. However, existing studies remain limited in addressing two main challenges of multi-view molecular learning: (1) discovering shared information between two views while diminishing view-specific information and (2) identifying and aligning important substructures, e.g., functional groups, which are crucial for enhancing cross-view consistency and model expressiveness. To solve these challenges, we propose a Multi-View Conditional Information Bottleneck framework, called MVCIB, for pre-training graph neural networks on 2D and 3D molecular structures in a self-supervised setting. Our idea is to discover the shared information while minimizing irrelevant features from each view under the MVCIB…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Machine Learning in Materials Science · Computational Drug Discovery Methods
