Multi-level Self-supervised Pretraining on Compositional Hierarchical Graph for Molecular Property Prediction
Xiayu Liu, Zhengyi Lu, Hou-biao Li

TL;DR
This paper introduces MolCHG, a multi-level self-supervised pretraining framework for molecular graphs that captures hierarchical structural information to improve molecular property prediction.
Contribution
The work proposes a novel Compositional Hierarchical Graph and multi-level pretraining objectives to better utilize bond and atom information in molecular graphs.
Findings
MolCHG outperforms baselines on seven MoleculeNet datasets.
Multi-level supervision signals are complementary and enhance performance.
Bond-level information is effectively integrated as an independent semantic layer.
Abstract
Self-supervised pretraining on molecular graphs has emerged as a promising approach for molecular property prediction, yet most existing methods operate at a single structural granularity and treat bond information as auxiliary edge attributes rather than as an independent semantic layer. In this work, we propose MolCHG, a multi-level self-supervised pretraining framework built upon a novel Compositional Hierarchical Graph that organizes molecular structure into four types of nodes across three semantic levels. By introducing a bond graph that operates in parallel with the atom graph, our architecture elevates bond-level information to independently evolving node representations, enabling fragment nodes to aggregate atom-level and bond-level semantics on an equal footing. We design three level-specific pretraining objectives: an atom-bond cross-view contrastive task that aligns the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
