Hierarchical Molecular Representation Learning via Fragment-Based Self-Supervised Embedding Prediction
Jiele Wu, Haozhe Ma, Zhihan Guo, Thanh Vinh Vo, Tze Yun Leong

TL;DR
This paper introduces GraSPNet, a hierarchical self-supervised learning framework that models atomic and fragment-level semantics in molecular graphs, leading to improved molecular property prediction.
Contribution
It proposes a novel hierarchical GSSL method that captures multi-resolution chemical structures without predefined vocabularies, enhancing transferability.
Findings
Outperforms state-of-the-art GSSL methods on molecular benchmarks
Learns chemically meaningful and transferable representations
Effectively models multi-level molecular semantics
Abstract
Graph self-supervised learning (GSSL) has demonstrated strong potential for generating expressive graph embeddings without the need for human annotations, making it particularly valuable in domains with high labeling costs such as molecular graph analysis. However, existing GSSL methods mostly focus on node- or edge-level information, often ignoring chemically relevant substructures which strongly influence molecular properties. In this work, we propose Graph Semantic Predictive Network (GraSPNet), a hierarchical self-supervised framework that explicitly models both atomic-level and fragment-level semantics. GraSPNet decomposes molecular graphs into chemically meaningful fragments without predefined vocabularies and learns node- and fragment-level representations through multi-level message passing with masked semantic prediction at both levels. This hierarchical semantic supervision…
Peer Reviews
Decision·Submitted to ICLR 2026
1- The paper proposed a novel approach to capture both node and fragment-level semantics. The proposed GraSPNet architecture introduces a dual-level semantic prediction mechanism, which is underexplored in graph self-supervised learning (GSSL). 2- The proposed fragmentation strategy looks promising. Moreover, the WL-test example in Figure 2 clearly demonstrates how fragment-level abstraction helps distinguish structurally similar but semantically distinct molecules. This is a strong theoretical
1- The baselines are outdated. Especially the graph contrastive learning methods. Here a list of GCL methods that have been published more recently and outperform current baselines: GRACE: Zhu, Y., Xu, Y., Yu, F., Liu, Q., Wu, S., & Wang, L. (2020). Deep graph contrastive representation learning. arXiv preprint arXiv:2006.04131. GCA: Zhu, Y., Xu, Y., Yu, F., Liu, Q., Wu, S., & Wang, L. (2021, April). Graph contrastive learning with adaptive augmentation. In Proceedings of the web conference 2021
1. **Programmatic Fragmentation Strategy:** The model partitions molecules into structural subgraphs (rings, paths, articulation points) through a deterministic graph algorithm, ensuring reproducibility without predefined vocabularies. 2. **Hierarchical Message Passing:** The dual-channel design captures both local atomic and global fragment semantics. 3. **Comprehensive Experiments:** Evaluated on 8 classification and 3 regression benchmarks, showing consistent performance gains over GraphCL,
1. **Limited Novelty:** The core idea of hierarchical molecular representation has been explored in [1][2][3], and similar fragment-based or hierarchical pretraining exists. GraSPNet mainly integrates known techniques (fragment-level modeling + masked prediction + hierarchical GNN). 2. **Heuristic Fragment Extraction:** Although the method avoids chemical vocabularies, it still depends on hand-crafted structural heuristics (rings, paths, articulation points). A comparison with functional groups
1. The model achieves state-of-the-art or near-state-of-the-art performance on several challenging molecular property prediction benchmarks, particularly in transfer learning settings, demonstrating the effectiveness of its pretraining strategy and strong generalization ability. 2. The model explicitly models information transfer between atom-fragment and fragment-fragment, enabling it to capture higher-level chemical semantics that standard GNNs may overlook.
1. The fragmentation strategy is a fixed decomposition method based on heuristic rules. It remains unclear whether this decomposition approach is optimal for all downstream tasks. For example, some tasks may require substructure partitions with different granularities or types. Simply applying this method of partitioning could potentially disrupt the information carried within the molecular graph. 2. The "fragment-based" approach is not novel; utilizing substructures or motifs to enhance molecu
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Computational Drug Discovery Methods · Graph Theory and Algorithms
