Hierarchical Molecular Representation Learning via Fragment-Based Self-Supervised Embedding Prediction

Jiele Wu; Haozhe Ma; Zhihan Guo; Thanh Vinh Vo; Tze Yun Leong

arXiv:2602.20344·cs.LG·February 25, 2026

Hierarchical Molecular Representation Learning via Fragment-Based Self-Supervised Embedding Prediction

Jiele Wu, Haozhe Ma, Zhihan Guo, Thanh Vinh Vo, Tze Yun Leong

PDF

Open Access 3 Reviews

TL;DR

This paper introduces GraSPNet, a hierarchical self-supervised learning framework that models atomic and fragment-level semantics in molecular graphs, leading to improved molecular property prediction.

Contribution

It proposes a novel hierarchical GSSL method that captures multi-resolution chemical structures without predefined vocabularies, enhancing transferability.

Findings

01

Outperforms state-of-the-art GSSL methods on molecular benchmarks

02

Learns chemically meaningful and transferable representations

03

Effectively models multi-level molecular semantics

Abstract

Graph self-supervised learning (GSSL) has demonstrated strong potential for generating expressive graph embeddings without the need for human annotations, making it particularly valuable in domains with high labeling costs such as molecular graph analysis. However, existing GSSL methods mostly focus on node- or edge-level information, often ignoring chemically relevant substructures which strongly influence molecular properties. In this work, we propose Graph Semantic Predictive Network (GraSPNet), a hierarchical self-supervised framework that explicitly models both atomic-level and fragment-level semantics. GraSPNet decomposes molecular graphs into chemically meaningful fragments without predefined vocabularies and learns node- and fragment-level representations through multi-level message passing with masked semantic prediction at both levels. This hierarchical semantic supervision…

Peer Reviews

Decision·Submitted to ICLR 2026

Reviewer 01Rating 6Confidence 4

Strengths

1- The paper proposed a novel approach to capture both node and fragment-level semantics. The proposed GraSPNet architecture introduces a dual-level semantic prediction mechanism, which is underexplored in graph self-supervised learning (GSSL). 2- The proposed fragmentation strategy looks promising. Moreover, the WL-test example in Figure 2 clearly demonstrates how fragment-level abstraction helps distinguish structurally similar but semantically distinct molecules. This is a strong theoretical

Weaknesses

1- The baselines are outdated. Especially the graph contrastive learning methods. Here a list of GCL methods that have been published more recently and outperform current baselines: GRACE: Zhu, Y., Xu, Y., Yu, F., Liu, Q., Wu, S., & Wang, L. (2020). Deep graph contrastive representation learning. arXiv preprint arXiv:2006.04131. GCA: Zhu, Y., Xu, Y., Yu, F., Liu, Q., Wu, S., & Wang, L. (2021, April). Graph contrastive learning with adaptive augmentation. In Proceedings of the web conference 2021

Reviewer 02Rating 2Confidence 4

Strengths

1. **Programmatic Fragmentation Strategy:** The model partitions molecules into structural subgraphs (rings, paths, articulation points) through a deterministic graph algorithm, ensuring reproducibility without predefined vocabularies. 2. **Hierarchical Message Passing:** The dual-channel design captures both local atomic and global fragment semantics. 3. **Comprehensive Experiments:** Evaluated on 8 classification and 3 regression benchmarks, showing consistent performance gains over GraphCL,

Weaknesses

1. **Limited Novelty:** The core idea of hierarchical molecular representation has been explored in [1][2][3], and similar fragment-based or hierarchical pretraining exists. GraSPNet mainly integrates known techniques (fragment-level modeling + masked prediction + hierarchical GNN). 2. **Heuristic Fragment Extraction:** Although the method avoids chemical vocabularies, it still depends on hand-crafted structural heuristics (rings, paths, articulation points). A comparison with functional groups

Reviewer 03Rating 4Confidence 4

Strengths

1. The model achieves state-of-the-art or near-state-of-the-art performance on several challenging molecular property prediction benchmarks, particularly in transfer learning settings, demonstrating the effectiveness of its pretraining strategy and strong generalization ability. 2. The model explicitly models information transfer between atom-fragment and fragment-fragment, enabling it to capture higher-level chemical semantics that standard GNNs may overlook.

Weaknesses

1. The fragmentation strategy is a fixed decomposition method based on heuristic rules. It remains unclear whether this decomposition approach is optimal for all downstream tasks. For example, some tasks may require substructure partitions with different granularities or types. Simply applying this method of partitioning could potentially disrupt the information carried within the molecular graph. 2. The "fragment-based" approach is not novel; utilizing substructures or motifs to enhance molecu

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Computational Drug Discovery Methods · Graph Theory and Algorithms