Similarity-aware Positive Instance Sampling for Graph Contrastive Pre-training
Xueyi Liu, Yu Rong, Tingyang Xu, Fuchun Sun, Wenbing Huang, Junzhou, Huang

TL;DR
This paper introduces a similarity-aware positive instance sampling method for graph contrastive pre-training, improving the quality of positive samples and enhancing GNN performance across multiple benchmarks.
Contribution
It proposes a novel positive sampling strategy based on domain-specific similarity and hierarchical encoding, addressing issues of illegal or uninformative positive instances in graph contrastive learning.
Findings
Outperforms existing pre-training methods on 13 graph and node classification datasets.
Maintains graph legality and relevance in positive instance selection.
Enhances GNN model performance compared to training from scratch.
Abstract
Graph instance contrastive learning has been proved as an effective task for Graph Neural Network (GNN) pre-training. However, one key issue may seriously impede the representative power in existing works: Positive instances created by current methods often miss crucial information of graphs or even yield illegal instances (such as non-chemically-aware graphs in molecular generation). To remedy this issue, we propose to select positive graph instances directly from existing graphs in the training set, which ultimately maintains the legality and similarity to the target graphs. Our selection is based on certain domain-specific pair-wise similarity measurements as well as sampling from a hierarchical graph encoding similarity relations among graphs. Besides, we develop an adaptive node-level pre-training method to dynamically mask nodes to distribute them evenly in the graph. We conduct…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Machine Learning in Materials Science · Topic Modeling
MethodsGraph Neural Network · Contrastive Learning
