From Overfitting to Robustness: Quantity, Quality, and Variety Oriented Negative Sample Selection in Graph Contrastive Learning
Adnan Ali, Jinlong Li, Huanhuan Chen, Ali Kashif Bashir

TL;DR
This paper introduces a novel negative sample selection algorithm for graph contrastive learning that improves robustness and accuracy by considering sample quality, variation, and quantity, outperforming state-of-the-art methods.
Contribution
The study proposes the Cumulative Sample Selection (CSS) algorithm and the NegAmplify framework, enhancing negative sample selection in GCL to reduce overfitting and improve node classification.
Findings
Up to 2.86% accuracy improvement over SOTA methods
Effective negative sample selection improves model robustness
Framework generalizes well across nine datasets
Abstract
Graph contrastive learning (GCL) aims to contrast positive-negative counterparts to learn the node embeddings, whereas graph data augmentation methods are employed to generate these positive-negative samples. The variation, quantity, and quality of negative samples compared to positive samples play crucial roles in learning meaningful embeddings for node classification downstream tasks. Less variation, excessive quantity, and low-quality negative samples cause the model to be overfitted for particular nodes, resulting in less robust models. To solve the overfitting problem in the GCL paradigm, this study proposes a novel Cumulative Sample Selection (CSS) algorithm by comprehensively considering negative samples' quality, variations, and quantity. Initially, three negative sample pools are constructed: easy, medium, and hard negative samples, which contain 25%, 50%, and 25% of the total…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition
MethodsContrastive Learning
