From Overfitting to Robustness: Quantity, Quality, and Variety Oriented   Negative Sample Selection in Graph Contrastive Learning

Adnan Ali; Jinlong Li; Huanhuan Chen; Ali Kashif Bashir

arXiv:2406.15044·cs.LG·March 5, 2025

From Overfitting to Robustness: Quantity, Quality, and Variety Oriented Negative Sample Selection in Graph Contrastive Learning

Adnan Ali, Jinlong Li, Huanhuan Chen, Ali Kashif Bashir

PDF

Open Access

TL;DR

This paper introduces a novel negative sample selection algorithm for graph contrastive learning that improves robustness and accuracy by considering sample quality, variation, and quantity, outperforming state-of-the-art methods.

Contribution

The study proposes the Cumulative Sample Selection (CSS) algorithm and the NegAmplify framework, enhancing negative sample selection in GCL to reduce overfitting and improve node classification.

Findings

01

Up to 2.86% accuracy improvement over SOTA methods

02

Effective negative sample selection improves model robustness

03

Framework generalizes well across nine datasets

Abstract

Graph contrastive learning (GCL) aims to contrast positive-negative counterparts to learn the node embeddings, whereas graph data augmentation methods are employed to generate these positive-negative samples. The variation, quantity, and quality of negative samples compared to positive samples play crucial roles in learning meaningful embeddings for node classification downstream tasks. Less variation, excessive quantity, and low-quality negative samples cause the model to be overfitted for particular nodes, resulting in less robust models. To solve the overfitting problem in the GCL paradigm, this study proposes a novel Cumulative Sample Selection (CSS) algorithm by comprehensively considering negative samples' quality, variations, and quantity. Initially, three negative sample pools are constructed: easy, medium, and hard negative samples, which contain 25%, 50%, and 25% of the total…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace and Expression Recognition

MethodsContrastive Learning