Graph Contrastive Learning via Cluster-refined Negative Sampling for Semi-supervised Text Classification
Wei Ai, Jianbin Li, Ze Wang, Jiayi Du, Tao Meng, Yuntao Shou, Keqin Li

TL;DR
This paper introduces ClusterText, a novel graph contrastive learning method that refines negative sampling through clustering and self-correction, significantly improving semi-supervised text classification performance.
Contribution
It proposes a cluster-refined negative sampling strategy combined with a self-correction mechanism to address over-clustering in GCL-based text classification.
Findings
Outperforms existing methods in text classification accuracy
Effectively mitigates negative sampling bias and over-clustering
Demonstrates scalability to large datasets
Abstract
Graph contrastive learning (GCL) has been widely applied to text classification tasks due to its ability to generate self-supervised signals from unlabeled data, thus facilitating model training. However, existing GCL-based text classification methods often suffer from negative sampling bias, where similar nodes are incorrectly paired as negative pairs. This can lead to over-clustering, where instances of the same class are divided into different clusters. To address the over-clustering issue, we propose an innovative GCL-based method of graph contrastive learning via cluster-refined negative sampling for semi-supervised text classification, namely ClusterText. Firstly, we combine the pre-trained model Bert with graph neural networks to learn text representations. Secondly, we introduce a clustering refinement strategy, which clusters the learned text representations to obtain pseudo…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Dropout · Contrastive Learning · Dense Connections · Weight Decay · Sparse Evolutionary Training · Layer Normalization · Residual Connection
