HNCSE: Advancing Sentence Embeddings via Hybrid Contrastive Learning with Hard Negatives
Wenxiao Liu, Zihong Yang, Chaozhuo Li, Zijin Hong, Jianfeng Ma,, Zhiquan Liu, Litian Zhang, Feiran Huang

TL;DR
HNCSE introduces a novel contrastive learning framework that leverages hard negative samples to improve sentence embeddings, demonstrating superior performance on semantic similarity and transfer tasks.
Contribution
It extends SimCSE by effectively incorporating hard negatives, enhancing the semantic richness of sentence representations in unsupervised learning.
Findings
Outperforms existing methods on semantic textual similarity tasks.
Shows improved transfer learning performance.
Validates the effectiveness of hard negatives in sentence embedding.
Abstract
Unsupervised sentence representation learning remains a critical challenge in modern natural language processing (NLP) research. Recently, contrastive learning techniques have achieved significant success in addressing this issue by effectively capturing textual semantics. Many such approaches prioritize the optimization using negative samples. In fields such as computer vision, hard negative samples (samples that are close to the decision boundary and thus more difficult to distinguish) have been shown to enhance representation learning. However, adapting hard negatives to contrastive sentence learning is complex due to the intricate syntactic and semantic details of text. To address this problem, we propose HNCSE, a novel contrastive learning framework that extends the leading SimCSE approach. The hallmark of HNCSE is its innovative use of hard negative samples to enhance the learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Hate Speech and Cyberbullying Detection
MethodsContrastive Learning · SimCSE
