Identical and Fraternal Twins: Fine-Grained Semantic Contrastive Learning of Sentence Representations
Qingfa Xiao, Shuangyin Li, Lei Chen

TL;DR
This paper introduces IFTCL, a novel contrastive learning framework for sentence representations that preserves semantic margins and improves performance by addressing limitations of existing methods.
Contribution
The paper proposes the Twins Loss and hippocampus queue mechanism, enhancing contrastive learning by maintaining semantic margins and reusing negative samples efficiently.
Findings
Outperforms state-of-the-art on nine semantic textual similarity tasks
Effectively preserves semantic margins during training
Improves efficiency with hippocampus queue mechanism
Abstract
The enhancement of unsupervised learning of sentence representations has been significantly achieved by the utility of contrastive learning. This approach clusters the augmented positive instance with the anchor instance to create a desired embedding space. However, relying solely on the contrastive objective can result in sub-optimal outcomes due to its inability to differentiate subtle semantic variations between positive pairs. Specifically, common data augmentation techniques frequently introduce semantic distortion, leading to a semantic margin between the positive pair. While the InfoNCE loss function overlooks the semantic margin and prioritizes similarity maximization between positive pairs during training, leading to the insensitive semantic comprehension ability of the trained model. In this paper, we introduce a novel Identical and Fraternal Twins of Contrastive Learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques
MethodsContrastive Learning · InfoNCE
