JTCSE: Joint Tensor-Modulus Constraints and Cross-Attention for Unsupervised Contrastive Learning of Sentence Embeddings
Tianyu Zong, Hongzhu Yi, Bingkang Shi, Yuanxiang Wang, Jungang Xu

TL;DR
JTCSE introduces a novel unsupervised contrastive learning framework for sentence embeddings that incorporates modulus constraints and cross-attention mechanisms, leading to state-of-the-art results in semantic similarity tasks.
Contribution
The paper proposes the JTCSE framework, combining modulus constraints on semantic tensors and cross-attention to improve unsupervised sentence embedding quality.
Findings
Outperforms baseline models on seven semantic similarity tasks
Achieves state-of-the-art results in unsupervised sentence embedding
Demonstrates strong zero-shot performance across 130 downstream tasks
Abstract
Unsupervised contrastive learning has become a hot research topic in natural language processing. Existing works usually aim at constraining the orientation distribution of the representations of positive and negative samples in the high-dimensional semantic space in contrastive learning, but the semantic representation tensor possesses both modulus and orientation features, and the existing works ignore the modulus feature of the representations and cause insufficient contrastive learning. % Therefore, we firstly propose a training objective that aims at modulus constraints on the semantic representation tensor, to strengthen the alignment between the positive samples in contrastive learning. Therefore, we first propose a training objective that is designed to impose modulus constraints on the semantic representation tensor, to strengthen the alignment between positive samples in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Advanced Graph Neural Networks
MethodsSoftmax · Attention Is All You Need · Contrastive Learning
