D2CSE: Difference-aware Deep continuous prompts for Contrastive Sentence Embeddings
Hyunjae Lee

TL;DR
D2CSE introduces a novel, memory-efficient method for learning highly distinguishable sentence embeddings using continuous prompts, outperforming existing models on semantic similarity benchmarks.
Contribution
It proposes a difference-aware deep continuous prompt approach that avoids multiple PLMs, reduces training parameters, and enhances sentence embedding quality.
Findings
Outperforms state-of-the-art on seven STS benchmarks.
Reduces training parameters to about 1% of existing methods.
Improves embedding space quality measured by multiple metrics.
Abstract
This paper describes Difference-aware Deep continuous prompt for Contrastive Sentence Embeddings (D2CSE) that learns sentence embeddings. Compared to state-of-the-art approaches, D2CSE computes sentence vectors that are exceptional to distinguish a subtle difference in similar sentences by employing a simple neural architecture for continuous prompts. Unlike existing architectures that require multiple pretrained language models (PLMs) to process a pair of the original and corrupted (subtly modified) sentences, D2CSE avoids cumbersome fine-tuning of multiple PLMs by only optimizing continuous prompts by performing multiple tasks -- i.e., contrastive learning and conditional replaced token detection all done in a self-guided manner. D2CSE overloads a single PLM on continuous prompts and greatly saves memory consumption as a result. The number of training parameters in D2CSE is reduced to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
MethodsContrastive Learning
