D2CSE: Difference-aware Deep continuous prompts for Contrastive Sentence   Embeddings

Hyunjae Lee

arXiv:2304.08991·cs.CL·April 19, 2023·1 cites

D2CSE: Difference-aware Deep continuous prompts for Contrastive Sentence Embeddings

Hyunjae Lee

PDF

Open Access

TL;DR

D2CSE introduces a novel, memory-efficient method for learning highly distinguishable sentence embeddings using continuous prompts, outperforming existing models on semantic similarity benchmarks.

Contribution

It proposes a difference-aware deep continuous prompt approach that avoids multiple PLMs, reduces training parameters, and enhances sentence embedding quality.

Findings

01

Outperforms state-of-the-art on seven STS benchmarks.

02

Reduces training parameters to about 1% of existing methods.

03

Improves embedding space quality measured by multiple metrics.

Abstract

This paper describes Difference-aware Deep continuous prompt for Contrastive Sentence Embeddings (D2CSE) that learns sentence embeddings. Compared to state-of-the-art approaches, D2CSE computes sentence vectors that are exceptional to distinguish a subtle difference in similar sentences by employing a simple neural architecture for continuous prompts. Unlike existing architectures that require multiple pretrained language models (PLMs) to process a pair of the original and corrupted (subtly modified) sentences, D2CSE avoids cumbersome fine-tuning of multiple PLMs by only optimizing continuous prompts by performing multiple tasks -- i.e., contrastive learning and conditional replaced token detection all done in a self-guided manner. D2CSE overloads a single PLM on continuous prompts and greatly saves memory consumption as a result. The number of training parameters in D2CSE is reduced to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

MethodsContrastive Learning