2-Tier SimCSE: Elevating BERT for Robust Sentence Embeddings

Yumeng Wang; Ziran Zhou; Junjin Wang

arXiv:2501.13758·cs.CL·January 24, 2025

2-Tier SimCSE: Elevating BERT for Robust Sentence Embeddings

Yumeng Wang, Ziran Zhou, Junjin Wang

PDF

Open Access

TL;DR

This paper introduces a novel 2-Tier SimCSE fine-tuning approach for BERT that improves sentence embeddings across multiple NLP tasks by combining unsupervised and supervised contrastive learning techniques.

Contribution

It proposes a new 2-Tier SimCSE model integrating unsupervised and supervised fine-tuning, and explores dropout techniques to enhance embedding quality and generalization.

Findings

01

2-Tier SimCSE achieves higher STS scores (average 0.742).

02

Adaptive Dropout removal improves performance, indicating overfitting issues.

03

Transfer learning from SimCSE on other tasks shows limited benefits.

Abstract

Effective sentence embeddings that capture semantic nuances and generalize well across diverse contexts are crucial for natural language processing tasks. We address this challenge by applying SimCSE (Simple Contrastive Learning of Sentence Embeddings) using contrastive learning to fine-tune the minBERT model for sentiment analysis, semantic textual similarity (STS), and paraphrase detection. Our contributions include experimenting with three different dropout techniques, namely standard dropout, curriculum dropout, and adaptive dropout, to tackle overfitting, proposing a novel 2-Tier SimCSE Fine-tuning Model that combines both unsupervised and supervised SimCSE on STS task, and exploring transfer learning potential for Paraphrase and SST tasks. Our findings demonstrate the effectiveness of SimCSE, with the 2-Tier model achieving superior performance on the STS task, with an average…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Software Engineering Research · Natural Language Processing Techniques

MethodsSimCSE · Contrastive Learning · Dropout · Adaptive Dropout