2-Tier SimCSE: Elevating BERT for Robust Sentence Embeddings
Yumeng Wang, Ziran Zhou, Junjin Wang

TL;DR
This paper introduces a novel 2-Tier SimCSE fine-tuning approach for BERT that improves sentence embeddings across multiple NLP tasks by combining unsupervised and supervised contrastive learning techniques.
Contribution
It proposes a new 2-Tier SimCSE model integrating unsupervised and supervised fine-tuning, and explores dropout techniques to enhance embedding quality and generalization.
Findings
2-Tier SimCSE achieves higher STS scores (average 0.742).
Adaptive Dropout removal improves performance, indicating overfitting issues.
Transfer learning from SimCSE on other tasks shows limited benefits.
Abstract
Effective sentence embeddings that capture semantic nuances and generalize well across diverse contexts are crucial for natural language processing tasks. We address this challenge by applying SimCSE (Simple Contrastive Learning of Sentence Embeddings) using contrastive learning to fine-tune the minBERT model for sentiment analysis, semantic textual similarity (STS), and paraphrase detection. Our contributions include experimenting with three different dropout techniques, namely standard dropout, curriculum dropout, and adaptive dropout, to tackle overfitting, proposing a novel 2-Tier SimCSE Fine-tuning Model that combines both unsupervised and supervised SimCSE on STS task, and exploring transfer learning potential for Paraphrase and SST tasks. Our findings demonstrate the effectiveness of SimCSE, with the 2-Tier model achieving superior performance on the STS task, with an average…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Software Engineering Research · Natural Language Processing Techniques
MethodsSimCSE · Contrastive Learning · Dropout · Adaptive Dropout
