Advancing Semantic Textual Similarity Modeling: A Regression Framework with Translated ReLU and Smooth K2 Loss

Bowen Zhang; Chunping Li

arXiv:2406.05326·cs.CL·October 8, 2024

TL;DR

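This paper introduces a regression framework for Semantic Textual Similarity (STS) built on Translated ReLU and Smooth K2 Loss. It targets two limitations of existing approaches: contrastive learning treats text pairs as simply similar or dissimilar and needs large batch sizes, while classification-based methods such as Sentence-BERT overlook the graded nature of semantic similarity. The framework achieves competitive performance on seven STS benchmarks.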

Contribution

The paper proposes a regression-based approach to STS, pairing the framework with Translated ReLU and Smooth K2 Loss so that graded levels of semantic similarity are modeled directly rather than collapsed into discrete classes.

Findings

01. Achieves competitive performance on seven STS benchmarks.

02. Effectively models fine-grained semantic similarity levels.

03. Shows potential to further improve models pre-trained with contrastive learning.

Abstract

Since the introduction of BERT and RoBERTa, research on Semantic Textual Similarity (STS) has made groundbreaking progress. Particularly, the adoption of contrastive learning has substantially elevated state-of-the-art performance across various STS benchmarks. However, contrastive learning categorizes text pairs as either semantically similar or dissimilar, failing to leverage fine-grained annotated information and necessitating large batch sizes to prevent model collapse. These constraints pose challenges for researchers engaged in STS tasks that involve nuanced similarity levels or those with limited computational resources, compelling them to explore alternatives like Sentence-BERT. Despite its efficiency, Sentence-BERT tackles STS tasks from a classification perspective, overlooking the progressive nature of semantic relationships, which results in suboptimal performance. To bridge…

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques

Methods: Attention Is All You Need · WordPiece · Linear Warmup With Linear Decay · Adam · Attention Dropout · Weight Decay · Linear Layer · Multi-Head Attention · Dropout