Augmented SBERT: Data Augmentation Method for Improving Bi-Encoders for Pairwise Sentence Scoring Tasks

Nandan Thakur; Nils Reimers; Johannes Daxenberger; Iryna Gurevych

arXiv:2010.08240 · cs.CL · April 13, 2021

TL;DR

Augmented SBERT introduces a data augmentation technique using cross-encoders to label additional data, significantly enhancing bi-encoder performance in pairwise sentence scoring tasks, especially in domain adaptation scenarios.

Contribution

The paper proposes a simple yet efficient data augmentation strategy for bi-encoders: a cross-encoder labels a larger set of input pairs, and those labeled pairs are added to the bi-encoder's training data.

Findings

01 Up to 6-point improvement in in-domain tasks

02 Up to 37-point improvement in domain adaptation tasks

03 Effective sentence pair selection is crucial for success
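
To make the pair-selection finding concrete, here is a minimal sketch contrasting two ways of choosing candidate pairs to label. It does not reproduce the paper's own sampling strategies. The functions `jaccard`, `random_pairs`, and `top_k_pairs`, the token-overlap similarity, and the toy corpus are all assumptions made for illustration.

```python
import random


def jaccard(a, b):
    """Token-set overlap between two sentences (illustrative similarity only)."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    return len(ta & tb) / len(ta | tb) if ta | tb else 0.0


def random_pairs(sentences, n, seed=0):
    """Baseline: draw n pairs uniformly from all possible sentence pairs."""
    rng = random.Random(seed)
    all_pairs = [(a, b) for i, a in enumerate(sentences) for b in sentences[i + 1:]]
    return rng.sample(all_pairs, min(n, len(all_pairs)))


def top_k_pairs(sentences, k):
    """For each sentence, pair it with its k most similar other sentences.

    Candidates with zero overlap are skipped, so unrelated sentences
    contribute no pairs.
    """
    pairs = set()
    for a in sentences:
        scored = [(jaccard(a, b), b) for b in sentences if b != a]
        ranked = sorted((sb for sb in scored if sb[0] > 0), key=lambda sb: sb[0], reverse=True)
        for _, b in ranked[:k]:
            pairs.add(tuple(sorted((a, b))))  # store each pair once, order-independent
    return sorted(pairs)


corpus = [
    "a man plays guitar",
    "a man plays a guitar on stage",
    "a dog runs in the park",
    "the dog runs through a park",
    "stocks fell sharply today",
]

print(top_k_pairs(corpus, k=1))
print(random_pairs(corpus, n=3))
```

In a corpus like this one, most uniformly sampled pairs are unrelated, so labeling them would yield mostly low scores. Similarity-based selection surfaces related pairs, which is one plausible reason the choice of pairs affects the augmented training set.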

Abstract

There are two approaches for pairwise sentence scoring: Cross-encoders, which perform full-attention over the input pair, and Bi-encoders, which map each input independently to a dense vector space. While cross-encoders often achieve higher performance, they are too slow for many practical use cases. Bi-encoders, on the other hand, require substantial training data and fine-tuning over the target task to achieve competitive performance. We present a simple yet efficient data augmentation strategy called Augmented SBERT, where we use the cross-encoder to label a larger set of input pairs to augment the training data for the bi-encoder. We show that, in this process, selecting the sentence pairs is non-trivial and crucial for the success of the method. We evaluate our approach on multiple tasks (in-domain) as well as on a domain adaptation task. Augmented SBERT achieves an improvement of…
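
The augmentation step described in the abstract can be sketched as follows. This is a toy illustration under stated assumptions, not the authors' implementation: `cross_encoder_score` and `encode` are hypothetical stand-ins built on token overlap and bag-of-words counts (a real setup would use trained transformer models), and `augment` is an illustrative helper name.

```python
import math
from collections import Counter
from itertools import combinations


def tokens(s):
    return s.lower().split()


def cross_encoder_score(a, b):
    """Toy stand-in for a cross-encoder: it looks at BOTH sentences jointly."""
    ta, tb = set(tokens(a)), set(tokens(b))
    return len(ta & tb) / len(ta | tb) if ta | tb else 0.0


def encode(s):
    """Toy stand-in for a bi-encoder: maps ONE sentence to a vector on its own."""
    return Counter(tokens(s))


def bi_encoder_score(a, b):
    """Cosine similarity between two independently computed sentence vectors."""
    va, vb = encode(a), encode(b)
    dot = sum(va[t] * vb[t] for t in va)
    norm = math.sqrt(sum(v * v for v in va.values())) * math.sqrt(sum(v * v for v in vb.values()))
    return dot / norm if norm else 0.0


def augment(gold, unlabeled_pairs, scorer):
    """Label new pairs with the scorer ("silver" data) and append them to the gold data."""
    gold_keys = {frozenset((a, b)) for a, b, _ in gold}
    silver = [
        (a, b, scorer(a, b))
        for a, b in unlabeled_pairs
        if frozenset((a, b)) not in gold_keys  # do not relabel human-annotated pairs
    ]
    return gold + silver


# Small human-labeled ("gold") set: (sentence_a, sentence_b, score)
gold = [
    ("a man plays guitar", "a man plays a guitar", 1.0),
    ("a dog runs", "stocks fell today", 0.0),
]

# Candidate pairs: here simply every combination of the known sentences.
sentences = sorted({s for a, b, _ in gold for s in (a, b)})
candidates = list(combinations(sentences, 2))

train_set = augment(gold, candidates, cross_encoder_score)
for a, b, score in train_set:
    print(f"{score:.2f}  {a!r} | {b!r}")
```

The resulting `train_set` would then be used to train the bi-encoder. That training step is omitted here because it depends on the model and library used.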

Code & Models

Repositories

UKPLab/sentence-transformers (PyTorch, official)

Taxonomy

Methods: Sentence-BERT · Augmented SBERT · Siamese Network · Adam · Dropout · Softmax · BERT