Regularized Contrastive Learning of Semantic Search

Mingxi Tan; Alexis Rolland; Andong Tian

arXiv:2209.13241·cs.LG·September 28, 2022

Regularized Contrastive Learning of Semantic Search

Mingxi Tan, Alexis Rolland, Andong Tian

PDF

Open Access

TL;DR

This paper introduces Regularized Contrastive Learning, a novel regularization technique for transformer models that enhances sentence representations for semantic search, outperforming existing methods on multiple benchmarks.

Contribution

The paper proposes a new regularization method that augments semantic representations and uses contrastive objectives to improve transformer-based sentence encoding.

Findings

01

Outperforms baseline methods on 7 semantic search benchmarks

02

Effectively alleviates overfitting and anisotropic issues in sentence embeddings

03

Demonstrates superior performance on long-query FAQ datasets

Abstract

Semantic search is an important task which objective is to find the relevant index from a database for query. It requires a retrieval model that can properly learn the semantics of sentences. Transformer-based models are widely used as retrieval models due to their excellent ability to learn semantic representations. in the meantime, many regularization methods suitable for them have also been proposed. In this paper, we propose a new regularization method: Regularized Contrastive Learning, which can help transformer-based models to learn a better representation of sentences. It firstly augments several different semantic representations for every sentence, then take them into the contrastive objective as regulators. These contrastive regulators can overcome overfitting issues and alleviate the anisotropic problem. We firstly evaluate our approach on 7 semantic search benchmarks with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Text and Document Classification Technologies · Domain Adaptation and Few-Shot Learning

MethodsContrastive Learning