Regularized Contrastive Learning of Semantic Search
Mingxi Tan, Alexis Rolland, Andong Tian

TL;DR
This paper introduces Regularized Contrastive Learning, a novel regularization technique for transformer models that enhances sentence representations for semantic search, outperforming existing methods on multiple benchmarks.
Contribution
The paper proposes a new regularization method that augments semantic representations and uses contrastive objectives to improve transformer-based sentence encoding.
Findings
Outperforms baseline methods on 7 semantic search benchmarks
Effectively alleviates overfitting and anisotropic issues in sentence embeddings
Demonstrates superior performance on long-query FAQ datasets
Abstract
Semantic search is an important task which objective is to find the relevant index from a database for query. It requires a retrieval model that can properly learn the semantics of sentences. Transformer-based models are widely used as retrieval models due to their excellent ability to learn semantic representations. in the meantime, many regularization methods suitable for them have also been proposed. In this paper, we propose a new regularization method: Regularized Contrastive Learning, which can help transformer-based models to learn a better representation of sentences. It firstly augments several different semantic representations for every sentence, then take them into the contrastive objective as regulators. These contrastive regulators can overcome overfitting issues and alleviate the anisotropic problem. We firstly evaluate our approach on 7 semantic search benchmarks with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Text and Document Classification Technologies · Domain Adaptation and Few-Shot Learning
MethodsContrastive Learning
