Improving Domain-Specific Retrieval by NLI Fine-Tuning

Roman Du\v{s}ek; Aleksander Wawer; Christopher Galias; Lidia; Wojciechowska

arXiv:2308.03103·cs.CL·August 8, 2023

Improving Domain-Specific Retrieval by NLI Fine-Tuning

Roman Du\v{s}ek, Aleksander Wawer, Christopher Galias, Lidia, Wojciechowska

PDF

Open Access

TL;DR

This paper explores how fine-tuning sentence encoders with NLI data enhances retrieval and ranking performance across English and Polish, demonstrating improvements in both monolingual and multilingual models.

Contribution

It shows that NLI fine-tuning significantly boosts retrieval and ranking tasks in multiple languages, with analysis of embedding properties explaining the improvements.

Findings

01

NLI fine-tuning improves retrieval and ranking performance.

02

Both monolingual and multilingual models benefit from NLI data.

03

Embedding uniformity and alignment are affected by NLI fine-tuning.

Abstract

The aim of this article is to investigate the fine-tuning potential of natural language inference (NLI) data to improve information retrieval and ranking. We demonstrate this for both English and Polish languages, using data from one of the largest Polish e-commerce sites and selected open-domain datasets. We employ both monolingual and multilingual sentence encoders fine-tuned by a supervised method utilizing contrastive loss and NLI data. Our results point to the fact that NLI fine-tuning increases the performance of the models in both tasks and both languages, with the potential to improve mono- and multilingual models. Finally, we investigate uniformity and alignment of the embeddings to explain the effect of NLI-based fine-tuning for an out-of-domain use-case.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification