Dense Retrieval Adaptation using Target Domain Description

Helia Hashemi; Yong Zhuang; Sachith Sri Ram Kothur; Srivas Prasad,; Edgar Meij; W. Bruce Croft

arXiv:2307.02740·cs.IR·July 7, 2023

Dense Retrieval Adaptation using Target Domain Description

Helia Hashemi, Yong Zhuang, Sachith Sri Ram Kothur, Srivas Prasad,, Edgar Meij, W. Bruce Croft

PDF

Open Access

TL;DR

This paper proposes a new zero-shot domain adaptation method for dense retrieval models using only a textual description of the target domain, creating synthetic data to improve retrieval performance.

Contribution

It introduces a novel adaptation approach that leverages domain descriptions to generate synthetic data, enabling effective domain adaptation without access to target documents.

Findings

01

Synthetic data improves retrieval in new domains

02

Method outperforms baseline zero-shot approaches

03

Effective across diverse target domains

Abstract

In information retrieval (IR), domain adaptation is the process of adapting a retrieval model to a new domain whose data distribution is different from the source domain. Existing methods in this area focus on unsupervised domain adaptation where they have access to the target document collection or supervised (often few-shot) domain adaptation where they additionally have access to (limited) labeled data in the target domain. There also exists research on improving zero-shot performance of retrieval models with no adaptation. This paper introduces a new category of domain adaptation in IR that is as-yet unexplored. Here, similar to the zero-shot setting, we assume the retrieval model does not have access to the target document collection. In contrast, it does have access to a brief textual description that explains the target domain. We define a taxonomy of domain attributes in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Multimodal Machine Learning Applications

MethodsFocus