Interpretability Analysis of Domain Adapted Dense Retrievers
Goksenin Yuksel, Jaap Kamps

TL;DR
This paper introduces an interpretability method using integrated gradients to analyze how domain adaptation affects dense retrievers' focus on domain-specific terminology, enhancing understanding of their behavior across different domains.
Contribution
It develops a novel explainability framework for dense retrievers using integrated gradients and a new baseline, revealing how domain adaptation shifts model focus to in-domain terms.
Findings
Domain-adapted models focus more on in-domain terminology.
Integrated gradients effectively explain dense retriever behavior.
The method provides both instance-based and ranking-based explanations.
Abstract
Dense retrievers have demonstrated significant potential for neural information retrieval; however, they exhibit a lack of robustness to domain shifts, thereby limiting their efficacy in zero-shot settings across diverse domains. Previous research has investigated unsupervised domain adaptation techniques to adapt dense retrievers to target domains. However, these studies have not focused on explainability analysis to understand how such adaptations alter the model's behavior. In this paper, we propose utilizing the integrated gradients framework to develop an interpretability method that provides both instance-based and ranking-based explanations for dense retrievers. To generate these explanations, we introduce a novel baseline that reveals both query and document attributions. This method is used to analyze the effects of domain adaptation on input attributions for query and document…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Explainable Artificial Intelligence (XAI)
MethodsFocus
