An In-depth Analysis of Passage-Level Label Transfer for Contextual Document Ranking
Koustav Rudra, Zeon Trevor Fernando, Avishek Anand

TL;DR
This paper investigates passage-level label transfer for document ranking with pre-trained language models, revealing label noise issues and proposing a weak supervision scheme that improves retrieval performance and efficiency.
Contribution
It provides a detailed analysis of splitting and label transfer strategies, identifying issues with label noise and proposing a novel weak supervision approach for better passage-level relevance labeling.
Findings
Direct label transfer introduces noise affecting effectiveness.
Fine-grained splitting increases query processing time.
Proposed weak supervision scheme improves nDCG scores by 3-14%.
Abstract
Pre-trained contextual language models such as BERT, GPT, and XLnet work quite well for document retrieval tasks. Such models are fine-tuned based on the query-document/query-passage level relevance labels to capture the ranking signals. However, the documents are longer than the passages and such document ranking models suffer from the token limitation (512) of BERT. Researchers proposed ranking strategies that either truncate the documents beyond the token limit or chunk the documents into units that can fit into the BERT. In the later case, the relevance labels are either directly transferred from the original query-document pair or learned through some external model. In this paper, we conduct a detailed study of the design decisions about splitting and label transfer on retrieval effectiveness and efficiency. We find that direct transfer of relevance labels from documents to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Domain Adaptation and Few-Shot Learning
MethodsLinear Layer · WordPiece · Weight Decay · Attention Is All You Need · Softmax · Dense Connections · Linear Warmup With Linear Decay · Dropout · Residual Connection · Refunds@Expedia|||How do I get a full refund from Expedia?
