An In-depth Analysis of Passage-Level Label Transfer for Contextual   Document Ranking

Koustav Rudra; Zeon Trevor Fernando; Avishek Anand

arXiv:2103.16669·cs.IR·December 7, 2023·1 cites

An In-depth Analysis of Passage-Level Label Transfer for Contextual Document Ranking

Koustav Rudra, Zeon Trevor Fernando, Avishek Anand

PDF

Open Access 1 Repo

TL;DR

This paper investigates passage-level label transfer for document ranking with pre-trained language models, revealing label noise issues and proposing a weak supervision scheme that improves retrieval performance and efficiency.

Contribution

It provides a detailed analysis of splitting and label transfer strategies, identifying issues with label noise and proposing a novel weak supervision approach for better passage-level relevance labeling.

Findings

01

Direct label transfer introduces noise affecting effectiveness.

02

Fine-grained splitting increases query processing time.

03

Proposed weak supervision scheme improves nDCG scores by 3-14%.

Abstract

Pre-trained contextual language models such as BERT, GPT, and XLnet work quite well for document retrieval tasks. Such models are fine-tuned based on the query-document/query-passage level relevance labels to capture the ranking signals. However, the documents are longer than the passages and such document ranking models suffer from the token limitation (512) of BERT. Researchers proposed ranking strategies that either truncate the documents beyond the token limit or chunk the documents into units that can fit into the BERT. In the later case, the relevance labels are either directly transferred from the original query-document pair or learned through some external model. In this paper, we conduct a detailed study of the design decisions about splitting and label transfer on retrieval effectiveness and efficiency. We find that direct transfer of relevance labels from documents to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

krudra/qadocrank
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Domain Adaptation and Few-Shot Learning

MethodsLinear Layer · WordPiece · Weight Decay · Attention Is All You Need · Softmax · Dense Connections · Linear Warmup With Linear Decay · Dropout · Residual Connection · Refunds@Expedia|||How do I get a full refund from Expedia?