Exploiting Sentence-Level Representations for Passage Ranking

Jurek Leonhardt; Fabian Beringer; Avishek Anand

arXiv:2106.07316·cs.IR·August 31, 2021

Exploiting Sentence-Level Representations for Passage Ranking

Jurek Leonhardt, Fabian Beringer, Avishek Anand

PDF

Open Access 1 Repo

TL;DR

This paper demonstrates that explicitly modeling sentence-level representations with Dynamic Memory Networks enhances passage re-ranking in open-domain QA, outperforming traditional fine-tuning of BERT and improving training efficiency.

Contribution

It introduces a memory-enhanced explicit sentence modeling approach using DMNs, showing significant improvements over standard BERT fine-tuning for passage re-ranking.

Findings

01

Memory-augmented models outperform vanilla BERT in passage re-ranking.

02

Freezing BERT and training only DMN layers yields comparable performance with higher efficiency.

03

Explicit sentence-level modeling captures useful relevance signals discarded in traditional approaches.

Abstract

Recently, pre-trained contextual models, such as BERT, have shown to perform well in language related tasks. We revisit the design decisions that govern the applicability of these models for the passage re-ranking task in open-domain question answering. We find that common approaches in the literature rely on fine-tuning a pre-trained BERT model and using a single, global representation of the input, discarding useful fine-grained relevance signals in token- or sentence-level representations. We argue that these discarded tokens hold useful information that can be leveraged. In this paper, we explicitly model the sentence-level representations by using Dynamic Memory Networks (DMNs) and conduct empirical evaluation to show improvements in passage re-ranking over fine-tuned vanilla BERT models by memory-enhanced explicit sentence modelling on a diverse set of open-domain QA datasets. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mrjleo/ranking-models
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Sentiment Analysis and Opinion Mining

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Adam · Linear Warmup With Linear Decay · Residual Connection · WordPiece · Attention Dropout · Dense Connections