Context-Aware Sentence/Passage Term Importance Estimation For First Stage Retrieval
Zhuyun Dai, Jamie Callan

TL;DR
This paper introduces a deep learning framework that leverages BERT to generate context-aware term weights for sentences and passages, significantly enhancing first-stage retrieval accuracy without high computational costs.
Contribution
It proposes a novel deep contextualized term weighting method that integrates with standard retrieval systems, improving initial retrieval effectiveness using BERT-based representations.
Findings
DeepCT improves retrieval accuracy across four datasets.
The method produces efficient term weights compatible with inverted indexes.
It outperforms traditional frequency-based weighting methods.
Abstract
Term frequency is a common method for identifying the importance of a term in a query or document. But it is a weak signal, especially when the frequency distribution is flat, such as in long queries or short documents where the text is of sentence/passage-length. This paper proposes a Deep Contextualized Term Weighting framework that learns to map BERT's contextualized text representations to context-aware term weights for sentences and passages. When applied to passages, DeepCT-Index produces term weights that can be stored in an ordinary inverted index for passage retrieval. When applied to query text, DeepCT-Query generates a weighted bag-of-words query. Both types of term weight can be used directly by typical first-stage retrieval algorithms. This is novel because most deep neural network based ranking models have higher computational costs, and thus are restricted to later-stage…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Information Retrieval and Search Behavior · Natural Language Processing Techniques
