CEQE: Contextualized Embeddings for Query Expansion

Shahrzad Naseri; Jeffrey Dalton; Andrew Yates; James Allan

arXiv:2103.05256·cs.IR·March 10, 2021

CEQE: Contextualized Embeddings for Query Expansion

Shahrzad Naseri, Jeffrey Dalton, Andrew Yates, James Allan

PDF

TL;DR

This paper introduces CEQE, a novel query expansion method using contextualized embeddings like BERT, which significantly improves retrieval effectiveness over static models and traditional PRF methods.

Contribution

The paper proposes CEQE, a new model leveraging query-focused contextualized embeddings for query expansion, demonstrating superior performance in ad-hoc retrieval tasks.

Findings

01

CEQE outperforms static embedding methods by up to 18% on Robust and 31% on Deep Learning.

02

CEQE improves over traditional PRF models in multiple retrieval scenarios.

03

Multiple expansion and reranking passes further enhance retrieval effectiveness.

Abstract

In this work we leverage recent advances in context-sensitive language models to improve the task of query expansion. Contextualized word representation models, such as ELMo and BERT, are rapidly replacing static embedding models. We propose a new model, Contextualized Embeddings for Query Expansion (CEQE), that utilizes query-focused contextualized embedding vectors. We study the behavior of contextual representations generated for query expansion in ad-hoc document retrieval. We conduct our experiments on probabilistic retrieval models as well as in combination with neural ranking models. We evaluate CEQE on two standard TREC collections: Robust and Deep Learning. We find that CEQE outperforms static embedding-based expansion methods on multiple collections (by up to 18% on Robust and 31% on Deep Learning on average precision) and also improves over proven probabilistic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsLinear Layer · Refunds@Expedia|||How do I get a full refund from Expedia? · Tanh Activation · Linear Warmup With Linear Decay · Sigmoid Activation · Weight Decay · Multi-Head Attention · Dense Connections · Softmax · Attention Is All You Need