HyQE: Ranking Contexts with Hypothetical Query Embeddings

Weichao Zhou; Jiaxin Zhang; Hilaf Hasson; Anu Singh; Wenchao Li

arXiv:2410.15262·cs.IR·October 22, 2024

HyQE: Ranking Contexts with Hypothetical Query Embeddings

Weichao Zhou, Jiaxin Zhang, Hilaf Hasson, Anu Singh, Wenchao Li

PDF

Open Access 1 Repo 1 Video

TL;DR

HyQE introduces a scalable ranking method that leverages hypothetical query embeddings generated by large language models to improve context relevance ranking without fine-tuning, demonstrating enhanced performance across benchmarks.

Contribution

The paper presents a novel framework combining embedding similarity and LLM capabilities for context ranking without fine-tuning, addressing scalability and domain adaptation issues.

Findings

01

Improves ranking performance on multiple benchmarks

02

Efficient inference compatible with various retrieval techniques

03

Does not require LLM fine-tuning or domain-specific data

Abstract

In retrieval-augmented systems, context ranking techniques are commonly employed to reorder the retrieved contexts based on their relevance to a user query. A standard approach is to measure this relevance through the similarity between contexts and queries in the embedding space. However, such similarity often fails to capture the relevance. Alternatively, large language models (LLMs) have been used for ranking contexts. However, they can encounter scalability issues when the number of candidate contexts grows and the context window sizes of the LLMs remain constrained. Additionally, these approaches require fine-tuning LLMs with domain-specific data. In this work, we introduce a scalable ranking framework that combines embedding similarity and LLM capabilities without requiring LLM fine-tuning. Our framework uses a pre-trained LLM to hypothesize the user query based on the retrieved…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zwc662/hyqe
pytorchOfficial

Videos

HyQE: Ranking Contexts with Hypothetical Query Embeddings· underline

Taxonomy

TopicsData Management and Algorithms · Rough Sets and Fuzzy Logic