Efficient Inverted Indexes for Approximate Retrieval over Learned Sparse   Representations

Sebastian Bruch; Franco Maria Nardini; Cosimo Rulli; Rossano Venturini

arXiv:2404.18812·cs.IR·July 15, 2024

Efficient Inverted Indexes for Approximate Retrieval over Learned Sparse Representations

Sebastian Bruch, Franco Maria Nardini, Cosimo Rulli, Rossano Venturini

PDF

1 Repo

TL;DR

This paper introduces Seismic, a novel inverted index organization that enables fast, approximate retrieval over learned sparse embeddings, significantly improving speed while maintaining high recall compared to existing solutions.

Contribution

The paper proposes a new index structure with geometrically-cohesive blocks and summary vectors, achieving rapid approximate retrieval over learned sparse representations.

Findings

01

Seismic achieves sub-millisecond per-query latency.

02

It outperforms state-of-the-art inverted index solutions by 10-100 times.

03

It surpasses the BigANN Challenge winners in speed and recall.

Abstract

Learned sparse representations form an attractive class of contextual embeddings for text retrieval. That is so because they are effective models of relevance and are interpretable by design. Despite their apparent compatibility with inverted indexes, however, retrieval over sparse embeddings remains challenging. That is due to the distributional differences between learned embeddings and term frequency-based lexical models of relevance such as BM25. Recognizing this challenge, a great deal of research has gone into, among other things, designing retrieval algorithms tailored to the properties of learned sparse representations, including approximate retrieval systems. In fact, this task featured prominently in the latest BigANN Challenge at NeurIPS 2023, where approximate algorithms were evaluated on a large benchmark dataset by throughput and recall. In this work, we propose a novel…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tuskanny/seismic
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.