SEINE: SEgment-based Indexing for NEural information retrieval
Sibo Dong, Justin Goldstein, and Grace Hui Yang

TL;DR
SEINE introduces a segment-based indexing framework that enhances neural information retrieval by enabling faster retrieval speeds while maintaining effectiveness, bridging the gap between interaction-based and representation-based methods.
Contribution
The paper proposes a novel segment-based inverted index framework, SEINE, supporting various interaction-based neural retrieval methods with significant speed improvements.
Findings
Achieves up to 28x faster retrieval speeds
Maintains comparable retrieval effectiveness
Supports multiple neural retrieval methods
Abstract
Many early neural Information Retrieval (NeurIR) methods are re-rankers that rely on a traditional first-stage retriever due to expensive query time computations. Recently, representation-based retrievers have gained much attention, which learns query representation and document representation separately, making it possible to pre-compute document representations offline and reduce the workload at query time. Both dense and sparse representation-based retrievers have been explored. However, these methods focus on finding the representation that best represents a text (aka metric learning) and the actual retrieval function that is responsible for similarity matching between query and document is kept at a minimum by using dot product. One drawback is that unlike traditional term-level inverted index, the index formed by these embeddings cannot be easily re-used by another retrieval…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and ELM
MethodsFocus
