ColBERT: Efficient and Effective Passage Search via Contextualized Late Interaction over BERT
Omar Khattab, Matei Zaharia

TL;DR
ColBERT introduces a late interaction architecture for BERT-based passage search that significantly reduces computational costs while maintaining high effectiveness, enabling fast and scalable information retrieval.
Contribution
The paper presents ColBERT, a novel BERT-based ranking model with a late interaction mechanism that allows pre-computation of document representations for efficient retrieval.
Findings
ColBERT achieves competitive effectiveness with existing BERT models.
It is two orders of magnitude faster than traditional BERT re-ranking models.
Requires four orders-of-magnitude fewer FLOPs per query.
Abstract
Recent progress in Natural Language Understanding (NLU) is driving fast-paced advances in Information Retrieval (IR), largely owed to fine-tuning deep language models (LMs) for document ranking. While remarkably effective, the ranking models based on these LMs increase computational cost by orders of magnitude over prior approaches, particularly as they must feed each query-document pair through a massive neural network to compute a single relevance score. To tackle this, we present ColBERT, a novel ranking model that adapts deep LMs (in particular, BERT) for efficient retrieval. ColBERT introduces a late interaction architecture that independently encodes the query and the document using BERT and then employs a cheap yet powerful interaction step that models their fine-grained similarity. By delaying and yet retaining this fine-granular interaction, ColBERT can leverage the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗BAAI/bge-m3model· 14.5M dl· ♡ 287114.5M dl♡ 2871
- 🤗colbert-ir/colbertv2.0model· 13.7M dl· ♡ 31713.7M dl♡ 317
- 🤗vidore/colqwen2-v1.0-hfmodel· 101k dl· ♡ 23101k dl♡ 23
- 🤗sebastian-hofstaetter/colbert-distilbert-margin_mse-T2-msmarcomodel· 16 dl· ♡ 1516 dl♡ 15
- 🤗vespa-engine/col-minilmmodel· 5 dl· ♡ 265 dl♡ 26
- 🤗vespa-engine/colbert-mediummodel· 7 dl· ♡ 37 dl♡ 3
- 🤗vjeronymo2/mColBERTmodel· ♡ 4♡ 4
- 🤗Crystalcareai/Colbertv2model· 95 dl95 dl
- 🤗jinaai/jina-colbert-v1-enmodel· 267 dl· ♡ 100267 dl♡ 100
- 🤗BAAI/bge-m3-unsupervisedmodel· 6.0k dl· ♡ 186.0k dl♡ 18
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Web Data Mining and Analysis · Natural Language Processing Techniques
MethodsLinear Layer · Residual Connection · Attention Dropout · Linear Warmup With Linear Decay · Weight Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Adam · WordPiece · Softmax
