A Reproducibility Study of PLAID
Sean MacAvaney, Nicola Tonellotto

TL;DR
This study reproduces and analyzes the PLAID algorithm for document retrieval, comparing it with re-ranking baselines, and explores how parameter tuning and recent modifications affect efficiency and effectiveness trade-offs.
Contribution
It fills gaps in the original PLAID work, evaluates its parameters, compares it with re-ranking baselines, and analyzes token clusters to understand retrieval performance.
Findings
PLAID's Pareto frontier depends on three key parameters.
Re-ranking with ColBERTv2 on BM25 pools offers better efficiency-effectiveness trade-offs at low latency.
Recent neighbor-based re-ranking modifications improve performance across all operational points.
Abstract
The PLAID (Performance-optimized Late Interaction Driver) algorithm for ColBERTv2 uses clustered term representations to retrieve and progressively prune documents for final (exact) document scoring. In this paper, we reproduce and fill in missing gaps from the original work. By studying the parameters PLAID introduces, we find that its Pareto frontier is formed of a careful balance among its three parameters; deviations beyond the suggested settings can substantially increase latency without necessarily improving its effectiveness. We then compare PLAID with an important baseline missing from the paper: re-ranking a lexical system. We find that applying ColBERTv2 as a re-ranker atop an initial pool of BM25 results provides better efficiency-effectiveness trade-offs in low-latency settings. However, re-ranking cannot reach peak effectiveness at higher latency settings due to limitations…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
