An Alternative to FLOPS Regularization to Effectively Productionize SPLADE-Doc
Aldo Porco, Dhruv Mehra, Igor Malioutov, Karthik Radhakrishnan, Moniba Keymanesh, Daniel Preo\c{t}iuc-Pietro, Sean MacAvaney, Pengxiang Cheng

TL;DR
This paper introduces DF-FLOPS, a new regularization method for Sparse Retrieval models that reduces high-frequency term usage, significantly lowering retrieval latency while maintaining retrieval effectiveness in production environments.
Contribution
The paper proposes DF-FLOPS, an alternative to FLOPS regularization, which penalizes high document frequency terms to improve retrieval speed and practicality in production search engines.
Findings
DF-FLOPS reduces high-DF terms and retrieval latency by around 10x.
Maintains effectiveness with only a 2.2-point drop in MRR@10.
Improves performance in cross-domain retrieval tasks.
Abstract
Learned Sparse Retrieval (LSR) models encode text as weighted term vectors, which need to be sparse to leverage inverted index structures during retrieval. SPLADE, the most popular LSR model, uses FLOPS regularization to encourage vector sparsity during training. However, FLOPS regularization does not ensure sparsity among terms - only within a given query or document. Terms with very high Document Frequencies (DFs) substantially increase latency in production retrieval engines, such as Apache Solr, due to their lengthy posting lists. To address the issue of high DFs, we present a new variant of FLOPS regularization: DF-FLOPS. This new regularization technique penalizes the usage of high-DF terms, thereby shortening posting lists and reducing retrieval latency. Unlike other inference-time sparsification methods, such as stopword removal, DF-FLOPS regularization allows for the selective…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed and Parallel Computing Systems
