pEBR: A Probabilistic Approach to Embedding Based Retrieval
Han Zhang, Yunjiang Jiang, Mingming Li, Haowei Yuan, Yiming Qiu, Wen-Yun Yang

TL;DR
This paper introduces pEBR, a probabilistic framework for embedding-based retrieval that dynamically adjusts similarity thresholds to improve recall and precision across different query types.
Contribution
pEBR is the first to model item distribution conditioned on queries, enabling dynamic thresholding for better retrieval performance.
Findings
Significantly improves retrieval precision and recall.
Effectively captures differences between head and tail queries.
Outperforms existing fixed-threshold retrieval methods.
Abstract
Embedding-based retrieval aims to learn a shared semantic representation space for both queries and items, enabling efficient and effective item retrieval through approximate nearest neighbor (ANN) algorithms. In current industrial practice, retrieval systems typically retrieve a fixed number of items for each query. However, this fixed-size retrieval often results in insufficient recall for head queries and low precision for tail queries. This limitation largely stems from the dominance of frequentist approaches in loss function design, which fail to address this challenge in industry. In this paper, we propose a novel \textbf{p}robabilistic \textbf{E}mbedding-\textbf{B}ased \textbf{R}etrieval (\textbf{pEBR}) framework. Our method models the item distribution conditioned on each query, enabling the use of a dynamic cosine similarity threshold derived from the cumulative distribution…
Peer Reviews
Decision·Submitted to ICLR 2024
The paper is generally well-written with a clear motivation from the weaknesses of existing frameworks. The authors present empirical evidence to the central research problem.
1. The authors claim that the paper is the first to introduce probabilistic modelling into embedding based retrieval, which remains doubtful to me. Probabilistic embedding has a long history in machine learning, as well as probabilistic information retrieval, at least dating back to probabilistic ranking principle (Robertson, 1977), which essentially seeks to model the relevance of items to a query. Such literature was not reviewed in the paper. Furthermore, this formulation for modelling the re
The work is around dense embedding retrieval which is an important topic specially in industrial applications with large catalogs of items. It is interesting to see this probabilistic approach for retrieving items as it avoids using standard approaches which may bring inefficiencies.
It would be great if the authors could enhance the related work with probabilistic embedding approaches especially few from the domain of images and metric learning and also draw some parallels.For example, Probabilistic Embeddings for Cross-Modal Retrieval, CVPR 2021. One could use such an approach for performing retrieval tasks as it models uncertainty. Is very difficult to assess the result. Could you please give more details for the dataset? What type of user log are these? How the model th
1. Improved precision compared to DSSM, especially on tail queries. Recall is better than DSSM but by a small margin. 2. The model produces a variable number of results based on relevance cutoff. So where there are more results relevant to a query, the model can retrieve more of them compared to DSSM.
1. Assessment is sparse. The baseline chosen, DSSM, is rather old (from 2013). 2. Comparison on one dataset and its unclear if it is public.
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
