Active Search for High Recall: a Non-Stationary Extension of Thompson Sampling
Jean-Michel Renders (NaverLabs Europe)

TL;DR
This paper introduces a non-stationary Thompson Sampling approach for Active Search, effectively handling low prevalence and multi-modal relevance, to maximize relevant object retrieval with minimal effort.
Contribution
It extends Thompson Sampling to a non-stationary setting for Active Search, incorporating soft-clustering and a two-level decision process for improved high-recall performance.
Findings
Effective retrieval of relevant objects with minimal effort
Handles multi-modal and low prevalence classes
Achieves high recall with reduced long-term effort
Abstract
We consider the problem of Active Search, where a maximum of relevant objects - ideally all relevant objects - should be retrieved with the minimum effort or minimum time. Typically, there are two main challenges to face when tackling this problem: first, the class of relevant objects has often low prevalence and, secondly, this class can be multi-faceted or multi-modal: objects could be relevant for completely different reasons. To solve this problem and its associated issues, we propose an approach based on a non-stationary (aka restless) extension of Thompson Sampling, a well-known strategy for Multi-Armed Bandits problems. The collection is first soft-clustered into a finite set of components and a posterior distribution of getting a relevant object inside each cluster is updated after receiving the user feedback about the proposed instances. The "next instance" selection strategy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
