Active Search for High Recall: a Non-Stationary Extension of Thompson   Sampling

Jean-Michel Renders (NaverLabs Europe)

arXiv:1712.09550·cs.IR·March 23, 2018

Active Search for High Recall: a Non-Stationary Extension of Thompson Sampling

Jean-Michel Renders (NaverLabs Europe)

PDF

TL;DR

This paper introduces a non-stationary Thompson Sampling approach for Active Search, effectively handling low prevalence and multi-modal relevance, to maximize relevant object retrieval with minimal effort.

Contribution

It extends Thompson Sampling to a non-stationary setting for Active Search, incorporating soft-clustering and a two-level decision process for improved high-recall performance.

Findings

01

Effective retrieval of relevant objects with minimal effort

02

Handles multi-modal and low prevalence classes

03

Achieves high recall with reduced long-term effort

Abstract

We consider the problem of Active Search, where a maximum of relevant objects - ideally all relevant objects - should be retrieved with the minimum effort or minimum time. Typically, there are two main challenges to face when tackling this problem: first, the class of relevant objects has often low prevalence and, secondly, this class can be multi-faceted or multi-modal: objects could be relevant for completely different reasons. To solve this problem and its associated issues, we propose an approach based on a non-stationary (aka restless) extension of Thompson Sampling, a well-known strategy for Multi-Armed Bandits problems. The collection is first soft-clustered into a finite set of components and a posterior distribution of getting a relevant object inside each cluster is updated after receiving the user feedback about the proposed instances. The "next instance" selection strategy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.