Positive-First Most Ambiguous: A Simple Active Learning Criterion for Interactive Retrieval of Rare Categories
Kawtar Zaher, Olivier Buisson, Alexis Joly

TL;DR
This paper introduces PF-MA, an active learning criterion tailored for interactive retrieval of rare categories, emphasizing class imbalance and user-centric objectives to improve early discovery and diversity in visual retrieval tasks.
Contribution
The paper proposes PF-MA, a simple active learning method that prioritizes likely positives near decision boundaries, enhancing retrieval of rare categories in imbalanced datasets.
Findings
PF-MA outperforms standard AL methods in early retrieval and coverage.
Experiments on long-tailed datasets show improved classifier performance.
PF-MA maintains high relevance in small batches, aiding user satisfaction.
Abstract
Real-world fine-grained visual retrieval often requires discovering a rare concept from large unlabeled collections with minimal supervision. This is especially critical in biodiversity monitoring, ecological studies, and long-tailed visual domains, where the target may represent only a tiny fraction of the data, creating highly imbalanced binary problems. Interactive retrieval with relevance feedback offers a practical solution: starting from a small query, the system selects candidates for binary user annotation and iteratively refines a lightweight classifier. While Active Learning (AL) is commonly used to guide selection, conventional AL assumes symmetric class priors and large annotation budgets, limiting effectiveness in imbalanced, low-budget, low-latency settings. We introduce Positive-First Most Ambiguous (PF-MA), a simple yet effective AL criterion that explicitly addresses…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
