A Unified Batch Selection Policy for Active Metric Learning
Priyadarshini K, Siddhartha Chaudhuri, Vivek Borkar, Subhasis, Chaudhuri

TL;DR
This paper introduces a novel batch active metric learning method that uses maximum entropy principles and joint entropy maximization to select diverse, informative triplet batches efficiently, outperforming existing approaches.
Contribution
It proposes the first unified scoring method for batch selection in active metric learning that balances informativeness and diversity using a submodular optimization framework.
Findings
Outperforms state-of-the-art methods on multiple datasets.
Demonstrates robustness and generalization across applications.
Efficient greedy algorithm achieves near-optimal batch selection.
Abstract
Active metric learning is the problem of incrementally selecting high-utility batches of training data (typically, ordered triplets) to annotate, in order to progressively improve a learned model of a metric over some input domain as rapidly as possible. Standard approaches, which independently assess the informativeness of each triplet in a batch, are susceptible to highly correlated batches with many redundant triplets and hence low overall utility. While a recent work \cite{kumari2020batch} proposes batch-decorrelation strategies for metric learning, they rely on ad hoc heuristics to estimate the correlation between two triplets at a time. We present a novel batch active metric learning method that leverages the Maximum Entropy Principle to learn the least biased estimate of triplet distribution for a given set of prior constraints. To avoid redundancy between triplets, our method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Imbalanced Data Classification Techniques
MethodsHigh-Order Consensuses
