A Unified Batch Selection Policy for Active Metric Learning

Priyadarshini K; Siddhartha Chaudhuri; Vivek Borkar; Subhasis; Chaudhuri

arXiv:2102.07365·cs.LG·August 3, 2021·1 cites

A Unified Batch Selection Policy for Active Metric Learning

Priyadarshini K, Siddhartha Chaudhuri, Vivek Borkar, Subhasis, Chaudhuri

PDF

Open Access

TL;DR

This paper introduces a novel batch active metric learning method that uses maximum entropy principles and joint entropy maximization to select diverse, informative triplet batches efficiently, outperforming existing approaches.

Contribution

It proposes the first unified scoring method for batch selection in active metric learning that balances informativeness and diversity using a submodular optimization framework.

Findings

01

Outperforms state-of-the-art methods on multiple datasets.

02

Demonstrates robustness and generalization across applications.

03

Efficient greedy algorithm achieves near-optimal batch selection.

Abstract

Active metric learning is the problem of incrementally selecting high-utility batches of training data (typically, ordered triplets) to annotate, in order to progressively improve a learned model of a metric over some input domain as rapidly as possible. Standard approaches, which independently assess the informativeness of each triplet in a batch, are susceptible to highly correlated batches with many redundant triplets and hence low overall utility. While a recent work \cite{kumari2020batch} proposes batch-decorrelation strategies for metric learning, they rely on ad hoc heuristics to estimate the correlation between two triplets at a time. We present a novel batch active metric learning method that leverages the Maximum Entropy Principle to learn the least biased estimate of triplet distribution for a given set of prior constraints. To avoid redundancy between triplets, our method…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Imbalanced Data Classification Techniques

MethodsHigh-Order Consensuses