Fine-grained Categorization and Dataset Bootstrapping using Deep Metric Learning with Humans in the Loop
Yin Cui, Feng Zhou, Yuanqing Lin, Serge Belongie

TL;DR
This paper introduces an iterative deep metric learning framework with human-in-the-loop for fine-grained categorization, effectively expanding datasets and improving classification accuracy on challenging visual categories.
Contribution
The work presents a novel human-in-the-loop dataset bootstrapping approach using deep metric learning to handle data scarcity and high intra-class variance in fine-grained categorization.
Findings
Achieved significant performance improvements through dataset expansion.
Demonstrated state-of-the-art results on fine-grained datasets.
Effectively handled high intra-class variance with learned feature embeddings.
Abstract
Existing fine-grained visual categorization methods often suffer from three challenges: lack of training data, large number of fine-grained categories, and high intraclass vs. low inter-class variance. In this work we propose a generic iterative framework for fine-grained categorization and dataset bootstrapping that handles these three challenges. Using deep metric learning with humans in the loop, we learn a low dimensional feature embedding with anchor points on manifolds for each category. These anchor points capture intra-class variances and remain discriminative between classes. In each round, images with high confidence scores from our model are sent to humans for labeling. By comparing with exemplar images, labelers mark each candidate image as either a "true positive" or a "false positive". True positives are added into our current dataset and false positives are regarded as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods
