Adversarial Learning for Fine-grained Image Search
Kevin Lin, Fan Yang, Qiaosong Wang, Robinson Piramuthu

TL;DR
This paper introduces FGGAN, an end-to-end adversarial network that learns discriminative features for fine-grained image search by transforming multi-view images into a canonical view, improving robustness in open-set scenarios.
Contribution
The paper proposes a novel GAN-based network that automatically handles pose variations and generalizes to unseen categories for fine-grained image search.
Findings
Achieves up to 10% relative improvement over baselines.
Demonstrates robustness in both closed-set and open-set scenarios.
Validates effectiveness on multiple datasets.
Abstract
Fine-grained image search is still a challenging problem due to the difficulty in capturing subtle differences regardless of pose variations of objects from fine-grained categories. In practice, a dynamic inventory with new fine-grained categories adds another dimension to this challenge. In this work, we propose an end-to-end network, called FGGAN, that learns discriminative representations by implicitly learning a geometric transformation from multi-view images for fine-grained image search. We integrate a generative adversarial network (GAN) that can automatically handle complex view and pose variations by converting them to a canonical view without any predefined transformations. Moreover, in an open-set scenario, our network is able to better match images from unseen and unknown fine-grained categories. Extensive experiments on two public datasets and a newly collected dataset have…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques
