Adversarial Learning for Fine-grained Image Search

Kevin Lin; Fan Yang; Qiaosong Wang; Robinson Piramuthu

arXiv:1807.02247·cs.CV·July 9, 2018

Adversarial Learning for Fine-grained Image Search

Kevin Lin, Fan Yang, Qiaosong Wang, Robinson Piramuthu

PDF

Open Access

TL;DR

This paper introduces FGGAN, an end-to-end adversarial network that learns discriminative features for fine-grained image search by transforming multi-view images into a canonical view, improving robustness in open-set scenarios.

Contribution

The paper proposes a novel GAN-based network that automatically handles pose variations and generalizes to unseen categories for fine-grained image search.

Findings

01

Achieves up to 10% relative improvement over baselines.

02

Demonstrates robustness in both closed-set and open-set scenarios.

03

Validates effectiveness on multiple datasets.

Abstract

Fine-grained image search is still a challenging problem due to the difficulty in capturing subtle differences regardless of pose variations of objects from fine-grained categories. In practice, a dynamic inventory with new fine-grained categories adds another dimension to this challenge. In this work, we propose an end-to-end network, called FGGAN, that learns discriminative representations by implicitly learning a geometric transformation from multi-view images for fine-grained image search. We integrate a generative adversarial network (GAN) that can automatically handle complex view and pose variations by converting them to a canonical view without any predefined transformations. Moreover, in an open-set scenario, our network is able to better match images from unseen and unknown fine-grained categories. Extensive experiments on two public datasets and a newly collected dataset have…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques