Joint Learning of Generative Translator and Classifier for Visually   Similar Classes

ByungIn Yoo; Tristan Sylvain; Yoshua Bengio; Junmo Kim

arXiv:1912.06994·cs.CV·December 3, 2020

Joint Learning of Generative Translator and Classifier for Visually Similar Classes

ByungIn Yoo, Tristan Sylvain, Yoshua Bengio, Junmo Kim

PDF

TL;DR

This paper introduces GTCN, a joint learning framework that combines generative translation and classification to enhance visual recognition of similar classes with limited data, outperforming baselines and matching state-of-the-art results.

Contribution

The paper presents a novel end-to-end joint learning approach for generative translation and classification, incorporating adaptive loss functions for improved data augmentation in visual recognition.

Findings

01

Training on 40% of data surpasses full-data baselines.

02

Achieves state-of-the-art performance with a lightweight architecture.

03

Effective data augmentation improves classification accuracy.

Abstract

In this paper, we propose a Generative Translation Classification Network (GTCN) for improving visual classification accuracy in settings where classes are visually similar and data is scarce. For this purpose, we propose joint learning from a scratch to train a classifier and a generative stochastic translation network end-to-end. The translation network is used to perform on-line data augmentation across classes, whereas previous works have mostly involved domain adaptation. To help the model further benefit from this data-augmentation, we introduce an adaptive fade-in loss and a quadruplet loss. We perform experiments on multiple datasets to demonstrate the proposed method's performance in varied settings. Of particular interest, training on 40% of the dataset is enough for our model to surpass the performance of baselines trained on the full dataset. When our architecture is trained…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.