Towards Non-I.I.D. Image Classification: A Dataset and Baselines

Yue He; Zheyan Shen; Peng Cui

arXiv:1906.02899·cs.CV·August 15, 2019·21 cites

Towards Non-I.I.D. Image Classification: A Dataset and Baselines

Yue He, Zheyan Shen, Peng Cui

PDF

Open Access

TL;DR

This paper introduces NICO, a new Non-I.I.D. image dataset, and baseline models to study and improve image classification performance when training and testing data distributions differ.

Contribution

The paper creates and releases NICO, a flexible Non-I.I.D. dataset, and proposes baseline ConvNet models with batch balancing for Non-I.I.D. image classification.

Findings

01

NICO supports various Non-I.I.D. scenarios effectively.

02

ConvNet models trained on NICO perform well in Non-I.I.D. settings.

03

Batch balancing improves ConvNet performance in Non-I.I.D. conditions.

Abstract

I.I.D. hypothesis between training and testing data is the basis of numerous image classification methods. Such property can hardly be guaranteed in practice where the Non-IIDness is common, causing instable performances of these models. In literature, however, the Non-I.I.D. image classification problem is largely understudied. A key reason is lacking of a well-designed dataset to support related research. In this paper, we construct and release a Non-I.I.D. image dataset called NICO, which uses contexts to create Non-IIDness consciously. Compared to other datasets, extended analyses prove NICO can support various Non-I.I.D. situations with sufficient flexibility. Meanwhile, we propose a baseline model with ConvNet structure for General Non-I.I.D. image classification, where distribution of testing data is unknown but different from training data. The experimental results demonstrate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI · Advanced Neural Network Applications