Efficient Image Dataset Classification Difficulty Estimation for Predicting Deep-Learning Accuracy

Florian Scheidegger; Roxana Istrate; Giovanni Mariani; Luca Benini; Costas Bekas; Cristiano Malossi

arXiv:1803.09588·cs.CV·March 28, 2018

TL;DR

This paper introduces a fast method that estimates the classification difficulty of an image dataset as a single number, so that suitable neural-network configurations can be shortlisted before any expensive training or search.

Contribution

The paper proposes a dataset difficulty estimator that produces a single score 27x faster than training state-of-the-art networks, which speeds up network selection and hyper-parameter tuning.

Findings

01

Estimates dataset difficulty 27x faster than training state-of-the-art networks.

02

Helps guide neural network architecture and hyper-parameter search.

03

Reduces the computational cost of model selection.

Abstract

In the deep-learning community, new algorithms are published at an incredible pace. Therefore, solving an image classification problem for a new dataset becomes a challenging task, as it requires re-evaluating published algorithms and their different configurations in order to find a close-to-optimal classifier. To facilitate this process, before biasing our decision towards a class of neural networks or running an expensive search over the network space, we propose to estimate the classification difficulty of the dataset. Our method computes a single number that characterizes the dataset difficulty 27x faster than training state-of-the-art networks. The proposed method can be used in combination with network topology and hyper-parameter search optimizers to efficiently drive the search towards promising neural-network configurations.
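The abstract does not say how the difficulty number is computed, so the following is only a toy illustration of the general idea of a cheap, single-number difficulty score, not the authors' method. It assumes a stand-in estimator: the held-out error of a nearest-centroid classifier on synthetic 2-D data. The names `make_dataset` and `difficulty_score`, the Gaussian data, and the choice of probe are all hypothetical.

```python
import random


def make_dataset(n_per_class, separation, seed):
    """Synthetic toy data: two 2-D Gaussian classes whose centers lie
    `separation` apart on the x-axis (an assumption for illustration)."""
    rng = random.Random(seed)
    data = []
    for label, center_x in ((0, 0.0), (1, separation)):
        for _ in range(n_per_class):
            point = (rng.gauss(center_x, 1.0), rng.gauss(0.0, 1.0))
            data.append((point, label))
    rng.shuffle(data)
    return data


def difficulty_score(data, train_fraction=0.5):
    """Hypothetical proxy score in [0, 1]: one minus the held-out accuracy
    of a nearest-centroid probe. Higher means a harder dataset."""
    split = int(len(data) * train_fraction)
    train, test = data[:split], data[split:]

    # Fit the probe: accumulate per-class coordinate sums and counts.
    sums = {}
    for (x, y), label in train:
        sx, sy, n = sums.get(label, (0.0, 0.0, 0))
        sums[label] = (sx + x, sy + y, n + 1)
    centroids = {c: (sx / n, sy / n) for c, (sx, sy, n) in sums.items()}

    # Evaluate: assign each held-out point to the closest class centroid.
    correct = 0
    for (x, y), label in test:
        predicted = min(
            centroids,
            key=lambda c: (x - centroids[c][0]) ** 2 + (y - centroids[c][1]) ** 2,
        )
        correct += predicted == label
    return 1.0 - correct / len(test)


# Well-separated classes should score as easier than overlapping ones.
easy = difficulty_score(make_dataset(200, separation=6.0, seed=0))
hard = difficulty_score(make_dataset(200, separation=0.5, seed=0))
print(f"easy dataset score: {easy:.3f}")
print(f"hard dataset score: {hard:.3f}")
```

A score of this kind could be computed once per dataset and passed to a search procedure as a prior, for example to decide how large a network family to explore first. How the paper's estimator is actually defined and used should be taken from the paper itself.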

Code & Models

Repositories

IBM/iotnets
pytorch
