Some like it tough: Improving model generalization via progressively   increasing the training difficulty

Hannes Fassold

arXiv:2110.13058·cs.CV·October 26, 2021

Some like it tough: Improving model generalization via progressively increasing the training difficulty

Hannes Fassold

PDF

Open Access 1 Repo

TL;DR

This paper introduces mini-batch trimming, a simple method that progressively increases training difficulty by focusing on harder samples, leading to improved generalization in neural network models.

Contribution

The paper proposes a novel, easy-to-integrate training strategy that enhances model generalization by emphasizing difficult samples during later training stages.

Findings

01

Improved test error across multiple image classification datasets.

02

Easy integration without modifying existing network architectures.

03

Enhanced model robustness through progressive difficulty increase.

Abstract

In this work, we propose to progressively increase the training difficulty during learning a neural network model via a novel strategy which we call mini-batch trimming. This strategy makes sure that the optimizer puts its focus in the later training stages on the more difficult samples, which we identify as the ones with the highest loss in the current mini-batch. The strategy is very easy to integrate into an existing training pipeline and does not necessitate a change of the network model. Experiments on several image classification problems show that mini-batch trimming is able to increase the generalization ability (measured via final test error) of the trained model.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hfassold/mbtrim
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification

MethodsTest