Using mixup as regularization and tuning hyper-parameters for ResNets

Venkata Bhanu Teja Pallakonda

arXiv:2111.11616·cs.CV·November 24, 2021

TL;DR

This paper revisits ResNet50 and improves it by using mixup data augmentation as a regularizer and by tuning hyper-parameters, aiming for better image classification when data and training resources are limited.

Contribution

It applies mixup augmentation as a regularizer for ResNet50 and offers insights into hyper-parameter tuning to boost performance.

Findings

01 Improved accuracy of ResNet50 with mixup augmentation

02 Enhanced training stability and efficiency

03 Better generalization on limited data

Abstract

While novel computer vision architectures are gaining traction, the impact of a model architecture often depends on changes to, or exploration of, training methods. Identity-mapping-based architectures such as ResNets and DenseNets have shown path-breaking results in image classification and remain go-to methods even now when the available data is fairly limited. Given their ease of training with limited resources, this work revisits ResNets and improves ResNet50 \cite{resnets} by using mixup data augmentation as regularization and by tuning the hyper-parameters.
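For readers unfamiliar with mixup, the idea is to train on convex combinations of pairs of examples and their labels: x̃ = λ·x_i + (1 − λ)·x_j and ỹ = λ·y_i + (1 − λ)·y_j, with λ drawn from a Beta(α, α) distribution. The sketch below illustrates that blending step only. It is not taken from the paper or its repository: the function names, the use of plain Python lists instead of PyTorch tensors, the single λ per batch, and the default α = 0.2 are all assumptions made for illustration.

```python
import random


def one_hot(label, num_classes):
    """Encode an integer class label as a one-hot list of floats."""
    return [1.0 if k == label else 0.0 for k in range(num_classes)]


def mixup_batch(inputs, labels, num_classes, alpha=0.2, rng=None):
    """Blend each example with a randomly chosen partner from the same batch.

    inputs: list of equal-length feature lists (stand-ins for image tensors).
    labels: list of integer class labels.
    alpha:  Beta(alpha, alpha) concentration; 0.2 is an assumed default,
            not a value reported by the paper.
    Returns (mixed_inputs, mixed_targets, lam).
    """
    rng = rng or random.Random()
    lam = rng.betavariate(alpha, alpha)  # one mixing coefficient per batch

    # A random permutation picks the partner for every example.
    perm = list(range(len(inputs)))
    rng.shuffle(perm)

    mixed_inputs, mixed_targets = [], []
    for i, j in enumerate(perm):
        mixed_inputs.append(
            [lam * a + (1.0 - lam) * b for a, b in zip(inputs[i], inputs[j])]
        )
        t_i = one_hot(labels[i], num_classes)
        t_j = one_hot(labels[j], num_classes)
        mixed_targets.append(
            [lam * a + (1.0 - lam) * b for a, b in zip(t_i, t_j)]
        )
    return mixed_inputs, mixed_targets, lam
```

The mixed targets are soft labels whose entries still sum to one, so they can be fed to a cross-entropy loss that accepts class probabilities. Small α values concentrate λ near 0 or 1, which keeps most mixed examples close to a real one; larger values mix more aggressively, which is one reason α is usually treated as a hyper-parameter to tune.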

Code & Models

Repositories

pvbhanuteja/mixrnet (PyTorch, official)

Taxonomy

Topics: Advanced Neural Network Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning

Methods: Mixup