Improving Resnet-9 Generalization Trained on Small Datasets

Omar Mohamed Awad; Habib Hajimolahoseini; Michael Lim; Gurpreet Gosal; Walid Ahmed; Yang Liu; Gordon Deng

arXiv:2309.03965·cs.LG·May 27, 2025·2 cites

Improving Resnet-9 Generalization Trained on Small Datasets

Omar Mohamed Awad, Habib Hajimolahoseini, Michael Lim, Gurpreet Gosal, Walid Ahmed, Yang Liu, Gordon Deng

PDF

Open Access

TL;DR

This paper introduces a novel training approach for ResNet-9 that significantly improves generalization on small datasets, achieving high accuracy rapidly and winning a hardware-aware training competition.

Contribution

The paper presents a combination of techniques including sharpness aware optimization and meta-learning to enhance ResNet-9's performance on limited data within strict time constraints.

Findings

01

Achieved 88% accuracy on small CIFAR-10 subset

02

Training time under 10 minutes

03

Won ICLR competition on hardware-aware efficient training

Abstract

This paper presents our proposed approach that won the first prize at the ICLR competition on Hardware Aware Efficient Training. The challenge is to achieve the highest possible accuracy in an image classification task in less than 10 minutes. The training is done on a small dataset of 5000 images picked randomly from CIFAR-10 dataset. The evaluation is performed by the competition organizers on a secret dataset with 1000 images of the same size. Our approach includes applying a series of technique for improving the generalization of ResNet-9 including: sharpness aware optimization, label smoothing, gradient centralization, input patch whitening as well as metalearning based training. Our experiments show that the ResNet-9 can achieve the accuracy of 88% while trained only on a 10% subset of CIFAR-10 dataset in less than 10 minuets

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Physical Unclonable Functions (PUFs) and Hardware Security

MethodsAttentive Walk-Aggregating Graph Neural Network