94% on CIFAR-10 in 3.29 Seconds on a Single GPU

Keller Jordan

arXiv:2404.00498·cs.LG·April 8, 2024·2 cites

94% on CIFAR-10 in 3.29 Seconds on a Single GPU

Keller Jordan

PDF

Open Access 1 Repo

TL;DR

This paper presents a highly efficient training method for CIFAR-10 that achieves near state-of-the-art accuracy within seconds on a single GPU, significantly speeding up research workflows.

Contribution

The paper introduces a novel training approach and a derandomized augmentation technique that drastically reduce training time for CIFAR-10.

Findings

01

Achieves 94% accuracy in 3.29 seconds on a single GPU

02

Proposes a derandomized horizontal flipping augmentation that outperforms standard methods

03

Demonstrates significant speedup in CIFAR-10 training without sacrificing accuracy

Abstract

CIFAR-10 is among the most widely used datasets in machine learning, facilitating thousands of research projects per year. To accelerate research and reduce the cost of experiments, we introduce training methods for CIFAR-10 which reach 94% accuracy in 3.29 seconds, 95% in 10.4 seconds, and 96% in 46.3 seconds, when run on a single NVIDIA A100 GPU. As one factor contributing to these training speeds, we propose a derandomized variant of horizontal flipping augmentation, which we show improves over the standard method in every case where flipping is beneficial over no flipping at all. Our code is released at https://github.com/KellerJordan/cifar10-airbench.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kellerjordan/cifar10-airbench
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMedical Image Segmentation Techniques · Medical Imaging Techniques and Applications