Stochastic Batch Augmentation with An Effective Distilled Dynamic Soft   Label Regularizer

Qian Li; Qingyuan Hu; Yong Qi; Saiyu Qi; Jie Ma; and Jian Zhang

arXiv:2006.15284·cs.LG·June 30, 2020

Stochastic Batch Augmentation with An Effective Distilled Dynamic Soft Label Regularizer

Qian Li, Qingyuan Hu, Yong Qi, Saiyu Qi, Jie Ma, and Jian Zhang

PDF

Open Access

TL;DR

This paper introduces Stochastic Batch Augmentation (SBA), a novel training framework that enhances neural network generalization by incorporating distribution-aware soft label regularization during data augmentation.

Contribution

The work proposes SBA, which stochastically applies augmentation with a dynamic soft label regularizer based on distribution similarity, improving training efficiency and model generalization.

Findings

01

SBA improves generalization on CIFAR-10, CIFAR-100, and ImageNet.

02

SBA accelerates convergence during training.

03

SBA outperforms traditional augmentation methods.

Abstract

Data augmentation have been intensively used in training deep neural network to improve the generalization, whether in original space (e.g., image space) or representation space. Although being successful, the connection between the synthesized data and the original data is largely ignored in training, without considering the distribution information that the synthesized samples are surrounding the original sample in training. Hence, the behavior of the network is not optimized for this. However, that behavior is crucially important for generalization, even in the adversarial setting, for the safety of the deep learning system. In this work, we propose a framework called Stochastic Batch Augmentation (SBA) to address these problems. SBA stochastically decides whether to augment at iterations controlled by the batch scheduler and in which a ''distilled'' dynamic soft label regularization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings