Generalizing Few Data to Unseen Domains Flexibly Based on Label Smoothing Integrated with Distributionally Robust Optimization
Yangdi Wang, Zhi-Hai Zhang, Su Xiu Xu, Wenming Guo

TL;DR
This paper introduces a novel method combining label smoothing with distributionally robust optimization to improve deep neural network generalization on unseen domains, especially in small datasets.
Contribution
It extends label smoothing with a DRO-based regularization, enabling flexible data shift to unseen domains and proposing an efficient gradient-iteration algorithm for training.
Findings
Demonstrates superior performance on small-scale anomaly classification tasks.
Proves the regularization does not affect convergence of the training algorithm.
Uses Bayesian optimization to tune hyperparameters effectively.
Abstract
Overfitting commonly occurs when applying deep neural networks (DNNs) on small-scale datasets, where DNNs do not generalize well from existing data to unseen data. The main reason resulting in overfitting is that small-scale datasets cannot reflect the situations of the real world. Label smoothing (LS) is an effective regularization method to prevent overfitting, avoiding it by mixing one-hot labels with uniform label vectors. However, LS only focuses on labels while ignoring the distribution of existing data. In this paper, we introduce the distributionally robust optimization (DRO) to LS, achieving shift the existing data distribution flexibly to unseen domains when training DNNs. Specifically, we prove that the regularization of LS can be extended to a regularization term for the DNNs parameters when integrating DRO. The regularization term can be utilized to shift existing data to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGrey System Theory Applications
MethodsLabel Smoothing
