Go Small and Similar: A Simple Output Decay Brings Better Performance
Xuan Cheng, Tianshu Xie, Xiaomin Wang, Jiali Deng, Minghui Liu, Ming, Liu

TL;DR
This paper introduces Output Decay, a novel regularization technique that encourages models to produce smaller and more similar output values, leading to significant performance improvements across various tasks.
Contribution
It proposes a new regularization method, Output Decay, which regularizes model outputs to improve deep learning performance, a concept not extensively explored before.
Findings
Output Decay improves model performance across multiple tasks.
Smaller and similar output distributions correlate with better results.
The method is versatile and widely applicable.
Abstract
Regularization and data augmentation methods have been widely used and become increasingly indispensable in deep learning training. Researchers who devote themselves to this have considered various possibilities. But so far, there has been little discussion about regularizing outputs of the model. This paper begins with empirical observations that better performances are significantly associated with output distributions, that have smaller average values and variances. By audaciously assuming there is causality involved, we propose a novel regularization term, called Output Decay, that enforces the model to assign smaller and similar output values on each class. Though being counter-intuitive, such a small modification result in a remarkable improvement on performance. Extensive experiments demonstrate the wide applicability, versatility, and compatibility of Output Decay.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Advanced Neural Network Applications
