Jitter: Random Jittering Loss Function

Zhicheng Cai; Chenglei Peng; Sidan Du

arXiv:2106.13749·cs.LG·July 8, 2021

Jitter: Random Jittering Loss Function

Zhicheng Cai, Chenglei Peng, Sidan Du

PDF

Open Access

TL;DR

Jitter introduces a novel regularization technique that adds randomness to the loss function, improving model robustness and generalization by flattening the loss landscape more effectively than previous methods.

Contribution

The paper proposes Jitter, a new random loss function-based regularization method that enhances generalization and outperforms flooding by making the loss curve flatter.

Findings

01

Jitter improves model performance more significantly than flooding.

02

Test loss curve descends twice with Jitter.

03

Jitter is domain-, task-, and model-independent.

Abstract

Regularization plays a vital role in machine learning optimization. One novel regularization method called flooding makes the training loss fluctuate around the flooding level. It intends to make the model continue to random walk until it comes to a flat loss landscape to enhance generalization. However, the hyper-parameter flooding level of the flooding method fails to be selected properly and uniformly. We propose a novel method called Jitter to improve it. Jitter is essentially a kind of random loss function. Before training, we randomly sample the Jitter Point from a specific probability distribution. The flooding level should be replaced by Jitter point to obtain a new target function and train the model accordingly. As Jitter point acting as a random factor, we actually add some randomness to the loss function, which is consistent with the fact that there exists innumerable random…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications