Generalization Bounds for Vicinal Risk Minimization Principle
Chao Zhang, Min-Hsiu Hsieh, Dacheng Tao

TL;DR
This paper provides a theoretical analysis of the generalization bounds for vicinal risk minimization (VRM), showing how the choice of vicinal functions influences learning performance and offering insights into existing VRM models.
Contribution
It establishes the first comprehensive theoretical bounds for VRM, linking function class complexity, vicinal functions, and generalization performance.
Findings
Function class complexity convolved with vicinal functions can be controlled by original class complexity.
Generalization bounds depend on the choice and quality of vicinal functions.
Theoretical explanations for existing VRM models like mixup and Gaussian-based models.
Abstract
The vicinal risk minimization (VRM) principle, first proposed by \citet{vapnik1999nature}, is an empirical risk minimization (ERM) variant that replaces Dirac masses with vicinal functions. Although there is strong numerical evidence showing that VRM outperforms ERM if appropriate vicinal functions are chosen, a comprehensive theoretical understanding of VRM is still lacking. In this paper, we study the generalization bounds for VRM. Our results support Vapnik's original arguments and additionally provide deeper insights into VRM. First, we prove that the complexity of function classes convolving with vicinal functions can be controlled by that of the original function classes under the assumption that the function class is composed of Lipschitz-continuous functions. Then, the resulting generalization bounds for VRM suggest that the generalization performance of VRM is also effected by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Machine Learning and Algorithms · Statistical Methods and Inference
MethodsMixup
