Generalization Error Analysis of Neural networks with Gradient Based Regularization
Lingfeng Li, Xue-Cheng Tai, Jiang Yang

TL;DR
This paper introduces a general framework for analyzing the generalization error of neural networks regularized via gradient-based methods like total variation and Tikhonov, demonstrating improved performance and robustness in image classification.
Contribution
It provides a novel theoretical framework for understanding the generalization error of gradient-regularized neural networks, supported by experimental validation.
Findings
Gradient-based regularization improves generalization in neural networks.
Such methods enhance adversarial robustness.
Experimental results confirm theoretical predictions.
Abstract
We study gradient-based regularization methods for neural networks. We mainly focus on two regularization methods: the total variation and the Tikhonov regularization. Applying these methods is equivalent to using neural networks to solve some partial differential equations, mostly in high dimensions in practical applications. In this work, we introduce a general framework to analyze the generalization error of regularized networks. The error estimate relies on two assumptions on the approximation error and the quadrature error. Moreover, we conduct some experiments on the image classification tasks to show that gradient-based methods can significantly improve the generalization ability and adversarial robustness of neural networks. A graphical extension of the gradient-based methods are also considered in the experiments.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Adversarial Robustness in Machine Learning · Neural Networks and Applications
