Generalization Error Analysis of Neural networks with Gradient Based Regularization

Lingfeng Li; Xue-Cheng Tai; Jiang Yang

arXiv:2107.02797·cs.LG·November 9, 2022

Open Access

TL;DR

This paper introduces a general framework for analyzing the generalization error of neural networks trained with gradient-based regularization, such as total variation and Tikhonov regularization, and reports image classification experiments showing improved generalization and adversarial robustness.

Contribution

The paper provides a general framework for estimating the generalization error of gradient-regularized neural networks, resting on assumptions about the approximation error and the quadrature error, and supports it with image classification experiments.

Findings

01. Gradient-based regularization improves generalization in neural networks.

02. Such methods enhance adversarial robustness.

03. Experimental results confirm theoretical predictions.

Abstract

We study gradient-based regularization methods for neural networks. We mainly focus on two regularization methods: total variation and Tikhonov regularization. Applying these methods is equivalent to using neural networks to solve certain partial differential equations, mostly in high dimensions in practical applications. In this work, we introduce a general framework to analyze the generalization error of regularized networks. The error estimate relies on two assumptions, one on the approximation error and one on the quadrature error. Moreover, we conduct experiments on image classification tasks to show that gradient-based methods can significantly improve the generalization ability and adversarial robustness of neural networks. A graphical extension of the gradient-based methods is also considered in the experiments.
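
The abstract names the two penalties without writing them out. As a rough illustration only, the sketch below assumes their standard forms: the total variation penalty averages the norm of the input gradient, |∇f(x)|, and the Tikhonov penalty averages its square, |∇f(x)|^2, with either one added to the data loss with a weight lam. The tiny tanh network, the function names, and the parameters (W1, b1, w2, lam) are hypothetical choices for this example and are not taken from the paper.

```python
import math

def input_gradient(x, W1, b1, w2):
    """Gradient with respect to the input x of the hypothetical scalar network
    f(x) = sum_j w2[j] * tanh(W1[j] . x + b1[j])."""
    grad = [0.0] * len(x)
    for row, bias, out_w in zip(W1, b1, w2):
        pre = sum(w * xi for w, xi in zip(row, x)) + bias
        # Chain rule: d/dz tanh(z) = 1 - tanh(z)^2
        slope = out_w * (1.0 - math.tanh(pre) ** 2)
        for i, w in enumerate(row):
            grad[i] += slope * w
    return grad

def gradient_penalties(samples, W1, b1, w2):
    """Sample averages of the two penalties over a batch of inputs.
    Returns (tv, tikhonov): the mean of |grad f| and the mean of |grad f|^2."""
    norms = [
        math.sqrt(sum(g * g for g in input_gradient(x, W1, b1, w2)))
        for x in samples
    ]
    tv = sum(norms) / len(norms)
    tikhonov = sum(n * n for n in norms) / len(norms)
    return tv, tikhonov

def regularized_loss(data_loss, penalty, lam):
    """Data-fitting loss plus a weighted gradient penalty (assumed form)."""
    return data_loss + lam * penalty

# Usage: two hidden units, two-dimensional inputs, three sample points.
W1 = [[1.0, -0.5], [0.3, 0.8]]
b1 = [0.0, 0.1]
w2 = [2.0, -1.0]
batch = [[0.0, 0.0], [0.5, -0.2], [-1.0, 1.0]]
tv, tikhonov = gradient_penalties(batch, W1, b1, w2)
print("TV penalty:", tv)
print("Tikhonov penalty:", tikhonov)
print("Loss with TV term:", regularized_loss(0.25, tv, lam=0.1))
```

The sample averages here stand in for integrals over the input domain, which is where a quadrature error of the kind mentioned in the abstract would enter. In practice the input gradient would come from automatic differentiation rather than a hand-written chain rule.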

Taxonomy

Topics: Model Reduction and Neural Networks · Adversarial Robustness in Machine Learning · Neural Networks and Applications