Limitation of Characterizing Implicit Regularization by Data-independent   Functions

Leyang Zhang; Zhi-Qin John Xu; Tao Luo; Yaoyu Zhang

arXiv:2201.12198·cs.LG·September 8, 2023

Limitation of Characterizing Implicit Regularization by Data-independent Functions

Leyang Zhang, Zhi-Qin John Xu, Tao Luo, Yaoyu Zhang

PDF

Open Access

TL;DR

This paper investigates the limitations of characterizing neural network implicit regularization using data-independent functions, highlighting the profound data dependency and proposing mechanisms that demonstrate these limitations.

Contribution

It introduces two dynamical mechanisms showing that certain neural networks cannot be fully characterized by data-independent functions, emphasizing data dependency in implicit regularization.

Findings

01

Data-independent functions cannot fully characterize some neural networks.

02

Two-point and One-point Overlapping mechanisms demonstrate these limitations.

03

Results highlight the importance of data dependency in understanding implicit regularization.

Abstract

In recent years, understanding the implicit regularization of neural networks (NNs) has become a central task in deep learning theory. However, implicit regularization is itself not completely defined and well understood. In this work, we attempt to mathematically define and study implicit regularization. Importantly, we explore the limitations of a common approach to characterizing implicit regularization using data-independent functions. We propose two dynamical mechanisms, i.e., Two-point and One-point Overlapping mechanisms, based on which we provide two recipes for producing classes of one-hidden-neuron NNs that provably cannot be fully characterized by a type of or all data-independent functions. Following the previous works, our results further emphasize the profound data dependency of implicit regularization in general, inspiring us to study in detail the data dependency of NN…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Model Reduction and Neural Networks · Machine Learning and ELM