A Dynamics Theory of Implicit Regularization in Deep Low-Rank Matrix   Factorization

Jian Cao; Chen Qian; Yihui Huang; Dicheng Chen; Yuncheng Gao; Jiyang; Dong; Di Guo; Xiaobo Qu

arXiv:2212.14150·cs.LG·August 14, 2023

A Dynamics Theory of Implicit Regularization in Deep Low-Rank Matrix Factorization

Jian Cao, Chen Qian, Yihui Huang, Dicheng Chen, Yuncheng Gao, Jiyang, Dong, Di Guo, Xiaobo Qu

PDF

Open Access

TL;DR

This paper introduces a new theoretical approach to understanding how deep matrix factorization implicitly regularizes solutions, linking saddle point escaping stages to matrix rank and providing insights into the training dynamics of deep neural networks.

Contribution

It proposes a landscape analysis approach for discrete gradient dynamics, establishing a theoretical connection between saddle point escaping stages and matrix rank in deep matrix factorization.

Findings

01

DMF converges to second-order critical points after R saddle point escaping stages for rank-R matrices

02

Theoretical results are experimentally verified on low-rank matrix reconstruction

03

Provides new insights into implicit regularization in deep learning

Abstract

Implicit regularization is an important way to interpret neural networks. Recent theory starts to explain implicit regularization with the model of deep matrix factorization (DMF) and analyze the trajectory of discrete gradient dynamics in the optimization process. These discrete gradient dynamics are relatively small but not infinitesimal, thus fitting well with the practical implementation of neural networks. Currently, discrete gradient dynamics analysis has been successfully applied to shallow networks but encounters the difficulty of complex computation for deep networks. In this work, we introduce another discrete gradient dynamics approach to explain implicit regularization, i.e. landscape analysis. It mainly focuses on gradient regions, such as saddle points and local minima. We theoretically establish the connection between saddle point escaping (SPE) stages and the matrix rank…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Matrix Theory and Algorithms · Model Reduction and Neural Networks