A Dynamics Theory of Implicit Regularization in Deep Low-Rank Matrix Factorization
Jian Cao, Chen Qian, Yihui Huang, Dicheng Chen, Yuncheng Gao, Jiyang, Dong, Di Guo, Xiaobo Qu

TL;DR
This paper introduces a new theoretical approach to understanding how deep matrix factorization implicitly regularizes solutions, linking saddle point escaping stages to matrix rank and providing insights into the training dynamics of deep neural networks.
Contribution
It proposes a landscape analysis approach for discrete gradient dynamics, establishing a theoretical connection between saddle point escaping stages and matrix rank in deep matrix factorization.
Findings
DMF converges to second-order critical points after R saddle point escaping stages for rank-R matrices
Theoretical results are experimentally verified on low-rank matrix reconstruction
Provides new insights into implicit regularization in deep learning
Abstract
Implicit regularization is an important way to interpret neural networks. Recent theory starts to explain implicit regularization with the model of deep matrix factorization (DMF) and analyze the trajectory of discrete gradient dynamics in the optimization process. These discrete gradient dynamics are relatively small but not infinitesimal, thus fitting well with the practical implementation of neural networks. Currently, discrete gradient dynamics analysis has been successfully applied to shallow networks but encounters the difficulty of complex computation for deep networks. In this work, we introduce another discrete gradient dynamics approach to explain implicit regularization, i.e. landscape analysis. It mainly focuses on gradient regions, such as saddle points and local minima. We theoretically establish the connection between saddle point escaping (SPE) stages and the matrix rank…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Matrix Theory and Algorithms · Model Reduction and Neural Networks
