Adaptive Low-Rank Regularization with Damping Sequences to Restrict Lazy Weights in Deep Networks
Mohammad Mahdi Bejani, Mehdi Ghatee

TL;DR
This paper introduces Adaptive Low-Rank (ALR) regularization that selectively targets overfitting layers in deep networks by encouraging low-rank structures, improving training speed and resource efficiency.
Contribution
It proposes a novel adaptive regularization method that detects overfitting layers using condition numbers and applies low-rank factorization with damping sequences for better regularization.
Findings
ALR effectively reduces overfitting in deep networks.
ALR achieves high training speed and low resource usage.
Experimental results validate the efficiency of ALR.
Abstract
Overfitting is one of the critical problems in deep neural networks. Many regularization schemes try to prevent overfitting blindly. However, they decrease the convergence speed of training algorithms. Adaptive regularization schemes can solve overfitting more intelligently. They usually do not affect the entire network weights. This paper detects a subset of the weighting layers that cause overfitting. The overfitting recognizes by matrix and tensor condition numbers. An adaptive regularization scheme entitled Adaptive Low-Rank (ALR) is proposed that converges a subset of the weighting layers to their Low-Rank Factorization (LRF). It happens by minimizing a new Tikhonov-based loss function. ALR also encourages lazy weights to contribute to the regularization when epochs grow up. It uses a damping sequence to increment layer selection likelihood in the last generations. Thus before…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTensor decomposition and applications · Sparse and Compressive Sensing Techniques · Image and Signal Denoising Methods
