Adaptive Low-Rank Regularization with Damping Sequences to Restrict Lazy   Weights in Deep Networks

Mohammad Mahdi Bejani; Mehdi Ghatee

arXiv:2106.09677·cs.LG·June 18, 2021

Adaptive Low-Rank Regularization with Damping Sequences to Restrict Lazy Weights in Deep Networks

Mohammad Mahdi Bejani, Mehdi Ghatee

PDF

Open Access

TL;DR

This paper introduces Adaptive Low-Rank (ALR) regularization that selectively targets overfitting layers in deep networks by encouraging low-rank structures, improving training speed and resource efficiency.

Contribution

It proposes a novel adaptive regularization method that detects overfitting layers using condition numbers and applies low-rank factorization with damping sequences for better regularization.

Findings

01

ALR effectively reduces overfitting in deep networks.

02

ALR achieves high training speed and low resource usage.

03

Experimental results validate the efficiency of ALR.

Abstract

Overfitting is one of the critical problems in deep neural networks. Many regularization schemes try to prevent overfitting blindly. However, they decrease the convergence speed of training algorithms. Adaptive regularization schemes can solve overfitting more intelligently. They usually do not affect the entire network weights. This paper detects a subset of the weighting layers that cause overfitting. The overfitting recognizes by matrix and tensor condition numbers. An adaptive regularization scheme entitled Adaptive Low-Rank (ALR) is proposed that converges a subset of the weighting layers to their Low-Rank Factorization (LRF). It happens by minimizing a new Tikhonov-based loss function. ALR also encourages lazy weights to contribute to the regularization when epochs grow up. It uses a damping sequence to increment layer selection likelihood in the last generations. Thus before…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTensor decomposition and applications · Sparse and Compressive Sensing Techniques · Image and Signal Denoising Methods