Exploiting Explainable Metrics for Augmented SGD

Mahdi S. Hosseini; Mathieu Tuli; Konstantinos N. Plataniotis

arXiv:2203.16723·cs.LG·April 1, 2022

Exploiting Explainable Metrics for Augmented SGD

Mahdi S. Hosseini, Mathieu Tuli, Konstantinos N. Plataniotis

PDF

Open Access 2 Repos

TL;DR

This paper introduces explainability metrics based on low-rank factorization to assess layer-wise learning quality in deep neural networks, and uses these metrics to adaptively enhance SGD, resulting in improved generalization with minimal overhead.

Contribution

The paper proposes novel explainability metrics for neural network layers and leverages them to augment SGD with adaptive layer-wise learning rates, improving generalization.

Findings

01

RMSGD outperforms state-of-the-art methods in generalization.

02

Metrics strongly correlate with generalization performance.

03

Minimal additional computational cost.

Abstract

Explaining the generalization characteristics of deep learning is an emerging topic in advanced machine learning. There are several unanswered questions about how learning under stochastic optimization really works and why certain strategies are better than others. In this paper, we address the following question: \textit{can we probe intermediate layers of a deep neural network to identify and quantify the learning quality of each layer?} With this question in mind, we propose new explainability metrics that measure the redundant information in a network's layers using a low-rank factorization framework and quantify a complexity measure that is highly correlated with the generalization performance of a given optimizer, network, and dataset. We subsequently exploit these metrics to augment the Stochastic Gradient Descent (SGD) optimizer by adaptively adjusting the learning rate in each…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis · Machine Learning and Data Classification

MethodsStochastic Gradient Descent