Investigating Learning in Deep Neural Networks using Layer-Wise Weight Change

Ayush Manish Agrawal, Atharva Tendle, Harshvardhan Sikka, Sahib Singh, and Amr Kayid

arXiv:2011.06735·cs.LG·December 2, 2020

TL;DR

This paper explores how different layers in deep CNNs change their weights during training, revealing that later layers tend to undergo more significant relative weight changes across various architectures and tasks.

Contribution

It introduces a method to measure per-layer weight change during training and uncovers consistent trends across multiple CNN architectures and vision tasks.

Findings

1. Later layers exhibit greater relative weight change than earlier layers.

2. Weight change patterns are consistent across different CNN architectures.

3. Insights may inform improved training strategies for deep neural networks.

Abstract

Understanding the per-layer learning dynamics of deep neural networks is of significant interest as it may provide insights into how neural networks learn and the potential for better training regimens. We investigate learning in Deep Convolutional Neural Networks (CNNs) by measuring the relative weight change of layers while training. Several interesting trends emerge in a variety of CNN architectures across various computer vision classification tasks, including the overall increase in relative weight change of later layers as compared to earlier ones.
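The abstract does not spell out how relative weight change is computed, so the following is only a minimal sketch under a stated assumption: the change for a layer is taken as the L1 norm of the difference between two consecutive weight snapshots, divided by the L1 norm of the earlier snapshot. The function names, the `eps` guard, and the dictionary-of-arrays layout are hypothetical choices made for illustration, not the authors' code.

```python
import numpy as np


def relative_weight_change(prev, curr, eps=1e-12):
    """Assumed metric: ||curr - prev||_1 / ||prev||_1 for one layer.

    `eps` is a hypothetical guard against division by zero when a
    layer's weights are all zero.
    """
    prev = np.asarray(prev, dtype=float)
    curr = np.asarray(curr, dtype=float)
    return float(np.abs(curr - prev).sum() / (np.abs(prev).sum() + eps))


def layerwise_relative_change(prev_weights, curr_weights):
    """Apply the assumed metric to every layer in two snapshots.

    Both arguments map a layer name to that layer's weight array; the
    snapshots would typically be taken one epoch apart.
    """
    return {
        name: relative_weight_change(prev_weights[name], curr_weights[name])
        for name in prev_weights
    }


# Toy snapshots: the later layer ("fc") moves more than the earlier one.
before = {"conv1": np.ones((2, 2)), "fc": np.ones(4)}
after = {"conv1": np.ones((2, 2)) * 1.1, "fc": np.ones(4) * 1.5}
changes = layerwise_relative_change(before, after)
```

Normalizing by the size of the previous weights makes layers with very different parameter counts and magnitudes comparable, which is what allows a statement such as "later layers change more than earlier ones" to be made across a whole network.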

Taxonomy

Topics: Neural Networks and Applications · Adversarial Robustness in Machine Learning · Advanced Neural Network Applications