A simple theory for training response of deep neural networks
Kenichi Nakazato

TL;DR
This paper introduces a simple theoretical model to analyze the training response of deep neural networks, revealing factors influencing training dynamics and the emergence of network fragility.
Contribution
It proposes a minimalistic network model to bridge gaps in understanding training responses and complex phenomena like fragility in deep learning.
Findings
Training response varies with training stages, activation functions, and methods.
Stochastic training dynamics lead to feature space reduction.
Feature space reduction can cause network fragility.
Abstract
Deep neural networks give us a powerful method to model the training dataset's relationship between input and output. We can regard that as a complex adaptive system consisting of many artificial neurons that work as an adaptive memory as a whole. The network's behavior is training dynamics with a feedback loop from the evaluation of the loss function. We already know the training response can be constant or shows power law-like aging in some ideal situations. However, we still have gaps between those findings and other complex phenomena, like network fragility. To fill the gap, we introduce a very simple network and analyze it. We show the training response consists of some different factors based on training stages, activation functions, or training methods. In addition, we show feature space reduction as an effect of stochastic training dynamics, which can result in network…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
