Opening the black box of deep learning
Dian Lei, Xiaoxiao Chen, Jianfei Zhao

TL;DR
This paper explores the internal mechanisms of deep learning by viewing neural networks as physical systems, applying physics principles to explain core processes and limitations, and proposing a physics-based theoretical foundation for future development.
Contribution
It introduces a physics-based perspective on deep learning, providing novel explanations for neural network operations and limitations using principles from quantum mechanics and statistical physics.
Findings
Calculation methods for convolution, pooling, and normalization derived from physics
Explanation of why deep learning models must be deep
Insights into why CNNs do not need layer-by-layer training
Abstract
The great success of deep learning shows that its technology contains profound truth, and understanding its internal mechanism not only has important implications for the development of its technology and effective application in various fields, but also provides meaningful insights into the understanding of human brain mechanism. At present, most of the theoretical research on deep learning is based on mathematics. This dissertation proposes that the neural network of deep learning is a physical system, examines deep learning from three different perspectives: microscopic, macroscopic, and physical world views, answers multiple theoretical puzzles in deep learning by using physics principles. For example, from the perspective of quantum mechanics and statistical physics, this dissertation presents the calculation methods for convolution calculation, pooling, normalization, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Anomaly Detection Techniques and Applications · Time Series Analysis and Forecasting
