A Flow Model of Neural Networks
Zhen Li, Zuoqiang Shi

TL;DR
This paper introduces a continuous flow model linking ResNet and transport equations, offering new insights into neural network design and enabling the application of differential equation tools to deep learning.
Contribution
It proposes a novel continuous flow framework for ResNet and plain networks, clarifying their relationship and underlying phenomena in deep neural networks.
Findings
ResNet can be viewed as a refinement of a plain net within the flow model
The flow model explains why 2-layer blocks are necessary in ResNets
Deeper networks benefit from the continuous flow perspective
Abstract
Based on a natural connection between ResNet and transport equation or its characteristic equation, we propose a continuous flow model for both ResNet and plain net. Through this continuous model, a ResNet can be explicitly constructed as a refinement of a plain net. The flow model provides an alternative perspective to understand phenomena in deep neural networks, such as why it is necessary and sufficient to use 2-layer blocks in ResNets, why deeper is better, and why ResNets are even deeper, and so on. It also opens a gate to bring in more tools from the huge area of differential equations.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Generative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications
MethodsAverage Pooling · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block · Global Average Pooling · Residual Block · Kaiming Initialization · Max Pooling · Residual Connection
