Analytical Solution of a Three-layer Network with a Matrix Exponential Activation Function
Kuo Gai, Shihua Zhang

TL;DR
This paper derives an analytical solution for a three-layer neural network with a matrix exponential activation, highlighting the theoretical advantages of depth and non-linear activations over shallow networks.
Contribution
It provides the first analytical solution for a three-layer network with matrix exponential activation, demonstrating the theoretical power of depth and non-linearity.
Findings
Three-layer network with matrix exponential activation has analytical solutions.
Depth and non-linear activation enhance the network's solving power.
One-layer networks can only solve linear equations.
Abstract
In practice, deeper networks tend to be more powerful than shallow ones, but this has not been understood theoretically. In this paper, we find the analytical solution of a three-layer network with a matrix exponential activation function, i.e., have analytical solutions for the equations for with only invertible assumptions. Our proof shows the power of depth and the use of a non-linear activation function, since one layer network can only solve one equation,i.e.,.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemiconductor Lasers and Optical Devices · Neural Networks and Applications · Advanced Research in Systems and Signal Processing
