Neural Network Architecture Beyond Width and Depth
Zuowei Shen, Haizhao Yang, Shijun Zhang

TL;DR
This paper introduces three-dimensional neural network architectures called NestNets, which significantly enhance expressiveness and approximation capabilities over traditional two-dimensional networks by adding a height dimension.
Contribution
The paper proposes a novel three-dimensional neural network architecture called NestNet, demonstrating its superior approximation power and expressiveness compared to standard networks.
Findings
NestNets with height s can approximate Lipschitz functions with error O(n^{-(s+1)/d})
NestNets outperform standard networks in approximation accuracy
Numerical experiments confirm the theoretical advantages of NestNets
Abstract
This paper proposes a new neural network architecture by introducing an additional dimension called height beyond width and depth. Neural network architectures with height, width, and depth as hyper-parameters are called three-dimensional architectures. It is shown that neural networks with three-dimensional architectures are significantly more expressive than the ones with two-dimensional architectures (those with only width and depth as hyper-parameters), e.g., standard fully connected networks. The new network architecture is constructed recursively via a nested structure, and hence we call a network with the new architecture nested network (NestNet). A NestNet of height is built with each hidden neuron activated by a NestNet of height . When , a NestNet degenerates to a standard network with a two-dimensional architecture. It is proved by construction that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNeural Networks and Applications · Stochastic Gradient Optimization Techniques · Model Reduction and Neural Networks
