Universal Approximation Properties for an ODENet and a ResNet: Mathematical Analysis and Numerical Experiments
Yuto Aizawa, Masato Kimura, and Kazunori Matsui

TL;DR
This paper establishes the universal approximation capabilities of ODENet and ResNet models with skip connections, providing theoretical proofs and numerical experiments demonstrating their effectiveness in approximating functions and learning tasks.
Contribution
It proves the universal approximation property for ODENet and ResNet models, including explicit gradient derivation and a practical learning algorithm.
Findings
ODENet of width n+m can approximate any continuous function on compact sets.
ResNet has the same approximation property as depth tends to infinity.
The proposed learning algorithm performs well on MNIST classification tasks.
Abstract
We prove a universal approximation property (UAP) for a class of ODENet and a class of ResNet, which are simplified mathematical models for deep learning systems with skip connections. The UAP can be stated as follows. Let and be the dimension of input and output data, and assume . Then we show that ODENet of width with any non-polynomial continuous activation function can approximate any continuous function on a compact subset on . We also show that ResNet has the same property as the depth tends to infinity. Furthermore, we derive the gradient of a loss function explicitly with respect to a certain tuning variable. We use this to construct a learning algorithm for ODENet. To demonstrate the usefulness of this algorithm, we apply it to a regression problem, a binary classification, and a multinomial classification in MNIST.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Stochastic Gradient Optimization Techniques · Machine Learning and Algorithms
MethodsAverage Pooling · Max Pooling · Kaiming Initialization · 1x1 Convolution · Batch Normalization · *Communicated@Fast*How Do I Communicate to Expedia? · Residual Connection · Bottleneck Residual Block · Convolution · Residual Block
