The Influence of the Memory Capacity of Neural DDEs on the Universal Approximation Property
Christian Kuehn, Sara-Viola Kuntz

TL;DR
This paper investigates how the memory capacity of Neural Delay Differential Equations (Neural DDEs) affects their ability to universally approximate functions, revealing that larger memory capacity enables universal approximation.
Contribution
It establishes the relationship between memory capacity and universal approximation in Neural DDEs, showing that increased memory capacity is necessary for universal approximation.
Findings
Small memory capacity limits Neural DDEs to non-universal behavior.
Large memory capacity enables Neural DDEs to approximate continuous functions universally.
Augmented architectures expand the parameter space for universal approximation.
Abstract
Neural Ordinary Differential Equations (Neural ODEs), which are the continuous-time analog of Residual Neural Networks (ResNets), have gained significant attention in recent years. Similarly, Neural Delay Differential Equations (Neural DDEs) can be interpreted as an infinite depth limit of Densely Connected Residual Neural Networks (DenseResNets). In contrast to traditional ResNet architectures, DenseResNets are feed-forward networks that allow for shortcut connections across all layers. These additional connections introduce memory in the network architecture, as typical in many modern architectures. In this work, we explore how the memory capacity in neural DDEs influences the universal approximation property. The key parameter for studying the memory capacity is the product of the Lipschitz constant and the delay of the DDE. In the case of non-augmented architectures, where…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Numerical methods for differential equations · Neural Networks Stability and Synchronization
MethodsSoftmax · Attention Is All You Need · Average Pooling · Global Average Pooling · Convolution · Kaiming Initialization · Max Pooling
