Delay Differential Neural Networks
Srinivas Anumasa, P.K. Srijith

TL;DR
Delay Differential Neural Networks (DDNNs) extend neural ODEs by incorporating past feature vectors into the derivative computation, offering continuous-depth models with improved data efficiency and fewer parameters, demonstrated on image classification tasks.
Contribution
Introduction of DDNN models inspired by delay differential equations, providing a new continuous-depth neural network architecture with efficient training methods.
Findings
DDNN achieves comparable accuracy with fewer parameters.
DDNN improves data efficiency over NODEs.
Experimental validation on CIFAR datasets confirms effectiveness.
Abstract
Neural ordinary differential equations (NODEs) treat computation of intermediate feature vectors as trajectories of ordinary differential equation parameterized by a neural network. In this paper, we propose a novel model, delay differential neural networks (DDNN), inspired by delay differential equations (DDEs). The proposed model considers the derivative of the hidden feature vector as a function of the current feature vector and past feature vectors (history). The function is modelled as a neural network and consequently, it leads to continuous depth alternatives to many recent ResNet variants. We propose two different DDNN architectures, depending on the way current and past feature vectors are considered. For training DDNNs, we provide a memory-efficient adjoint method for computing gradients and back-propagate through the network. DDNN improves the data efficiency of NODE by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Neural Networks and Applications · Hydrological Forecasting Using AI
MethodsAverage Pooling · Batch Normalization · Residual Block · Residual Connection · Kaiming Initialization · Global Average Pooling · *Communicated@Fast*How Do I Communicate to Expedia? · Convolution · 1x1 Convolution · Max Pooling
