A Derivation of Feedforward Neural Network Gradients Using Fr\'echet Calculus
Thomas Hamm

TL;DR
This paper introduces a compact derivation of neural network gradients using Fréchet calculus, providing a unified approach applicable to various architectures including convolutional networks.
Contribution
It presents a novel, more concise derivation of neural network gradients via Fréchet calculus, extending to complex architectures.
Findings
Derivation of neural network gradients using Fréchet calculus
Development of an efficient algorithm for gradient computation
Generalization to convolutional and other advanced architectures
Abstract
We present a derivation of the gradients of feedforward neural networks using Fr\'echet calculus which is arguably more compact than the ones usually presented in the literature. We first derive the gradients for ordinary neural networks working on vectorial data and show how these derived formulas can be used to derive a simple and efficient algorithm for calculating a neural networks gradients. Subsequently we show how our analysis generalizes to more general neural network architectures including, but not limited to, convolutional networks.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Model Reduction and Neural Networks · Advanced Numerical Analysis Techniques
