A Derivation of Feedforward Neural Network Gradients Using Fr\'echet   Calculus

Thomas Hamm

arXiv:2209.13234·cs.LG·September 28, 2022

A Derivation of Feedforward Neural Network Gradients Using Fr\'echet Calculus

Thomas Hamm

PDF

Open Access

TL;DR

This paper introduces a compact derivation of neural network gradients using Fréchet calculus, providing a unified approach applicable to various architectures including convolutional networks.

Contribution

It presents a novel, more concise derivation of neural network gradients via Fréchet calculus, extending to complex architectures.

Findings

01

Derivation of neural network gradients using Fréchet calculus

02

Development of an efficient algorithm for gradient computation

03

Generalization to convolutional and other advanced architectures

Abstract

We present a derivation of the gradients of feedforward neural networks using Fr\'echet calculus which is arguably more compact than the ones usually presented in the literature. We first derive the gradients for ordinary neural networks working on vectorial data and show how these derived formulas can be used to derive a simple and efficient algorithm for calculating a neural networks gradients. Subsequently we show how our analysis generalizes to more general neural network architectures including, but not limited to, convolutional networks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Model Reduction and Neural Networks · Advanced Numerical Analysis Techniques