An induction proof of the backpropagation algorithm in matrix notation
Dirk Ostwald, Franziska Us\'ee

TL;DR
This paper provides a rigorous induction proof of the backpropagation algorithm in matrix notation, clarifying its theoretical foundation using matrix differential calculus and enhancing its didactic presentation.
Contribution
It introduces a formal induction proof of backpropagation in matrix form, addressing gaps in the theoretical justification within deep learning literature.
Findings
Validates backpropagation using matrix differential calculus
Provides a clear inductive proof framework
Demonstrates implementation in computer code
Abstract
Backpropagation (BP) is a core component of the contemporary deep learning incarnation of neural networks. Briefly, BP is an algorithm that exploits the computational architecture of neural networks to efficiently evaluate the gradient of a cost function during neural network parameter optimization. The validity of BP rests on the application of a multivariate chain rule to the computational architecture of neural networks and their associated objective functions. Introductions to deep learning theory commonly present the computational architecture of neural networks in matrix form, but eschew a parallel formulation and justification of BP in the framework of matrix differential calculus. This entails several drawbacks for the theory and didactics of deep learning. In this work, we overcome these limitations by providing a full induction proof of the BP algorithm in matrix notation.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNumerical Methods and Algorithms · Neural Networks and Applications · Algorithms and Data Compression
