An induction proof of the backpropagation algorithm in matrix notation

Dirk Ostwald; Franziska Us\'ee

arXiv:2107.09384·stat.ML·July 21, 2021

An induction proof of the backpropagation algorithm in matrix notation

Dirk Ostwald, Franziska Us\'ee

PDF

Open Access

TL;DR

This paper provides a rigorous induction proof of the backpropagation algorithm in matrix notation, clarifying its theoretical foundation using matrix differential calculus and enhancing its didactic presentation.

Contribution

It introduces a formal induction proof of backpropagation in matrix form, addressing gaps in the theoretical justification within deep learning literature.

Findings

01

Validates backpropagation using matrix differential calculus

02

Provides a clear inductive proof framework

03

Demonstrates implementation in computer code

Abstract

Backpropagation (BP) is a core component of the contemporary deep learning incarnation of neural networks. Briefly, BP is an algorithm that exploits the computational architecture of neural networks to efficiently evaluate the gradient of a cost function during neural network parameter optimization. The validity of BP rests on the application of a multivariate chain rule to the computational architecture of neural networks and their associated objective functions. Introductions to deep learning theory commonly present the computational architecture of neural networks in matrix form, but eschew a parallel formulation and justification of BP in the framework of matrix differential calculus. This entails several drawbacks for the theory and didactics of deep learning. In this work, we overcome these limitations by providing a full induction proof of the BP algorithm in matrix notation.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNumerical Methods and Algorithms · Neural Networks and Applications · Algorithms and Data Compression