The Representation Theory of Neural Networks

Marco Antonio Armenta; Pierre-Marc Jodoin

arXiv:2007.12213·cs.LG·March 24, 2021

The Representation Theory of Neural Networks

Marco Antonio Armenta, Pierre-Marc Jodoin

PDF

TL;DR

This paper introduces a novel algebraic framework using quiver representation theory to model neural networks, providing exact mathematical descriptions and insights into their structure and data processing capabilities.

Contribution

It establishes that neural networks can be precisely represented as quiver representations, connecting neural network concepts with algebraic structures for the first time.

Findings

01

Neural networks are equivalent to quiver representations with activation functions.

02

Network quivers can model common neural network components like convolution and residual connections.

03

Neural networks map data to moduli spaces via quiver representations, revealing their algebraic structure.

Abstract

In this work, we show that neural networks can be represented via the mathematical theory of quiver representations. More specifically, we prove that a neural network is a quiver representation with activation functions, a mathematical object that we represent using a network quiver. Also, we show that network quivers gently adapt to common neural network concepts such as fully-connected layers, convolution operations, residual connections, batch normalization, pooling operations and even randomly wired neural networks. We show that this mathematical representation is by no means an approximation of what neural networks are as it exactly matches reality. This interpretation is algebraic and can be studied with algebraic methods. We also provide a quiver representation model to understand how a neural network creates representations from the data. We show that a neural network saves the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsConvolution