Neural network layers as parametric spans

Mattia G. Bergomi; Pietro Vertechi

arXiv:2208.00809·math.CT·September 7, 2022

Neural network layers as parametric spans

Mattia G. Bergomi, Pietro Vertechi

PDF

Open Access

TL;DR

This paper introduces a categorical framework for neural network layers using parametric spans, providing a unified mathematical foundation that generalizes classical layers and ensures differentiability for backpropagation.

Contribution

It presents a novel, general definition of neural network layers based on integration theory and parametric spans, unifying classical layers within a rigorous mathematical framework.

Findings

01

Generalized layer definition encompasses dense and convolutional layers.

02

Guarantees existence and computability of derivatives for backpropagation.

03

Provides a mathematically rigorous foundation for neural network layer design.

Abstract

Properties such as composability and automatic differentiation made artificial neural networks a pervasive tool in applications. Tackling more challenging problems caused neural networks to progressively become more complex and thus difficult to define from a mathematical perspective. We present a general definition of linear layer arising from a categorical framework based on the notions of integration theory and parametric spans. This definition generalizes and encompasses classical layers (e.g., dense, convolutional), while guaranteeing existence and computability of the layer's derivatives for backpropagation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

MethodsLinear Layer