Fundamental Components of Deep Learning: A category-theoretic approach
Bruno Gavranovi\'c

TL;DR
This paper introduces a category-theoretic framework for deep learning, providing a unifying, compositional mathematical foundation that models neural networks, backpropagation, and learning processes.
Contribution
It develops a novel, prescriptive categorical foundation for deep learning, systematizing existing approaches and modeling key properties like parametricity and bidirectionality.
Findings
Categorical model of neural networks using weighted optics
Formalization of backpropagation and architectures within the framework
Unified, compositional approach to supervised learning
Abstract
Deep learning, despite its remarkable achievements, is still a young field. Like the early stages of many scientific disciplines, it is marked by the discovery of new phenomena, ad-hoc design decisions, and the lack of a uniform and compositional mathematical foundation. From the intricacies of the implementation of backpropagation, through a growing zoo of neural network architectures, to the new and poorly understood phenomena such as double descent, scaling laws or in-context learning, there are few unifying principles in deep learning. This thesis develops a novel mathematical foundation for deep learning based on the language of category theory. We develop a new framework that is a) end-to-end, b) unform, and c) not merely descriptive, but prescriptive, meaning it is amenable to direct implementation in programming languages with sufficient features. We also systematise many…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing Techniques and Applications · Cell Image Analysis Techniques
