Fundamental Components of Deep Learning: A category-theoretic approach

Bruno Gavranovi\'c

arXiv:2403.13001·cs.LG·March 21, 2024·1 cites

Fundamental Components of Deep Learning: A category-theoretic approach

Bruno Gavranovi\'c

PDF

Open Access 1 Repo

TL;DR

This paper introduces a category-theoretic framework for deep learning, providing a unifying, compositional mathematical foundation that models neural networks, backpropagation, and learning processes.

Contribution

It develops a novel, prescriptive categorical foundation for deep learning, systematizing existing approaches and modeling key properties like parametricity and bidirectionality.

Findings

01

Categorical model of neural networks using weighted optics

02

Formalization of backpropagation and architectures within the framework

03

Unified, compositional approach to supervised learning

Abstract

Deep learning, despite its remarkable achievements, is still a young field. Like the early stages of many scientific disciplines, it is marked by the discovery of new phenomena, ad-hoc design decisions, and the lack of a uniform and compositional mathematical foundation. From the intricacies of the implementation of backpropagation, through a growing zoo of neural network architectures, to the new and poorly understood phenomena such as double descent, scaling laws or in-context learning, there are few unifying principles in deep learning. This thesis develops a novel mathematical foundation for deep learning based on the language of category theory. We develop a new framework that is a) end-to-end, b) unform, and c) not merely descriptive, but prescriptive, meaning it is amenable to direct implementation in programming languages with sufficient features. We also systematise many…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

bgavran/category_theory_machine_learning
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Processing Techniques and Applications · Cell Image Analysis Techniques