Towards a Categorical Foundation of Deep Learning: A Survey

Francesco Riccardo Crescenzi

arXiv:2410.05353·cs.LG·October 16, 2024

Towards a Categorical Foundation of Deep Learning: A Survey

Francesco Riccardo Crescenzi

PDF

Open Access

TL;DR

This survey explores how category theory can provide a unifying and principled mathematical framework for deep learning, addressing current theoretical gaps and reproducibility issues.

Contribution

It reviews recent work applying category theory to deep learning, proposing a structured mathematical foundation for the field.

Findings

01

Categorical optics model gradient-based learning

02

Categorical algebras link classical CS to neural networks

03

String diagrams visually represent neural architectures

Abstract

The unprecedented pace of machine learning research has lead to incredible advances, but also poses hard challenges. At present, the field lacks strong theoretical underpinnings, and many important achievements stem from ad hoc design choices which are hard to justify in principle and whose effectiveness often goes unexplained. Research debt is increasing and many papers are found not to be reproducible. This thesis is a survey that covers some recent work attempting to study machine learning categorically. Category theory is a branch of abstract mathematics that has found successful applications in many fields, both inside and outside mathematics. Acting as a lingua franca of mathematics and science, category theory might be able to give a unifying structure to the field of machine learning. This could solve some of the aforementioned problems. In this work, we mainly focus on the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBig Data and Digital Economy · Computational and Text Analysis Methods

MethodsFocus · High-Order Consensuses