Towards a Categorical Foundation of Deep Learning: A Survey
Francesco Riccardo Crescenzi

TL;DR
This survey explores how category theory can provide a unifying and principled mathematical framework for deep learning, addressing current theoretical gaps and reproducibility issues.
Contribution
It reviews recent work applying category theory to deep learning, proposing a structured mathematical foundation for the field.
Findings
Categorical optics model gradient-based learning
Categorical algebras link classical CS to neural networks
String diagrams visually represent neural architectures
Abstract
The unprecedented pace of machine learning research has lead to incredible advances, but also poses hard challenges. At present, the field lacks strong theoretical underpinnings, and many important achievements stem from ad hoc design choices which are hard to justify in principle and whose effectiveness often goes unexplained. Research debt is increasing and many papers are found not to be reproducible. This thesis is a survey that covers some recent work attempting to study machine learning categorically. Category theory is a branch of abstract mathematics that has found successful applications in many fields, both inside and outside mathematics. Acting as a lingua franca of mathematics and science, category theory might be able to give a unifying structure to the field of machine learning. This could solve some of the aforementioned problems. In this work, we mainly focus on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBig Data and Digital Economy · Computational and Text Analysis Methods
MethodsFocus · High-Order Consensuses
