Kan Extensions in Data Science and Machine Learning
Dan Shiebler

TL;DR
This paper explores how Kan extensions from category theory can be applied to various data science tasks such as classification, clustering, and function approximation, providing a unifying mathematical framework.
Contribution
It introduces novel applications of Kan extensions to derive algorithms for classification, clustering, and function learning in data science.
Findings
Derived a classification algorithm using Kan extension
Developed a procedure for learning clustering from labels
Demonstrated function approximation with Kan extensions
Abstract
A common problem in data science is "use this function defined over this small set to generate predictions over that larger set." Extrapolation, interpolation, statistical inference and forecasting all reduce to this problem. The Kan extension is a powerful tool in category theory that generalizes this notion. In this work we explore several applications of Kan extensions to data science. We begin by deriving a simple classification algorithm as a Kan extension and experimenting with this algorithm on real data. Next, we use the Kan extension to derive a procedure for learning clustering algorithms from labels and explore the performance of this procedure on real data. We then investigate how Kan extensions can be used to learn a general mapping from datasets of labeled examples to functions and to approximate a complex function with a simpler one.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Face and Expression Recognition · Advanced Clustering Algorithms Research
