A Rose by Any Other Name Would Smell as Sweet: Categorical Homotopy Theory for Large Language Models
Sridhar Mahadevan

TL;DR
This paper introduces a categorical homotopy framework for large language models to better understand and handle the equivalence of different linguistic expressions, leveraging advanced mathematical concepts.
Contribution
It develops a novel categorical homotopy approach using Markov categories to model and analyze language equivalences in LLMs, addressing fundamental rephrasing issues.
Findings
Introduces LLM Markov category for language probability modeling
Applies categorical homotopy to capture weak equivalences in LLMs
Connects LLM analysis with higher algebraic K-theory and model categories
Abstract
Natural language is replete with superficially different statements, such as ``Charles Darwin wrote" and ``Charles Darwin is the author of", which carry the same meaning. Large language models (LLMs) should generate the same next-token probabilities in such cases, but usually do not. Empirical workarounds have been explored, such as using k-NN estimates of sentence similarity to produce smoothed estimates. In this paper, we tackle this problem more abstractly, introducing a categorical homotopy framework for LLMs. We introduce an LLM Markov category to represent probability distributions in language generated by an LLM, where the probability of a sentence, such as ``Charles Darwin wrote" is defined by an arrow in a Markov category. However, this approach runs into difficulties as language is full of equivalent rephrases, and each generates a non-isomorphic arrow in the LLM Markov…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
