A Rose by Any Other Name Would Smell as Sweet: Categorical Homotopy Theory for Large Language Models

Sridhar Mahadevan

arXiv:2508.10018·cs.CL·August 15, 2025

A Rose by Any Other Name Would Smell as Sweet: Categorical Homotopy Theory for Large Language Models

Sridhar Mahadevan

PDF

TL;DR

This paper introduces a categorical homotopy framework for large language models to better understand and handle the equivalence of different linguistic expressions, leveraging advanced mathematical concepts.

Contribution

It develops a novel categorical homotopy approach using Markov categories to model and analyze language equivalences in LLMs, addressing fundamental rephrasing issues.

Findings

01

Introduces LLM Markov category for language probability modeling

02

Applies categorical homotopy to capture weak equivalences in LLMs

03

Connects LLM analysis with higher algebraic K-theory and model categories

Abstract

Natural language is replete with superficially different statements, such as ``Charles Darwin wrote" and ``Charles Darwin is the author of", which carry the same meaning. Large language models (LLMs) should generate the same next-token probabilities in such cases, but usually do not. Empirical workarounds have been explored, such as using k-NN estimates of sentence similarity to produce smoothed estimates. In this paper, we tackle this problem more abstractly, introducing a categorical homotopy framework for LLMs. We introduce an LLM Markov category to represent probability distributions in language generated by an LLM, where the probability of a sentence, such as ``Charles Darwin wrote" is defined by an arrow in a Markov category. However, this approach runs into difficulties as language is full of equivalent rephrases, and each generates a non-isomorphic arrow in the LLM Markov…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.