Recovering manifold structure in LLM responses through a joint Euclidean mirror
Maximilian Baum, Aranyak Acharyya, Tianyi Chen, Avanti Athreya, Youngser Park, Francesco Sanna Passino, Carey E. Priebe, and Zachary Lubberts

TL;DR
This paper introduces a geometric framework to analyze and compare large language model responses across different tuning parameters by embedding response distributions into a low-dimensional Euclidean space.
Contribution
It develops a mathematical method to represent response distribution dissimilarities in Euclidean space and proposes estimation procedures with proven asymptotic properties.
Findings
Response distributions vary along distinct directions in the embedding space.
The method enables estimation of tuning parameters from response samples.
Experimental results show the approach effectively captures model response geometry.
Abstract
Understanding the behavior of black-box large language models and determining effective means of comparing their performance is a key task in modern machine learning. We consider how large language models respond to a specific query by analyzing how the distributions of responses vary over different values of tuning parameters. We frame this problem in a general mathematical setting, treating the mapping from model parameters to response distributions as a structured family of probability measures, endowed with a geometry via a dissimilarity measure. We show how dissimilarities between response distributions can be represented in low-dimensional Euclidean space through a joint Euclidean mirror surface encoding the underlying geometry, which permits both qualitative and quantitative analysis of large language models and provides insight into predicting response distributions for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
