Uncovering Hierarchical Structure in LLM Embeddings with $\delta$-Hyperbolicity, Ultrametricity, and Neighbor Joining

Prakash Chourasia; Sarwan Ali; Murray Patterson

arXiv:2512.20926·cs.CG·December 29, 2025

Uncovering Hierarchical Structure in LLM Embeddings with $\delta$-Hyperbolicity, Ultrametricity, and Neighbor Joining

Prakash Chourasia, Sarwan Ali, Murray Patterson

PDF

Open Access

TL;DR

This paper evaluates the geometric properties of large language model embeddings using $\,delta$-hyperbolicity, ultrametricity, and Neighbor Joining, revealing their hierarchical structure and correlation with task performance.

Contribution

It introduces a novel framework for analyzing LLM embeddings' geometric properties, highlighting their hierarchical and tree-like structures using three complementary metrics.

Findings

01

LLM embeddings show varying degrees of hyperbolicity and ultrametricity.

02

The geometric properties correlate with model performance.

03

Embeddings often exhibit hierarchical, tree-like organization.

Abstract

The rapid advancement of large language models (LLMs) has enabled significant strides in various fields. This paper introduces a novel approach to evaluate the effectiveness of LLM embeddings in the context of inherent geometric properties. We investigate the structural properties of these embeddings through three complementary metrics $δ$ -hyperbolicity, Ultrametricity, and Neighbor Joining. $δ$ -hyperbolicity, a measure derived from geometric group theory, quantifies how much a metric space deviates from being a tree-like structure. In contrast, ultrametricity characterizes strictly hierarchical structures where distances obey a strong triangle inequality. While Neighbor Joining quantifies how tree-like the distance relationships are, it does so specifically with respect to the tree reconstructed by the Neighbor Joining algorithm. By analyzing the embeddings generated by LLMs…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Topic Modeling · Natural Language Processing Techniques