Investigating Representation Universality: Case Study on Genealogical Representations

David D. Baek; Yuxiao Li; Max Tegmark

arXiv:2410.08255 · cs.LG · November 25, 2025




TL;DR

This paper explores whether large language models universally encode graph-structured knowledge using geometric structures, providing experimental evidence across multiple models and architectures to understand their internal representations.

Contribution

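It introduces a cone probe that isolates a tree-like subspace in residual stream activations, uses activation patching to verify that subspace's causal role in answering genealogy questions, and uses model stitching to compare representations across diverse models and architectures.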

Findings

01 Identified a tree-like subspace in residual streams for genealogy questions

02 Validated causal effect of this subspace via activation patching

03 Quantified representational alignment across different models
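
The activation-patching step in the findings above can be illustrated with a toy sketch. This is not the authors' code: the function name `patch_subspace`, the hand-picked orthonormal basis, and the four-dimensional vectors are all hypothetical, chosen only to show what it means to swap the component of one activation that lies in a subspace for the corresponding component of another.

```python
def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))

def patch_subspace(h_target, h_source, basis):
    """Replace the part of h_target inside span(basis) with h_source's part.

    Assumption: `basis` is a list of orthonormal vectors spanning the
    subspace of interest (for example, one found by a probe).
    """
    patched = list(h_target)
    for b in basis:
        # Difference between the two activations along this basis direction.
        coeff = dot(b, h_source) - dot(b, h_target)
        patched = [p + coeff * bi for p, bi in zip(patched, b)]
    return patched

# Hypothetical 4-dimensional activations; the subspace is the first two axes.
basis = [[1, 0, 0, 0], [0, 1, 0, 0]]
h_target = [1, 2, 3, 4]
h_source = [5, 6, 7, 8]

patched = patch_subspace(h_target, h_source, basis)
print(patched)  # [5, 6, 3, 4]
```

In a real experiment the patched activation would be written back into the model's residual stream and the change in the model's answer measured; the sketch covers only the vector arithmetic.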

Abstract

Motivated by interpretability and reliability, we investigate whether large language models (LLMs) deploy universal geometric structures to encode discrete, graph-structured knowledge. To this end, we present two complementary pieces of experimental evidence that might support the universality of graph representations. First, on an in-context genealogy Q&A task, we train a cone probe to isolate a tree-like subspace in residual stream activations and use activation patching to verify its causal effect in answering related questions. We validate our findings across five different models. Second, we conduct model stitching experiments across models of diverse architectures and parameter counts (OPT, Pythia, Mistral, and LLaMA, 410 million to 8 billion parameters), quantifying representational alignment via relative degradation in the next-token prediction loss. Generally, we conclude that the lack of…
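
The abstract's alignment measure, relative degradation in the next-token prediction loss, can be sketched as follows. The exact formula is not given in the text above, so this sketch assumes one plausible form, (stitched loss - base loss) / base loss. The function names and toy logits are hypothetical, not taken from the paper.

```python
import math

def cross_entropy(logits, target):
    """Next-token cross-entropy in nats for a single position."""
    m = max(logits)  # subtract the max for numerical stability
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_z - logits[target]

def mean_loss(batch_logits, targets):
    """Average cross-entropy over a batch of positions."""
    losses = [cross_entropy(l, t) for l, t in zip(batch_logits, targets)]
    return sum(losses) / len(losses)

def relative_degradation(base_loss, stitched_loss):
    """Assumed definition: fractional increase in loss after stitching."""
    return (stitched_loss - base_loss) / base_loss

# Toy logits over a 3-token vocabulary for two positions (hypothetical values).
targets = [0, 2]
base_logits = [[2.0, 0.0, 0.0], [0.0, 0.0, 2.0]]
stitched_logits = [[1.0, 0.0, 0.0], [0.0, 0.0, 1.0]]

base = mean_loss(base_logits, targets)
stitched = mean_loss(stitched_logits, targets)
print(relative_degradation(base, stitched))
```

Under this assumed definition, a value near zero would mean the stitched model predicts next tokens almost as well as the unmodified one, and larger values would indicate weaker alignment between the two models' representations.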


Taxonomy

Topics: Advanced Graph Neural Networks · Topic Modeling