From Tokens to Thoughts: How LLMs and Humans Trade Compression for Meaning
Chen Shani, Liron Soffer, Dan Jurafsky, Yann LeCun, Ravid Shwartz-Ziv

TL;DR
This paper compares how humans and large language models balance semantic richness and compression, revealing that models prioritize efficiency over nuanced understanding, with implications for developing more human-like AI.
Contribution
It introduces an information-theoretic framework to analyze and compare human and LLM conceptual structures, highlighting fundamental differences in their representational strategies.
Findings
LLMs broadly align with human category boundaries but lack fine-grained distinctions.
LLMs compress information more aggressively than humans, sacrificing semantic richness.
Encoder models outperform decoder models in aligning with human conceptual structures.
Abstract
Humans organize knowledge into compact conceptual categories that balance compression with semantic richness. Large Language Models (LLMs) exhibit impressive linguistic abilities, but whether they navigate this same compression-meaning trade-off remains unclear. We apply an Information Bottleneck framework to compare human conceptual structure with embeddings from 40+ LLMs using classic categorization benchmarks. We find that LLMs broadly align with human category boundaries, yet fall short on fine-grained semantic distinctions. Unlike humans, who maintain ``inefficient'' representations that preserve contextual nuance, LLMs aggressively compress, achieving more optimal information-theoretic compression at the cost of semantic richness. Surprisingly, encoder models outperform much larger decoder models in human alignment, suggesting that understanding and generation rely on distinct…
Peer Reviews
Decision·ICLR 2026 Poster
- very interesting and important premise - Systematic, broad comparison covering 40+ models and also layer-wise comparisons. - Formulation/digitization of the datasets from cognitive science is a good contribution - Transparent discussion on the limitations
- The main question lies in how strongly to rely on the proposed metrics to infer “human-like” learning. The information–compression trade-off captures geometric efficiency in embedding space, but it is not clear whether this translates to human-style conceptual abstraction or reasoning. - Dependency on parameters and metrics - The authors themselves acknowledge that "architectural design and pre-training objectives significantly influence a model's ability to abstract human-like conceptual info
* The study compares a wide range of LLM architectures and sizes. * Compression and concept formation is a deep question at the intersection of cognitive science and machine learning. * This study links displines between information theory and the prototype theories of human concept learning.
* The paper takes prototype theory as a given framework for human concept representation, but this theory has been very controversial compared to Exemplar theory (Medin & Schaffer, 1978) as an alternative explanation. Modern cognitive models are usually hybrid and integrate prototype and exemplar components. The manuscript should acknowledge this longstanding debate and should not take for granted that the cognitive system for humans are prototype-like. * The paper lacks an introduction or fig
(S1) The paper offers a theoretically motivated and quantitatively explicit framework that fuses Rate–Distortion Theory and the Information Bottleneck to provide a new perspective on the compression–meaning trade-off in LLMs. The use of semantic compactness as an internal proxy for meaning fidelity is original. (S2) The empirical analysis is well-executed and leverages high-quality, historically grounded human categorization datasets. Their public release greatly facilitates reproducibility and
(W1) Although the analysis is thorough, one might consider including token-level efficiency statistics in future works, though this does not affect the validity of the current findings. (W2) The study could be further enriched by considering computational efficiency (e.g., token-level cost) as an additional axis in the compression–meaning landscape. Doing so may illuminate how efficiency interacts with semantic representation in practice. (W3) This study only scoped in English, it would be int
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsALIGN
