Representations Matter: Embedding Modes of Large Language Models using Dynamic Mode Decomposition
Mohamed Akrout

TL;DR
This paper introduces a novel method using dynamic mode decomposition to analyze and identify hallucinations in large language models by examining the spectral properties of text embeddings, revealing differences between generated and ground-truth texts.
Contribution
The paper applies dynamic mode decomposition to LLM embeddings, providing a new way to detect hallucinations based on spectral analysis of embedding patterns.
Findings
Hallucinated text embeddings have a low-rank spectrum.
Generated text shows fewer dominant modes compared to ground-truth.
Hallucinations are linked to rapid decay of embedding eigenmodes.
Abstract
Existing large language models (LLMs) are known for generating "hallucinated" content, namely a fabricated text of plausibly looking, yet unfounded, facts. To identify when these hallucination scenarios occur, we examine the properties of the generated text in the embedding space. Specifically, we draw inspiration from the dynamic mode decomposition (DMD) tool in analyzing the pattern evolution of text embeddings across sentences. We empirically demonstrate how the spectrum of sentence embeddings over paragraphs is constantly low-rank for the generated text, unlike that of the ground-truth text. Importantly, we find that evaluation cases having LLM hallucinations correspond to ground-truth embedding patterns with a higher number of modes being poorly approximated by the few modes associated with LLM embedding patterns. In analogy to near-field electromagnetic evanescent waves, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Computational and Text Analysis Methods · Language and cultural evolution
