Latent Structure Modulation in Large Language Models Through Stochastic Concept Embedding Transitions

Stefan Whitaker; Colin Sisate; Marcel Windsor; Nikolai Fairweather; Tarquin Goldborough; Oskar Lindenfeld

arXiv:2502.05553·cs.CL·August 11, 2025

Latent Structure Modulation in Large Language Models Through Stochastic Concept Embedding Transitions

Stefan Whitaker, Colin Sisate, Marcel Windsor, Nikolai Fairweather, Tarquin Goldborough, Oskar Lindenfeld

PDF

Open Access

TL;DR

This paper introduces stochastic embedding transitions in large language models, allowing dynamic, probabilistic updates to token representations that improve diversity, coherence, and low-frequency word retention during inference.

Contribution

It proposes a novel probabilistic transition framework for embeddings, enhancing flexibility and semantic coherence in large language models compared to static approaches.

Findings

01

Increased lexical diversity and generative coherence.

02

Enhanced retention of low-frequency vocabulary.

03

Maintained efficiency with minor computational overhead.

Abstract

Stochastic embedding transitions introduce a probabilistic mechanism for adjusting token representations dynamically during inference, mitigating the constraints imposed through static or deterministic embeddings. A transition framework was proposed in which each token embedding evolved through probabilistic updates, ensuring adaptability while preserving semantic integrity across linguistic contexts. Empirical evaluations demonstrated that models incorporating stochastic transitions exhibited greater lexical diversity, improved generative coherence, and enhanced retention of low-frequency vocabulary, contributing to more varied sentence structures and reduced reliance on high-probability token selections. Statistical analyses of embedding drift across transformer layers indicated that representations evolved more flexibly without losing coherence, supporting the hypothesis that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling