An Information-Geometric Framework for Stability Analysis of Large Language Models under Entropic Stress
Hikmat Karimov, Rahid Zahid Alekberli

TL;DR
This paper introduces an information-geometric, thermodynamic-inspired framework for assessing the stability of large language models under uncertainty, emphasizing internal structure and entropy effects.
Contribution
It presents a novel, interpretable stability scoring framework that integrates task utility, entropy, and internal proxies, enhancing evaluation beyond traditional accuracy metrics.
Findings
The proposed stability score outperforms baseline measures with a mean improvement of 0.0299.
Higher entropy conditions show more pronounced stability gains, indicating nonlinear attenuation of uncertainty.
Analysis across 80 scenarios and four LLMs demonstrates the framework's consistency and interpretability.
Abstract
As large language models (LLMs) are increasingly deployed in high-stakes and operational settings, evaluation strategies based solely on aggregate accuracy are often insucient to characterize system reliability. This study proposes a thermodynamic inspired modeling framework for analyzing the stability of LLM outputs under conditions of uncertainty and perturbation. The framework introduces a composite stability score that integrates task utility, entropy as a measure of external uncertainty, and two internal structural proxies: internal integration and aligned reective capacity. Rather than interpreting these quantities as physical variables, the formulation is intended as an interpretable abstraction that captures how internal structure may modulate the impact of disorder on model behavior. Using the IST-20 benchmarking protocol and associated metadata, we analyze 80 modelscenario…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
