Are the Hidden States Hiding Something? Testing the Limits of Factuality-Encoding Capabilities in LLMs

Giovanni Servedio; Alessandro De Bellis; Dario Di Palma; Vito Walter Anelli; Tommaso Di Noia

arXiv:2505.16520·cs.CL·June 2, 2025

Are the Hidden States Hiding Something? Testing the Limits of Factuality-Encoding Capabilities in LLMs

Giovanni Servedio, Alessandro De Bellis, Dario Di Palma, Vito Walter Anelli, Tommaso Di Noia

PDF

Open Access 1 Video

TL;DR

This paper investigates whether internal states of LLMs encode truthfulness, using more realistic datasets, and finds that generalization remains challenging, highlighting the need for improved factuality evaluation methods.

Contribution

It introduces new methods for sampling and generating realistic true-false datasets from tabular and QA data, challenging prior synthetic dataset-based findings.

Findings

01

Partial validation of previous results

02

Generalization to LLM-generated datasets is difficult

03

Provides practical guidelines for factuality evaluation

Abstract

Factual hallucinations are a major challenge for Large Language Models (LLMs). They undermine reliability and user trust by generating inaccurate or fabricated content. Recent studies suggest that when generating false statements, the internal states of LLMs encode information about truthfulness. However, these studies often rely on synthetic datasets that lack realism, which limits generalization when evaluating the factual accuracy of text generated by the model itself. In this paper, we challenge the findings of previous work by investigating truthfulness encoding capabilities, leading to the generation of a more realistic and challenging dataset. Specifically, we extend previous work by introducing: (1) a strategy for sampling plausible true-false factoid sentences from tabular data and (2) a procedure for generating realistic, LLM-dependent true-false datasets from Question…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Are the Hidden States Hiding Something? Testing the Limits of Factuality-Encoding Capabilities in LLMs· underline

Taxonomy

TopicsArtificial Intelligence in Law