Semantic Delta: An Interpretable Signal Differentiating Human and LLMs Dialogue
Riccardo Scantamburlo, Mauro Mezzanzana, Giacomo Buonanno, Francesco Bertolotti

TL;DR
This paper introduces a simple, interpretable metric called semantic delta that effectively distinguishes between human and LLM-generated dialogue based on thematic concentration, showing that AI outputs tend to be more thematically rigid.
Contribution
The work presents a novel, lightweight semantic delta metric derived from Empath analysis to differentiate human and LLM dialogues, enhancing detection methods.
Findings
AI texts have higher semantic delta than human texts.
Human dialogue shows broader semantic distribution.
Semantic delta can complement existing detection techniques.
Abstract
Do LLMs talk like us? This question intrigues a multitude of scholar and it is relevant in many fields, from education to academia. This work presents an interpretable statistical feature for distinguishing human written and LLMs generated dialogue. We introduce a lightweight metric derived from semantic categories distribution. Using the Empath lexical analysis framework, each text is mapped to a set of thematic intensity scores. We define semantic delta as the difference between the two most dominant category intensities within a dialogue, hypothesizing that LLM outputs exhibit stronger thematic concentration than human discourse. To evaluate this hypothesis, conversational data were generated from multiple LLM configurations and compared against heterogeneous human corpora, including scripted dialogue, literary works, and online discussions. A Welch t-test was applied to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Authorship Attribution and Profiling · Text Readability and Simplification
