When lies are mostly truthful: automated verbal deception detection for embedded lies
Riccardo Loconte, Bennett Kleinberg

TL;DR
This paper introduces a new dataset and demonstrates that a fine-tuned language model can detect embedded lies within truthful statements with 64% accuracy, highlighting the complexity of verbal deception detection.
Contribution
The study provides a novel dataset of embedded lies and shows that current language models can classify deceptive statements with moderate accuracy, advancing research in realistic deception detection.
Findings
Language model achieves 64% accuracy in detecting embedded lies.
Embedded lies closely resemble truthful statements linguistically.
Most deceptive statements contain about 1/3 embedded lies.
Abstract
Background: Verbal deception detection research relies on narratives and commonly assumes statements as truthful or deceptive. A more realistic perspective acknowledges that the veracity of statements exists on a continuum with truthful and deceptive parts being embedded within the same statement. However, research on embedded lies has been lagging behind. Methods: We collected a novel dataset of 2,088 truthful and deceptive statements with annotated embedded lies. Using a within-subjects design, participants provided a truthful account of an autobiographical event. They then rewrote their statement in a deceptive manner by including embedded lies, which they highlighted afterwards and judged on lie centrality, deceptiveness, and source. Results: We show that a fined-tuned language model (Llama-3-8B) can classify truthful statements and those containing embedded lies with 64% accuracy.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
