Multimodal Behavioral Sensors for Lie Detection: Integrating Visual, Auditory, and Generative Reasoning Cues
Daniel Grabowski, Kamila Łuczaj, Khalid Saeed

TL;DR
This paper introduces a lie detection system that combines visual, audio, and language cues to improve accuracy and explainability in deception analysis.
Contribution
A novel multimodal framework using ViViT, HuBERT, and GPT-5 for interpretable lie detection with chain-of-thought reasoning.
Findings
The ViViT-based visual model achieved 74.4% accuracy in detecting deception.
Multimodal fusion and CoT-based reasoning improved classification accuracy and interpretability.
GPT-5-based prompt-level fusion enabled zero-shot inference and explainable AI outputs.
Abstract
What are the main findings? A multimodal deception detection framework combining visual, audio, and language-based reasoning achieved high accuracy on a DOLOS dataset.The ViViT-based visual model reached 74.4% accuracy, while HuBERT audio classification showed strong performance on prosodic cues. A multimodal deception detection framework combining visual, audio, and language-based reasoning achieved high accuracy on a DOLOS dataset. The ViViT-based visual model reached 74.4% accuracy, while HuBERT audio classification showed strong performance on prosodic cues. What is the implication of the main finding? Multimodal fusion enhances robustness and interpretability in behavioral biometrics for deception analysis.Language-guided models like GPT-5 prompt-level fusion provide explainable AI outputs, facilitating trust and real-world applicability. Multimodal fusion enhances robustness…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDeception detection and forensic psychology · Anomaly Detection Techniques and Applications · Emotion and Mood Recognition
