Enhancing Lie Detection Accuracy: A Comparative Study of Classic ML,   CNN, and GCN Models using Audio-Visual Features

Abdelrahman Abdelwahab; Akshaj Vishnubhatla; Ayaan Vaswani; Advait; Bharathulwar; Arnav Kommaraju

arXiv:2411.08885·cs.MM·November 18, 2024

Enhancing Lie Detection Accuracy: A Comparative Study of Classic ML, CNN, and GCN Models using Audio-Visual Features

Abdelrahman Abdelwahab, Akshaj Vishnubhatla, Ayaan Vaswani, Advait, Bharathulwar, Arnav Kommaraju

PDF

Open Access

TL;DR

This study compares classic machine learning, CNN, and GCN models using audio-visual features to improve non-invasive lie detection accuracy, achieving up to 95.4% accuracy with a multimodal transformer approach.

Contribution

It introduces a multimodal transformer architecture combining audio, visual, and gesture features for enhanced lie detection, advancing beyond previous micro-expression based models.

Findings

01

CNN model achieved 95.4% accuracy

02

Multimodal features improved detection reliability

03

Further research needed for dataset quality and generalization

Abstract

Inaccuracies in polygraph tests often lead to wrongful convictions, false information, and bias, all of which have significant consequences for both legal and political systems. Recently, analyzing facial micro-expressions has emerged as a method for detecting deception; however, current models have not reached high accuracy and generalizability. The purpose of this study is to aid in remedying these problems. The unique multimodal transformer architecture used in this study improves upon previous approaches by using auditory inputs, visual facial micro-expressions, and manually transcribed gesture annotations, moving closer to a reliable non-invasive lie detection model. Visual and auditory features were extracted using the Vision Transformer and OpenSmile models respectively, which were then concatenated with the transcriptions of participants micro-expressions and gestures. Various…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Digital and Cyber Forensics · Deception detection and forensic psychology

MethodsAttention Is All You Need · Absolute Position Encodings · Label Smoothing · Adam · Residual Connection · Softmax · Linear Layer · Dropout · Vision Transformer · Layer Normalization