Can lies be faked? Comparing low-stakes and high-stakes deception video datasets from a Machine Learning perspective
Mateus Karvat Camara, Adriana Postal, Tomas Henrique Maul, Gustavo, Paetzold

TL;DR
This study compares low-stakes and high-stakes deception video datasets to evaluate their differences and transferability for machine learning-based deception detection, revealing that low-stakes data can sometimes better classify high-stakes lies.
Contribution
It provides empirical evidence on the practical differences between low-stakes and high-stakes deception datasets for machine learning applications.
Findings
Low-stakes trained models better classify high-stakes deception.
Using low-stakes data for augmentation can decrease high-stakes classification accuracy.
The conceptual distinction between lie types may not fully translate to practical machine learning performance.
Abstract
Despite the great impact of lies in human societies and a meager 54% human accuracy for Deception Detection (DD), Machine Learning systems that perform automated DD are still not viable for proper application in real-life settings due to data scarcity. Few publicly available DD datasets exist and the creation of new datasets is hindered by the conceptual distinction between low-stakes and high-stakes lies. Theoretically, the two kinds of lies are so distinct that a dataset of one kind could not be used for applications for the other kind. Even though it is easier to acquire data on low-stakes deception since it can be simulated (faked) in controlled settings, these lies do not hold the same significance or depth as genuine high-stakes lies, which are much harder to obtain and hold the practical interest of automated DD systems. To investigate whether this distinction holds true from a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Advanced Malware Detection Techniques
