Towards Trustworthy Deception Detection: Benchmarking Model Robustness across Domains, Modalities, and Languages
Maria Glenski, Ellyn Ayton, Robin Cosbey, Dustin Arendt, and Svitlana, Volkova

TL;DR
This paper introduces a comprehensive framework for evaluating the robustness of deception detection models across different domains, modalities, and languages, highlighting their vulnerabilities and potential improvements.
Contribution
It provides a novel framework for assessing model robustness in deceptive news detection across multiple challenging conditions and offers insights into how diverse training data can improve performance.
Findings
Models perform significantly worse on out-of-domain and non-English data.
Adding image content reduces errors in multimodal models.
ELMo embeddings outperform BERT and GloVe with image inputs.
Abstract
Evaluating model robustness is critical when developing trustworthy models not only to gain deeper understanding of model behavior, strengths, and weaknesses, but also to develop future models that are generalizable and robust across expected environments a model may encounter in deployment. In this paper we present a framework for measuring model robustness for an important but difficult text classification task - deceptive news detection. We evaluate model robustness to out-of-domain data, modality-specific features, and languages other than English. Our investigation focuses on three type of models: LSTM models trained on multiple datasets(Cross-Domain), several fusion LSTM models trained with images and text and evaluated with three state-of-the-art embeddings, BERT ELMo, and GloVe (Cross-Modality), and character-level CNN models trained on multiple languages (Cross-Language). Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Deception detection and forensic psychology · Misinformation and Its Impacts
MethodsAttention Is All You Need · Linear Layer · Refunds@Expedia|||How do I get a full refund from Expedia? · Dropout · Dense Connections · Tanh Activation · Linear Warmup With Linear Decay · WordPiece · Sigmoid Activation · Attention Dropout
