Towards Trustworthy Deception Detection: Benchmarking Model Robustness   across Domains, Modalities, and Languages

Maria Glenski; Ellyn Ayton; Robin Cosbey; Dustin Arendt; and Svitlana; Volkova

arXiv:2104.11761·cs.CL·April 27, 2021

Towards Trustworthy Deception Detection: Benchmarking Model Robustness across Domains, Modalities, and Languages

Maria Glenski, Ellyn Ayton, Robin Cosbey, Dustin Arendt, and Svitlana, Volkova

PDF

Open Access

TL;DR

This paper introduces a comprehensive framework for evaluating the robustness of deception detection models across different domains, modalities, and languages, highlighting their vulnerabilities and potential improvements.

Contribution

It provides a novel framework for assessing model robustness in deceptive news detection across multiple challenging conditions and offers insights into how diverse training data can improve performance.

Findings

01

Models perform significantly worse on out-of-domain and non-English data.

02

Adding image content reduces errors in multimodal models.

03

ELMo embeddings outperform BERT and GloVe with image inputs.

Abstract

Evaluating model robustness is critical when developing trustworthy models not only to gain deeper understanding of model behavior, strengths, and weaknesses, but also to develop future models that are generalizable and robust across expected environments a model may encounter in deployment. In this paper we present a framework for measuring model robustness for an important but difficult text classification task - deceptive news detection. We evaluate model robustness to out-of-domain data, modality-specific features, and languages other than English. Our investigation focuses on three type of models: LSTM models trained on multiple datasets(Cross-Domain), several fusion LSTM models trained with images and text and evaluated with three state-of-the-art embeddings, BERT ELMo, and GloVe (Cross-Modality), and character-level CNN models trained on multiple languages (Cross-Language). Our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Deception detection and forensic psychology · Misinformation and Its Impacts

MethodsAttention Is All You Need · Linear Layer · Refunds@Expedia|||How do I get a full refund from Expedia? · Dropout · Dense Connections · Tanh Activation · Linear Warmup With Linear Decay · WordPiece · Sigmoid Activation · Attention Dropout