Enhancing Disinformation Detection with Explainable AI and Named Entity   Replacement

Santiago Gonz\'alez-Silot; Andr\'es Montoro-Montarroso; Eugenio; Mart\'inez C\'amara; Juan G\'omez-Romero

arXiv:2502.04863·cs.CL·February 10, 2025

Enhancing Disinformation Detection with Explainable AI and Named Entity Replacement

Santiago Gonz\'alez-Silot, Andr\'es Montoro-Montarroso, Eugenio, Mart\'inez C\'amara, Juan G\'omez-Romero

PDF

Open Access

TL;DR

This paper improves disinformation detection by using explainable AI to identify and remove non-informative features and replace named entities, leading to better generalization and higher accuracy on external datasets.

Contribution

It introduces a novel preprocessing approach involving named entity replacement guided by explainability methods to enhance disinformation classification models.

Findings

01

Performance on external data increased by 65.78% after preprocessing.

02

Removing URLs and emoticons improves model focus on relevant features.

03

Named entity replacement reduces model bias and improves generalization.

Abstract

The automatic detection of disinformation presents a significant challenge in the field of natural language processing. This task addresses a multifaceted societal and communication issue, which needs approaches that extend beyond the identification of general linguistic patterns through data-driven algorithms. In this research work, we hypothesise that text classification methods are not able to capture the nuances of disinformation and they often ground their decision in superfluous features. Hence, we apply a post-hoc explainability method (SHAP, SHapley Additive exPlanations) to identify spurious elements with high impact on the classification models. Our findings show that non-informative elements (e.g., URLs and emoticons) should be removed and named entities (e.g., Rwanda) should be pseudo-anonymized before training to avoid models' bias and increase their generalization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Scientific Computing and Data Management