Trust Oriented Explainable AI for Fake News Detection

Krzysztof Siwek; Daniel Stankowski; Maciej Stodolski

arXiv:2603.11778·cs.CL·March 13, 2026

Trust Oriented Explainable AI for Fake News Detection

Krzysztof Siwek, Daniel Stankowski, Maciej Stodolski

PDF

Open Access

TL;DR

This paper explores how explainable AI techniques like SHAP, LIME, and Integrated Gradients improve transparency and trust in NLP-based fake news detection models, balancing interpretability with accuracy.

Contribution

It compares multiple XAI methods in fake news detection, highlighting their strengths, limitations, and impact on model transparency and trustworthiness.

Findings

01

XAI enhances model interpretability without sacrificing accuracy

02

Different XAI methods offer unique explanatory insights

03

Computational cost and parameter sensitivity are notable limitations

Abstract

This article examines the application of Explainable Artificial Intelligence (XAI) in NLP based fake news detection and compares selected interpretability methods. The work outlines key aspects of disinformation, neural network architectures, and XAI techniques, with a focus on SHAP, LIME, and Integrated Gradients. In the experimental study, classification models were implemented and interpreted using these methods. The results show that XAI enhances model transparency and interpretability while maintaining high detection accuracy. Each method provides distinct explanatory value: SHAP offers detailed local attributions, LIME provides simple and intuitive explanations, and Integrated Gradients performs efficiently with convolutional models. The study also highlights limitations such as computational cost and sensitivity to parameterization. Overall, the findings demonstrate that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Misinformation and Its Impacts · Adversarial Robustness in Machine Learning