Experiments with truth using Machine Learning: Spectral analysis and   explainable classification of synthetic, false, and genuine information

Vishnu S. Pendyala; Madhulika Dutta

arXiv:2407.05464·cs.AI·July 9, 2024

Experiments with truth using Machine Learning: Spectral analysis and explainable classification of synthetic, false, and genuine information

Vishnu S. Pendyala, Madhulika Dutta

PDF

Open Access

TL;DR

This paper investigates the spectral and explainability aspects of classifying synthetic, false, and genuine information, revealing the close intertwining of misinformation with genuine data and the limitations of current ML methods.

Contribution

It provides a comprehensive spectral and explainability analysis of misinformation detection, highlighting the challenges and limitations of existing machine learning approaches.

Findings

01

ML algorithms struggle to effectively distinguish misinformation from genuine information

02

Spectral analysis reveals close similarities between false and genuine data

03

Explainability methods show intertwined features in misinformation and genuine content

Abstract

Misinformation is still a major societal problem and the arrival of Large Language Models (LLMs) only added to it. This paper analyzes synthetic, false, and genuine information in the form of text from spectral analysis, visualization, and explainability perspectives to find the answer to why the problem is still unsolved despite multiple years of research and a plethora of solutions in the literature. Various embedding techniques on multiple datasets are used to represent information for the purpose. The diverse spectral and non-spectral methods used on these embeddings include t-distributed Stochastic Neighbor Embedding (t-SNE), Principal Component Analysis (PCA), and Variational Autoencoders (VAEs). Classification is done using multiple machine learning algorithms. Local Interpretable Model-Agnostic Explanations (LIME), SHapley Additive exPlanations (SHAP), and Integrated Gradients…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications