Accuracy is Not Enough: Poisoning Interpretability in Federated Learning via Color Skew

Farhin Farhad Riya; Shahinul Hoque; Jinyuan Stella Sun; Olivera Kotevska

arXiv:2511.13535·cs.CV·November 19, 2025

Accuracy is Not Enough: Poisoning Interpretability in Federated Learning via Color Skew

Farhin Farhad Riya, Shahinul Hoque, Jinyuan Stella Sun, Olivera Kotevska

PDF

Open Access

TL;DR

This paper uncovers a novel attack in federated learning where small color perturbations can mislead interpretability tools without affecting model accuracy, exposing a new security vulnerability.

Contribution

It introduces the Chromatic Perturbation Module, a systematic method for poisoning explanations in federated learning through subtle color changes.

Findings

01

Saliency maps can be significantly distorted without accuracy loss.

02

Standard defenses are ineffective against color-based explanation attacks.

03

The attack reduces explanation fidelity by up to 35%.

Abstract

As machine learning models are increasingly deployed in safety-critical domains, visual explanation techniques have become essential tools for supporting transparency. In this work, we reveal a new class of attacks that compromise model interpretability without affecting accuracy. Specifically, we show that small color perturbations applied by adversarial clients in a federated learning setting can shift a model's saliency maps away from semantically meaningful regions while keeping the prediction unchanged. The proposed saliency-aware attack framework, called Chromatic Perturbation Module, systematically crafts adversarial examples by altering the color contrast between foreground and background in a way that disrupts explanation fidelity. These perturbations accumulate across training rounds, poisoning the global model's internal feature attributions in a stealthy and persistent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Ethics and Social Impacts of AI