LaFA: Latent Feature Attacks on Non-negative Matrix Factorization

Minh Vu; Ben Nebgen; Erik Skau; Geigh Zollicoffer; Juan Castorena; Kim; Rasmussen; Boian Alexandrov; Manish Bhattarai

arXiv:2408.03909·cs.LG·August 8, 2024

LaFA: Latent Feature Attacks on Non-negative Matrix Factorization

Minh Vu, Ben Nebgen, Erik Skau, Geigh Zollicoffer, Juan Castorena, Kim, Rasmussen, Boian Alexandrov, Manish Bhattarai

PDF

Open Access

TL;DR

This paper introduces Latent Feature Attacks (LaFA), a novel method to manipulate the latent features of Non-negative Matrix Factorization (NMF), revealing vulnerabilities and demonstrating the effectiveness of gradient-based adversarial attacks on unsupervised learning.

Contribution

The paper proposes a new class of attacks called LaFA that target NMF's latent features using FE loss and introduces an implicit differentiation technique to scale attacks to larger datasets.

Findings

01

LaFA significantly alters NMF latent features with small perturbations.

02

FE loss-based attacks outperform naive methods in effectiveness.

03

NMF vulnerabilities are validated on synthetic and real-world data.

Abstract

As Machine Learning (ML) applications rapidly grow, concerns about adversarial attacks compromising their reliability have gained significant attention. One unsupervised ML method known for its resilience to such attacks is Non-negative Matrix Factorization (NMF), an algorithm that decomposes input data into lower-dimensional latent features. However, the introduction of powerful computational tools such as Pytorch enables the computation of gradients of the latent features with respect to the original data, raising concerns about NMF's reliability. Interestingly, naively deriving the adversarial loss for NMF as in the case of ML would result in the reconstruction loss, which can be shown theoretically to be an ineffective attacking objective. In this work, we introduce a novel class of attacks in NMF termed Latent Feature Attacks (LaFA), which aim to manipulate the latent features…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Hate Speech and Cyberbullying Detection