EvaLDA: Efficient Evasion Attacks Towards Latent Dirichlet Allocation

Qi Zhou; Haipeng Chen; Yitao Zheng; Zhen Wang

arXiv:2012.04864·cs.LG·April 13, 2021

EvaLDA: Efficient Evasion Attacks Towards Latent Dirichlet Allocation

Qi Zhou, Haipeng Chen, Yitao Zheng, Zhen Wang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper investigates the vulnerability of Latent Dirichlet Allocation (LDA) models to adversarial evasion attacks, formalizes the attack as an NP-hard optimization problem, and proposes an efficient algorithm called EvaLDA, demonstrating its effectiveness through empirical evaluations.

Contribution

The paper introduces EvaLDA, a novel efficient algorithm for evasion attacks on LDA, and provides the first formalization and empirical analysis of LDA's security vulnerabilities.

Findings

01

EvaLDA can significantly increase the rank of a target topic with minimal word replacements.

02

Evasion attack on LDA is NP-hard, highlighting its computational complexity.

03

Empirical results show EvaLDA effectively manipulates LDA outputs on real datasets.

Abstract

As one of the most powerful topic models, Latent Dirichlet Allocation (LDA) has been used in a vast range of tasks, including document understanding, information retrieval and peer-reviewer assignment. Despite its tremendous popularity, the security of LDA has rarely been studied. This poses severe risks to security-critical tasks such as sentiment analysis and peer-reviewer assignment that are based on LDA. In this paper, we are interested in knowing whether LDA models are vulnerable to adversarial perturbations of benign document examples during inference time. We formalize the evasion attack to LDA models as an optimization problem and prove it to be NP-hard. We then propose a novel and efficient algorithm, EvaLDA to solve it. We show the effectiveness of EvaLDA via extensive empirical evaluations. For instance, in the NIPS dataset, EvaLDA can averagely promote the rank of a target…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tools-only/Evasion-Attack-against-LDA-Model
noneOfficial

Videos

EvaLDA: Efficient Evasion Attacks Towards Latent Dirichlet Allocation· underline

Taxonomy

TopicsTopic Modeling · Adversarial Robustness in Machine Learning · Natural Language Processing Techniques

MethodsLinear Discriminant Analysis