Adversarial Attacks and Detection on Reinforcement Learning-Based   Interactive Recommender Systems

Yuanjiang Cao; Xiaocong Chen; Lina Yao; Xianzhi Wang; Wei Emma; Zhang

arXiv:2006.07934·cs.LG·June 16, 2020

Adversarial Attacks and Detection on Reinforcement Learning-Based Interactive Recommender Systems

Yuanjiang Cao, Xiaocong Chen, Lina Yao, Xianzhi Wang, Wei Emma, Zhang

PDF

TL;DR

This paper introduces an attack-agnostic detection method for reinforcement learning-based interactive recommender systems, demonstrating its effectiveness against diverse adversarial attacks through extensive experiments.

Contribution

It proposes a deep learning-based classifier for early detection of adversarial attacks, capable of generalizing across multiple attack crafting methods in recommender systems.

Findings

01

Most adversarial attacks are highly effective.

02

Attack strength and frequency significantly influence attack success.

03

Strategically-timed attacks can match continuous attacks with fewer attempts.

Abstract

Adversarial attacks pose significant challenges for detecting adversarial attacks at an early stage. We propose attack-agnostic detection on reinforcement learning-based interactive recommendation systems. We first craft adversarial examples to show their diverse distributions and then augment recommendation systems by detecting potential attacks with a deep learning-based classifier based on the crafted data. Finally, we study the attack strength and frequency of adversarial examples and evaluate our model on standard datasets with multiple crafting methods. Our extensive experiments show that most adversarial attacks are effective, and both attack strength and attack frequency impact the attack performance. The strategically-timed attack achieves comparative attack performance with only 1/3 to 1/2 attack frequency. Besides, our black-box detector trained with one crafting method has…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.