On Detecting Data Pollution Attacks On Recommender Systems Using   Sequential GANs

Behzad Shahrasbi; Venugopal Mani; Apoorv Reddy Arrabothu; Deepthi; Sharma; Kannan Achan; Sushant Kumar

arXiv:2012.02509·cs.LG·December 7, 2020·1 cites

On Detecting Data Pollution Attacks On Recommender Systems Using Sequential GANs

Behzad Shahrasbi, Venugopal Mani, Apoorv Reddy Arrabothu, Deepthi, Sharma, Kannan Achan, Sushant Kumar

PDF

Open Access

TL;DR

This paper introduces a semi-supervised method using a modified sequential GAN architecture to detect malicious data injections in recommender systems by leveraging genuine data distribution and contextual user activity.

Contribution

It presents a novel semi-supervised attack detection approach that incorporates contextual information into GANs to identify data pollution in recommender systems.

Findings

01

Effective detection of malicious datapoints demonstrated

02

Utilizes less polluted data to learn genuine data distribution

03

Enhances robustness of recommender systems against attacks

Abstract

Recommender systems are an essential part of any e-commerce platform. Recommendations are typically generated by aggregating large amounts of user data. A malicious actor may be motivated to sway the output of such recommender systems by injecting malicious datapoints to leverage the system for financial gain. In this work, we propose a semi-supervised attack detection algorithm to identify the malicious datapoints. We do this by leveraging a portion of the dataset that has a lower chance of being polluted to learn the distribution of genuine datapoints. Our proposed approach modifies the Generative Adversarial Network architecture to take into account the contextual information from user activity. This allows the model to distinguish legitimate datapoints from the injected ones.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Spam and Phishing Detection · Adversarial Robustness in Machine Learning