Potent but Stealthy: Rethink Profile Pollution against Sequential Recommendation via Bi-level Constrained Reinforcement Paradigm

Jiajie Su; Zihan Nan; Yunshan Ma; Xiaobo Xia; Xiaohua Feng; Weiming Liu; Xiang Chen; Xiaolin Zheng; Chaochao Chen

arXiv:2511.09392·cs.LG·December 23, 2025

Potent but Stealthy: Rethink Profile Pollution against Sequential Recommendation via Bi-level Constrained Reinforcement Paradigm

Jiajie Su, Zihan Nan, Yunshan Ma, Xiaobo Xia, Xiaohua Feng, Weiming Liu, Xiang Chen, Xiaolin Zheng, Chaochao Chen

PDF

Open Access 1 Video

TL;DR

This paper introduces CREAT, a novel reinforcement learning-based attack method that subtly contaminates user interaction sequences to mislead sequential recommenders while maintaining stealthiness.

Contribution

The paper proposes a bi-level constrained reinforcement learning framework for profile pollution attacks, improving stealthiness and effectiveness over prior methods.

Findings

01

CREAT achieves high attack success rates.

02

CREAT maintains low detectability in perturbations.

03

Extensive experiments validate CREAT's effectiveness.

Abstract

Sequential Recommenders, which exploit dynamic user intents through interaction sequences, is vulnerable to adversarial attacks. While existing attacks primarily rely on data poisoning, they require large-scale user access or fake profiles thus lacking practicality. In this paper, we focus on the Profile Pollution Attack that subtly contaminates partial user interactions to induce targeted mispredictions. Previous PPA methods suffer from two limitations, i.e., i) over-reliance on sequence horizon impact restricts fine-grained perturbations on item transitions, and ii) holistic modifications cause detectable distribution shifts. To address these challenges, we propose a constrained reinforcement driven attack CREAT that synergizes a bi-level optimization framework with multi-reward reinforcement learning to balance adversarial efficacy and stealthiness. We first develop a Pattern…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Potent but Stealthy: Rethink Profile Pollution Against Sequential Recommendation via Bi-Level Constrained Reinforcement Paradigm· underline

Taxonomy

TopicsRecommender Systems and Techniques · Advanced Bandit Algorithms Research · Advanced Graph Neural Networks