Prompt-Unknown Promotion Attacks against LLM-based Sequential Recommender Systems
Yuchuan Zhao, Tong Chen, Junliang Yu, Zongwei Wang, Lizhen Cui, Hongzhi Yin

TL;DR
This paper introduces a novel black-box attack framework, PUDA, that effectively promotes target items in LLM-based sequential recommender systems without access to the victim model or prompt.
Contribution
It proposes a dual-poisoning attack method that infers prompts and models, demonstrating significant vulnerabilities in LLM-SRSs under realistic security assumptions.
Findings
PUDA outperforms existing methods in boosting target item exposure.
The attack is effective even when both prompts and models are unknown.
The study highlights critical security risks in current LLM-SRSs.
Abstract
Large language model-powered sequential recommender systems (LLM-SRSs) have recently demonstrated remarkable performance, enabling recommendations through prompt-driven inference over user interaction sequences. However, this paradigm also introduces new security vulnerabilities, particularly text-level manipulations, rendering them appealing targets for promotion attacks that purposely boost the ranking of specific target items. Although such security risks have been receiving increasing attention, existing studies typically rely on an unrealistic assumption of access to either the victim model or prompt to unveil attack mechanisms. In this work, we investigate the item promotion attack in LLM-SRSs under a more realistic setting where both the system prompt and victim model are unknown to the attacker, and propose a Prompt-Unknown Dual-poisoning Attack (PUDA) framework. To simulate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
