Generalized Adversarial Code-Suggestions: Exploiting Contexts of   LLM-based Code-Completion

Karl Rubel; Maximilian Noppel; Christian Wressnegger

arXiv:2410.10526·cs.CR·October 15, 2024

Generalized Adversarial Code-Suggestions: Exploiting Contexts of LLM-based Code-Completion

Karl Rubel, Maximilian Noppel, Christian Wressnegger

PDF

Open Access

TL;DR

This paper introduces a generalized framework for adversarial attacks on LLM-based code assistants, highlighting their effectiveness and the limited defenses available, raising concerns about security in AI-assisted coding.

Contribution

It proposes a novel, flexible attack formulation over prompt triggers and embedding maps, extending prior work and evaluating defenses against these sophisticated attacks.

Findings

01

Directional-map attacks increase stealthiness

02

Most defenses offer limited protection

03

Attacks are highly effective across various scenarios

Abstract

While convenient, relying on LLM-powered code assistants in day-to-day work gives rise to severe attacks. For instance, the assistant might introduce subtle flaws and suggest vulnerable code to the user. These adversarial code-suggestions can be introduced via data poisoning and, thus, unknowingly by the model creators. In this paper, we provide a generalized formulation of such attacks, spawning and extending related work in this domain. This formulation is defined over two components: First, a trigger pattern occurring in the prompts of a specific user group, and, second, a learnable map in embedding space from the prompt to an adversarial bait. The latter gives rise to novel and more flexible targeted attack-strategies, allowing the adversary to choose the most suitable trigger pattern for a specific user-group arbitrarily, without restrictions on the pattern's tokens. Our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLaw, AI, and Intellectual Property · Artificial Intelligence in Law · Digital Rights Management and Security