Double Backdoored: Converting Code Large Language Model Backdoors to   Traditional Malware via Adversarial Instruction Tuning Attacks

Md Imran Hossen; Sai Venkatesh Chilukoti; Liqun Shan; Sheng Chen,; Yinzhi Cao; Xiali Hei

arXiv:2404.18567·cs.CR·March 10, 2025·3 cites

Double Backdoored: Converting Code Large Language Model Backdoors to Traditional Malware via Adversarial Instruction Tuning Attacks

Md Imran Hossen, Sai Venkatesh Chilukoti, Liqun Shan, Sheng Chen,, Yinzhi Cao, Xiali Hei

PDF

Open Access

TL;DR

This paper demonstrates that instruction-tuned large language models for coding are highly vulnerable to backdoor and poisoning attacks, which can lead to malicious code generation, highlighting urgent cybersecurity risks.

Contribution

The authors introduce MalInstructCoder, a framework with novel data poisoning and adversarial tuning techniques to assess and exploit vulnerabilities in instruction-tuned Code LLMs.

Findings

01

High attack success rates (75-86%) with minimal poisoning (0.5-1%)

02

Vulnerability of top Code LLMs to poisoning and backdoor attacks

03

Need for robust defenses against instruction-tuning vulnerabilities

Abstract

Instruction-tuned Large Language Models designed for coding tasks are increasingly employed as AI coding assistants. However, the cybersecurity vulnerabilities and implications arising from the widespread integration of these models are not yet fully understood due to limited research in this domain. This work investigates novel techniques for transitioning backdoors from the AI/ML domain to traditional computer malware, shedding light on the critical intersection of AI and cyber/software security. To explore this intersection, we present MalInstructCoder, a framework designed to comprehensively assess the cybersecurity vulnerabilities of instruction-tuned Code LLMs. MalInstructCoder introduces an automated data poisoning pipeline to inject malicious code snippets into benign code, poisoning instruction fine-tuning data while maintaining functional validity. It presents two practical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNetwork Security and Intrusion Detection · Information and Cyber Security · Web Application Security Vulnerabilities