Double Backdoored: Converting Code Large Language Model Backdoors to Traditional Malware via Adversarial Instruction Tuning Attacks
Md Imran Hossen, Sai Venkatesh Chilukoti, Liqun Shan, Sheng Chen,, Yinzhi Cao, Xiali Hei

TL;DR
This paper demonstrates that instruction-tuned large language models for coding are highly vulnerable to backdoor and poisoning attacks, which can lead to malicious code generation, highlighting urgent cybersecurity risks.
Contribution
The authors introduce MalInstructCoder, a framework with novel data poisoning and adversarial tuning techniques to assess and exploit vulnerabilities in instruction-tuned Code LLMs.
Findings
High attack success rates (75-86%) with minimal poisoning (0.5-1%)
Vulnerability of top Code LLMs to poisoning and backdoor attacks
Need for robust defenses against instruction-tuning vulnerabilities
Abstract
Instruction-tuned Large Language Models designed for coding tasks are increasingly employed as AI coding assistants. However, the cybersecurity vulnerabilities and implications arising from the widespread integration of these models are not yet fully understood due to limited research in this domain. This work investigates novel techniques for transitioning backdoors from the AI/ML domain to traditional computer malware, shedding light on the critical intersection of AI and cyber/software security. To explore this intersection, we present MalInstructCoder, a framework designed to comprehensively assess the cybersecurity vulnerabilities of instruction-tuned Code LLMs. MalInstructCoder introduces an automated data poisoning pipeline to inject malicious code snippets into benign code, poisoning instruction fine-tuning data while maintaining functional validity. It presents two practical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Security and Intrusion Detection · Information and Cyber Security · Web Application Security Vulnerabilities
