A Comprehensive Overview of Backdoor Attacks in Large Language Models   within Communication Networks

Haomiao Yang; Kunlan Xiang; Mengyu Ge; Hongwei Li; Rongxing Lu and; Shui Yu

arXiv:2308.14367·cs.CR·September 7, 2023·6 cites

A Comprehensive Overview of Backdoor Attacks in Large Language Models within Communication Networks

Haomiao Yang, Kunlan Xiang, Mengyu Ge, Hongwei Li, Rongxing Lu and, Shui Yu

PDF

Open Access

TL;DR

This paper provides a systematic review and taxonomy of backdoor attacks in large language models used in communication networks, highlighting challenges and future research directions.

Contribution

It introduces a comprehensive taxonomy of backdoor attack types in LLMs within communication networks and analyzes benchmark datasets for these attacks.

Findings

01

Proposes four categories of backdoor attacks: input-triggered, prompt-triggered, instruction-triggered, demonstration-triggered.

02

Analyzes benchmark datasets for backdoor attack detection and mitigation.

03

Identifies open challenges and future research directions in securing LLMs in communication networks.

Abstract

The Large Language Models (LLMs) are poised to offer efficient and intelligent services for future mobile communication networks, owing to their exceptional capabilities in language comprehension and generation. However, the extremely high data and computational resource requirements for the performance of LLMs compel developers to resort to outsourcing training or utilizing third-party data and computing resources. These strategies may expose the model within the network to maliciously manipulated training data and processing, providing an opportunity for attackers to embed a hidden backdoor into the model, termed a backdoor attack. Backdoor attack in LLMs refers to embedding a hidden backdoor in LLMs that causes the model to perform normally on benign samples but exhibit degraded performance on poisoned ones. This issue is particularly concerning within communication networks where…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInterpreting and Communication in Healthcare · Topic Modeling · Natural Language Processing Techniques