Take Package as Language: Anomaly Detection Using Transformer

Jie Huang

arXiv:2412.04473·cs.CR·December 9, 2024

Take Package as Language: Anomaly Detection Using Transformer

Jie Huang

PDF

Open Access

TL;DR

This paper introduces NIDS-GPT, a GPT-based model that treats network packet data as sequences of independent tokens, significantly improving anomaly detection accuracy and interpretability in imbalanced and resource-limited scenarios.

Contribution

It proposes a novel approach of representing network data as token sequences for GPT models, enhancing detection performance and interpretability over traditional methods.

Findings

01

Achieves 100% accuracy on CICIDS2017 dataset under extreme imbalance

02

Over 90% accuracy in one-shot learning scenarios

03

Demonstrates scalability and interpretability of the model

Abstract

Network data packet anomaly detection faces numerous challenges, including exploring new anomaly supervision signals, researching weakly supervised anomaly detection, and improving model interpretability. This paper proposes NIDS-GPT, a GPT-based causal language model for network intrusion detection. Unlike previous work, NIDS-GPT innovatively treats each number in the packet as an independent "word" rather than packet fields, enabling a more fine-grained data representation. We adopt an improved GPT-2 model and design special tokenizers and embedding layers to better capture the structure and semantics of network data. NIDS-GPT has good scalability, supports unsupervised pre-training, and enhances model interpretability through attention weight visualization. Experiments on the CICIDS2017 and car-hacking datasets show that NIDS-GPT achieves 100\% accuracy under extreme imbalance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Network Security and Intrusion Detection

MethodsAttention Is All You Need · Layer Normalization · Residual Connection · Adam · Dense Connections · Cosine Annealing · Attention Dropout · Refunds@Expedia|||How do I get a full refund from Expedia? · Linear Layer · Discriminative Fine-Tuning