TraGe: A Generic Packet Representation for Traffic Classification Based on Header-Payload Differences
Chungang Lin, Yilong Jiang, Weiyao Zhang, Xuying Meng, Tianyu Zuo, Yujun Zhang

TL;DR
TraGe introduces a novel packet representation model that leverages header-payload differences and dynamic masking to improve traffic classification accuracy and robustness with limited labeled data.
Contribution
It presents a new pre-training approach tailored for network traffic data, addressing limitations of image/text-based models and enhancing classification performance.
Findings
Outperforms state-of-the-art methods by up to 6.97%
Demonstrates robustness under parameter fluctuations
Effective with limited labeled data
Abstract
Traffic classification has a significant impact on maintaining the Quality of Service (QoS) of the network. Since traditional methods heavily rely on feature extraction and large scale labeled data, some recent pre-trained models manage to reduce the dependency by utilizing different pre-training tasks to train generic representations for network packets. However, existing pre-trained models typically adopt pre-training tasks developed for image or text data, which are not tailored to traffic data. As a result, the obtained traffic representations fail to fully reflect the information contained in the traffic, and may even disrupt the protocol information. To address this, we propose TraGe, a novel generic packet representation model for traffic classification. Based on the differences between the header and payload-the two fundamental components of a network packet-we perform…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInternet Traffic Analysis and Secure E-voting · Network Security and Intrusion Detection · Network Packet Processing and Optimization
