ET-BERT: A Contextualized Datagram Representation with Pre-training   Transformers for Encrypted Traffic Classification

Xinjie Lin; Gang Xiong; Gaopeng Gou; Zhen Li; Junzheng Shi; Jing Yu

arXiv:2202.06335·cs.CR·February 22, 2022

ET-BERT: A Contextualized Datagram Representation with Pre-training Transformers for Encrypted Traffic Classification

Xinjie Lin, Gang Xiong, Gaopeng Gou, Zhen Li, Junzheng Shi, Jing Yu

PDF

1 Repo

TL;DR

ET-BERT introduces a pre-trained transformer-based model for encrypted traffic classification, leveraging large-scale unlabeled data to achieve high accuracy and robustness, significantly outperforming existing methods.

Contribution

The paper proposes ET-BERT, a novel pre-trained transformer model for encrypted traffic representation that generalizes well with limited labeled data and achieves state-of-the-art results.

Findings

01

ET-BERT achieves up to 99.2% F1 on ISCX-Tor.

02

Pre-training improves classification accuracy across multiple datasets.

03

Analysis of cipher randomness provides insights into classification boundaries.

Abstract

Encrypted traffic classification requires discriminative and robust traffic representation captured from content-invisible and imbalanced traffic data for accurate classification, which is challenging but indispensable to achieve network security and network management. The major limitation of existing solutions is that they highly rely on the deep features, which are overly dependent on data size and hard to generalize on unseen data. How to leverage the open-domain unlabeled traffic data to learn representation with strong generalization ability remains a key challenge. In this paper,we propose a new traffic representation model called Encrypted Traffic Bidirectional Encoder Representations from Transformer (ET-BERT), which pre-trains deep contextualized datagram-level representation from large-scale unlabeled data. The pre-trained model can be fine-tuned on a small number of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

linwhitehat/et-bert
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAttention Is All You Need · Linear Layer · Softmax · Layer Normalization · Multi-Head Attention · Dense Connections · Byte Pair Encoding · Dropout · Label Smoothing · Position-Wise Feed-Forward Layer