TL;DR
ET-BERT introduces a pre-trained transformer-based model for encrypted traffic classification, leveraging large-scale unlabeled data to achieve high accuracy and robustness, significantly outperforming existing methods.
Contribution
The paper proposes ET-BERT, a novel pre-trained transformer model for encrypted traffic representation that generalizes well with limited labeled data and achieves state-of-the-art results.
Findings
ET-BERT achieves up to 99.2% F1 on ISCX-Tor.
Pre-training improves classification accuracy across multiple datasets.
Analysis of cipher randomness provides insights into classification boundaries.
Abstract
Encrypted traffic classification requires discriminative and robust traffic representation captured from content-invisible and imbalanced traffic data for accurate classification, which is challenging but indispensable to achieve network security and network management. The major limitation of existing solutions is that they highly rely on the deep features, which are overly dependent on data size and hard to generalize on unseen data. How to leverage the open-domain unlabeled traffic data to learn representation with strong generalization ability remains a key challenge. In this paper,we propose a new traffic representation model called Encrypted Traffic Bidirectional Encoder Representations from Transformer (ET-BERT), which pre-trains deep contextualized datagram-level representation from large-scale unlabeled data. The pre-trained model can be fine-tuned on a small number of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAttention Is All You Need · Linear Layer · Softmax · Layer Normalization · Multi-Head Attention · Dense Connections · Byte Pair Encoding · Dropout · Label Smoothing · Position-Wise Feed-Forward Layer
