Deep Learning and Traffic Classification: Lessons learned from a commercial-grade dataset with hundreds of encrypted and zero-day applications
Lixuan Yang, Alessandro Finamore, Feng Jun, Dario Rossi

TL;DR
This paper evaluates deep learning traffic classifiers in real-world settings with thousands of application labels and introduces a novel, lightweight method for zero-day application detection, highlighting DL's advantages for unknown traffic identification.
Contribution
It provides a comprehensive assessment of DL traffic classifiers in commercial environments and proposes a new technique for zero-day detection that outperforms existing methods.
Findings
DL models perform similarly to ML for known traffic classification
DL's non-linear feature extraction enhances zero-day detection
The proposed method is more accurate and lightweight than prior approaches
Abstract
The increasing success of Machine Learning (ML) and Deep Learning (DL) has recently re-sparked interest towards traffic classification. While classification of known traffic is a well investigated subject with supervised classification tools (such as ML and DL models) are known to provide satisfactory performance, detection of unknown (or zero-day) traffic is more challenging and typically handled by unsupervised techniques (such as clustering algorithms). In this paper, we share our experience on a commercial-grade DL traffic classification engine that is able to (i) identify known applications from encrypted traffic, as well as (ii) handle unknown zero-day applications. In particular, our contribution for (i) is to perform a thorough assessment of state of the art traffic classifiers in commercial-grade settings comprising few thousands of very fine grained application labels, as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInternet Traffic Analysis and Secure E-voting · Network Security and Intrusion Detection · Advanced Malware Detection Techniques
