Extensible Machine Learning for Encrypted Network Traffic Application Labeling via Uncertainty Quantification
Steven Jorgensen, John Holodnak, Jensen Dempsey, Karla de Souza,, Ananditha Raghunath, Vernon Rivet, Noah DeMoes, Andr\'es Alejos, and Allan, Wollaber (MIT Lincoln Laboratory)

TL;DR
This paper introduces a machine learning framework with uncertainty quantification for classifying encrypted network traffic, including a new dataset, enabling reliable detection of known and novel traffic types with high accuracy.
Contribution
The work presents a new dataset of VPN-encrypted traffic and a rapid, calibrated ML framework that provides predictive probabilities and out-of-distribution scores for encrypted traffic classification.
Findings
Achieved an F1 score of 0.98 on the dataset.
Effectively flags uncertain and out-of-distribution traffic.
Successfully extends to enterprise network scenarios.
Abstract
With the increasing prevalence of encrypted network traffic, cyber security analysts have been turning to machine learning (ML) techniques to elucidate the traffic on their networks. However, ML models can become stale as new traffic emerges that is outside of the distribution of the training set. In order to reliably adapt in this dynamic environment, ML models must additionally provide contextualized uncertainty quantification to their predictions, which has received little attention in the cyber security domain. Uncertainty quantification is necessary both to signal when the model is uncertain about which class to choose in its label assignment and when the traffic is not likely to belong to any pre-trained classes. We present a new, public dataset of network traffic that includes labeled, Virtual Private Network (VPN)-encrypted network traffic generated by 10 applications and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInternet Traffic Analysis and Secure E-voting · Cryptography and Data Security · Network Security and Intrusion Detection
