One task to rule them all: A closer look at traffic classification generalizability
Elham Akbari, Zihao Zhou, Mohammad Ali Salahuddin, Noura Limam, Raouf Boutaba, Bertrand Mathieu, Stephanie Moteau, and Stephane Tuffin

TL;DR
This paper investigates the generalizability of traffic classification models across different datasets and real-world network scenarios, revealing significant performance drops under distribution shifts and highlighting the importance of evaluation context.
Contribution
It introduces a realistic evaluation framework for traffic classification under distribution shift, analyzing the robustness of existing models and emphasizing the need for generalizable solutions.
Findings
Performance drops to 30-40% under distribution shift
Simple 1-Nearest Neighbor performs comparably to complex models
Evaluation across models reveals limited generalizability in current solutions
Abstract
Existing website fingerprinting and traffic classification solutions do not work well when the evaluation context changes, as their performances often heavily rely on context-specific assumptions. To clarify this problem, we take three prior solutions presented for different but similar traffic classification and website fingerprinting tasks, and apply each solution's model to another solution's dataset. We pinpoint dataset-specific and model-specific properties that lead each of them to overperform in their specific evaluation context. As a realistic evaluation context that takes practical labeling constraints into account, we design an evaluation framework using two recent real-world TLS traffic datasets from large-scale networks. The framework simulates a futuristic scenario in which SNIs are hidden in some networks but not in others, and the classifier's goal is to predict…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInternet Traffic Analysis and Secure E-voting · Spam and Phishing Detection · Network Security and Intrusion Detection
