Tutorial on Flow-Based Network Traffic Classification Using Machine Learning
Adrian Pekar, Richard Plny, Karel Hynek

TL;DR
This tutorial offers a comprehensive, practical guide for building machine learning-based network traffic classification systems that are effective even with encrypted traffic, emphasizing real-world applicability and reproducibility.
Contribution
It provides an end-to-end workflow, including dataset creation, feature engineering, model training, and deployment, with practical guidance and reproducible notebooks for real traffic data.
Findings
Effective classification under encryption demonstrated
Guidelines for feature engineering and model evaluation provided
Reproducible workflows with real traffic data included
Abstract
Modern networks carry increasingly diverse and encrypted traffic types that demand classification techniques beyond traditional port-based and payload-based methods. This tutorial provides a practical, end-to-end guide to building machine-learning-based network traffic flow classification systems. We cover the workflow from flow metering and dataset creation, through ground-truth labeling and feature engineering, to leakage-resistant experimental design, model training and evaluation, explainability, and deployment considerations. The tutorial focuses on supervised flow-based classification that remains effective under encryption and provides actionable guidance on algorithm selection, performance metrics, and realistic partitioning strategies, with emphasis on common real-world measurement artifacts and methodological pitfalls. A companion set of five Jupyter notebooks on GitHub…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInternet Traffic Analysis and Secure E-voting · Network Security and Intrusion Detection · Legal and Policy Issues
