Active Learning Framework to Automate NetworkTraffic Classification
Jaroslav Pe\v{s}ek, Dominik Soukup, Tom\'a\v{s} \v{C}ejka

TL;DR
This paper introduces a novel Active Learning Framework (ALF) for network traffic classification that continuously updates datasets and models, addressing challenges like data scarcity and high traffic volumes in high-speed networks.
Contribution
The paper presents a new ALF that automates dataset and model evolution for network traffic analysis, supporting high-speed networks and research experimentation.
Findings
ALF can be deployed for 100 Gb/s network analysis.
Supports research on annotation and dataset optimization.
Addresses dataset aging and data scarcity challenges.
Abstract
Recent network traffic classification methods benefitfrom machine learning (ML) technology. However, there aremany challenges due to use of ML, such as: lack of high-qualityannotated datasets, data-drifts and other effects causing aging ofdatasets and ML models, high volumes of network traffic etc. Thispaper argues that it is necessary to augment traditional workflowsof ML training&deployment and adapt Active Learning concepton network traffic analysis. The paper presents a novel ActiveLearning Framework (ALF) to address this topic. ALF providesprepared software components that can be used to deploy an activelearning loop and maintain an ALF instance that continuouslyevolves a dataset and ML model automatically. The resultingsolution is deployable for IP flow-based analysis of high-speed(100 Gb/s) networks, and also supports research experiments ondifferent strategies and methods for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Security and Intrusion Detection · Internet Traffic Analysis and Secure E-voting · Anomaly Detection Techniques and Applications
