Traffic Refinery: Cost-Aware Data Representation for Machine Learning on Network Traffic
Francesco Bronzino, Paul Schmitt, Sara Ayoubi, Hyojoon Kim, Renata, Teixeira, Nick Feamster

TL;DR
This paper introduces Traffic Refinery, a framework for evaluating network traffic representations that balances model accuracy with operational system costs, aiding practical deployment of network management ML models.
Contribution
It presents a new system and framework for joint evaluation of ML performance and system costs for network traffic representations, with a real-time traffic transformation tool.
Findings
Exploring different traffic representations improves model deployment feasibility.
Traffic Refinery enables real-time traffic monitoring and feature transformation at 10 Gbps.
Balancing system costs and accuracy enhances practical network management solutions.
Abstract
Network management often relies on machine learning to make predictions about performance and security from network traffic. Often, the representation of the traffic is as important as the choice of the model. The features that the model relies on, and the representation of those features, ultimately determine model accuracy, as well as where and whether the model can be deployed in practice. Thus, the design and evaluation of these models ultimately requires understanding not only model accuracy but also the systems costs associated with deploying the model in an operational network. Towards this goal, this paper develops a new framework and system that enables a joint evaluation of both the conventional notions of machine learning performance (e.g., model accuracy) and the systems-level costs of different representations of network traffic. We highlight these two dimensions for two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Security and Intrusion Detection · Internet Traffic Analysis and Secure E-voting · Anomaly Detection Techniques and Applications
