CATO: End-to-End Optimization of ML-Based Traffic Analysis Pipelines
Gerry Wan, Shinan Liu, Francesco Bronzino, Nick Feamster, Zakir, Durumeric

TL;DR
CATO is a framework that jointly optimizes ML-based network traffic analysis pipelines for both predictive accuracy and system efficiency, enabling deployment with significantly reduced latency and higher throughput.
Contribution
It introduces a multi-objective Bayesian optimization approach to automatically generate end-to-end optimized traffic analysis pipelines that balance accuracy and system costs.
Findings
Up to 3600x lower inference latency
3.7x higher zero-loss throughput
Better model performance compared to feature optimization techniques
Abstract
Machine learning has shown tremendous potential for improving the capabilities of network traffic analysis applications, often outperforming simpler rule-based heuristics. However, ML-based solutions remain difficult to deploy in practice. Many existing approaches only optimize the predictive performance of their models, overlooking the practical challenges of running them against network traffic in real time. This is especially problematic in the domain of traffic analysis, where the efficiency of the serving pipeline is a critical factor in determining the usability of a model. In this work, we introduce CATO, a framework that addresses this problem by jointly optimizing the predictive performance and the associated systems costs of the serving pipeline. CATO leverages recent advances in multi-objective Bayesian optimization to efficiently identify Pareto-optimal configurations, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware-Defined Networks and 5G · Network Packet Processing and Optimization · Network Security and Intrusion Detection
