Traffic-MoE: A Sparse Foundation Model for Network Traffic Analysis
Jiajun Zhou, Changhui Sun, Meng Shen, Shanqing Yu, Qi Xuan

TL;DR
Traffic-MoE introduces a sparse, expert-routing model for network traffic analysis that significantly improves detection accuracy, throughput, and efficiency, enabling real-time deployment in security environments.
Contribution
It presents a novel sparse foundation model with dynamic routing for efficient, high-performance network traffic analysis, addressing computational challenges of large pre-trained models.
Findings
Up to 12.38% better detection performance
91.62% increase in throughput
47.81% reduction in inference latency
Abstract
While pre-trained large models have achieved state-of-the-art performance in network traffic analysis, their prohibitive computational costs hinder deployment in real-time, throughput-sensitive network defense environments. This work bridges the gap between advanced representation learning and practical network protection by introducing Traffic-MoE, a sparse foundation model optimized for high-efficiency real-time inference. By dynamically routing traffic tokens to a small subset of specialized experts, Traffic-MoE effectively decouples model capacity from computational overhead. Extensive evaluations across three security-oriented tasks demonstrate that Traffic-MoE achieves up to a 12.38% improvement in detection performance compared to leading dense competitors. Crucially, it delivers a 91.62% increase in throughput, reduces inference latency by 47.81%, and cuts peak GPU memory…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInternet Traffic Analysis and Secure E-voting · Network Security and Intrusion Detection · Adversarial Robustness in Machine Learning
