Pegasus: A Universal Framework for Scalable Deep Learning Inference on the Dataplane
Yinchao Zhang, Su Yao, Yong Feng, Kang Chen, Tong Li, Zhuotao Liu, Yi Zhao, Lexuan Zhang, Xiangyu Gao, Feng Xiong, Qi Li, Ke Xu

TL;DR
Pegasus introduces a novel framework translating deep learning operations into dataplane-friendly primitives, enabling scalable, accurate, and general DL inference directly on network hardware, significantly outperforming existing methods.
Contribution
It proposes a universal set of primitives and fusion techniques to enable scalable, accurate DL inference on the dataplane, addressing limitations of current match-action table abstractions.
Findings
Supports various DL models including MLP, RNN, CNN, AutoEncoder
Achieves up to 22.8% accuracy improvement over state-of-the-art
Enables model sizes up to 248x larger and input scales 212x larger
Abstract
The paradigm of Intelligent DataPlane (IDP) embeds deep learning (DL) models on the network dataplane to enable intelligent traffic analysis at line-speed. However, the current use of the match-action table (MAT) abstraction on the dataplane is misaligned with DL inference, leading to several key limitations, including accuracy degradation, limited scale, and lack of generality. This paper proposes Pegasus to address these limitations. Pegasus translates DL operations into three dataplane-oriented primitives to achieve generality: Partition, Map, and SumReduce. Specifically, Partition "divides" high-dimensional features into multiple low-dimensional vectors, making them more suitable for the dataplane; Map "conquers" computations on the low-dimensional vectors in parallel with the technique of fuzzy matching, while SumReduce "combines" the computation results. Additionally, Pegasus…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInternet Traffic Analysis and Secure E-voting · Software-Defined Networks and 5G · Traffic Prediction and Management Techniques
MethodsPEGASUS
