Octopus: A Heterogeneous In-network Computing Accelerator Enabling Deep Learning for network
Dong Wen, Tao Li, Chenglong Li, Pengye Xia, Hui Yang, Zhigang Sun

TL;DR
Octopus is a heterogeneous in-network computing accelerator designed to enable deep learning for network models, addressing challenges in computing power, task granularity, and model generality, with FPGA implementation demonstrating high performance.
Contribution
It introduces a novel heterogeneous accelerator architecture with a feature extractor, vector accelerator, systolic array, on-chip memory, and Risc-V core for in-network deep learning.
Findings
Achieves 31 million packets per second feature extraction
Provides 207 nanoseconds packet-based computing latency
Reaches 90,000 flow-based computing throughput
Abstract
Deep learning (DL) for network models have achieved excellent performance in the field and are becoming a promising component in future intelligent network system. Programmable in-network computing device has great potential to deploy DL for network models, however, existing device cannot afford to run a DL model. The main challenges of data-plane supporting DL-based network models lie in computing power, task granularity, model generality and feature extracting. To address above problems, we propose Octopus: a heterogeneous in-network computing accelerator enabling DL for network models. A feature extractor is designed for fast and efficient feature extracting. Vector accelerator and systolic array work in a heterogeneous collaborative way, offering low-latency-highthroughput general computing ability for packet-and-flow-based tasks. Octopus also contains on-chip memory fabric for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Software-Defined Networks and 5G · Advanced Computing and Algorithms
