INSIGHT: A Survey of In-Network Systems for Intelligent, High-Efficiency AI and Topology Optimization

Aleksandr Algazinov; Joydeep Chandra; and Matt Laing

arXiv:2505.24269·cs.NI·August 19, 2025

INSIGHT: A Survey of In-Network Systems for Intelligent, High-Efficiency AI and Topology Optimization

Aleksandr Algazinov, Joydeep Chandra, and Matt Laing

PDF

Open Access

TL;DR

This survey explores how in-network computation can enhance AI workloads by leveraging programmable network devices, discussing architectures, methodologies, applications, and future research directions.

Contribution

It provides a comprehensive overview of in-network AI systems, analyzing architectures, methodologies, and applications, and proposing future research directions.

Findings

01

In-network computation reduces AI workload latency and improves throughput.

02

Frameworks like Planter and Quark facilitate development of in-network AI solutions.

03

In-network aggregation and federated learning enhance privacy and scalability.

Abstract

In-network computation represents a transformative approach to addressing the escalating demands of Artificial Intelligence (AI) workloads on network infrastructure. By leveraging the processing capabilities of network devices such as switches, routers, and Network Interface Cards (NICs), this paradigm enables AI computations to be performed directly within the network fabric, significantly reducing latency, enhancing throughput, and optimizing resource utilization. This paper provides a comprehensive analysis of optimizing in-network computation for AI, exploring the evolution of programmable network architectures, such as Software-Defined Networking (SDN) and Programmable Data Planes (PDPs), and their convergence with AI. It examines methodologies for mapping AI models onto resource-constrained network devices, addressing challenges like limited memory and computational capabilities…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIoT and Edge/Fog Computing