Reconfigurable Stream Network Architecture
Chengyue Wang, Xiaofan Zhang, Jason Cong, James C. Hoe

TL;DR
This paper introduces the Reconfigurable Stream Network Architecture (RSN), an ISA abstraction for heterogeneous AI hardware that improves efficiency, reduces latency, and enhances throughput by modeling data paths as circuit-switched networks.
Contribution
The paper presents a novel ISA abstraction called RSN for DNNs, enabling efficient coordination of heterogeneous resources and phase transitions with explicit control over data movement.
Findings
Reduces latency by 6.1x on a heterogeneous platform
Improves throughput by 2.4x to 3.2x compared to state-of-the-art
Achieves 2.1x higher energy efficiency than A100 GPU
Abstract
As AI systems grow increasingly specialized and complex, managing hardware heterogeneity becomes a pressing challenge. How can we efficiently coordinate and synchronize heterogeneous hardware resources to achieve high utilization? How can we minimize the friction of transitioning between diverse computation phases, reducing costly stalls from initialization, pipeline setup, or drain? Our insight is that a network abstraction at the ISA level naturally unifies heterogeneous resource orchestration and phase transitions. This paper presents a Reconfigurable Stream Network Architecture (RSN), a novel ISA abstraction designed for the DNN domain. RSN models the datapath as a circuit-switched network with stateful functional units as nodes and data streaming on the edges. Programming a computation corresponds to triggering a path. Software is explicitly exposed to the compute and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPeer-to-Peer Network Technologies · Network Traffic and Congestion Control · Caching and Content Delivery
