TL;DR
This paper introduces a modular, high-performance on-chip communication platform designed for heterogeneous many-core systems, enabling scalable, high-bandwidth, low-latency data transfer across thousands of cores.
Contribution
It presents a novel, topology-agnostic communication platform with customizable modules that support high bandwidth and concurrency for heterogeneous SoCs.
Findings
Supports 2.5 GHz, 1024-bit data width communication fabrics
Scales to 1024 cores with 32 TB/s bandwidth
Achieves 24 ns round-trip latency between cores
Abstract
On-chip communication infrastructure is a central component of modern systems-on-chip (SoCs), and it continues to gain importance as the number of cores, the heterogeneity of components, and the on-chip and off-chip bandwidth continue to grow. Decades of research on on-chip networks enabled cache-coherent shared-memory multiprocessors. However, communication fabrics that meet the needs of heterogeneous many-cores and accelerator-rich SoCs, which are not, or only partially, coherent, are a much less mature research area. In this work, we present a modular, topology-agnostic, high-performance on-chip communication platform. The platform includes components to build and link subnetworks with customizable bandwidth and concurrency properties and adheres to a state-of-the-art, industry-standard protocol. We discuss microarchitectural trade-offs and timing/area characteristics of our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
