Uber: Utilizing Buffers to Simplify NoCs for Hundreds-Cores
Giorgos Passas

TL;DR
This paper introduces Uber, a buffer-based NoC design that simplifies routing for hundreds-core systems, achieving near-ideal latency with minimal queueing delays and comparable performance to more complex mesh architectures.
Contribution
Uber proposes a balanced NoC design using buffers to reduce complexity and improve latency, demonstrating that a 16-port mesh with extended concentration can approach ideal performance.
Findings
Utilization increases from 2% to at least 50% with Uber.
A 16-port mesh with buffers achieves near-ideal latency.
Uber's performance matches a 64-port optimized mesh, with a 12% improvement.
Abstract
Approaching ideal wire latency using a network-on-chip (NoC) is an important practical problem for many-core systems, particularly hundreds-cores. Although other researchers have focused on optimizing large meshes, bypassing or speculating router pipelines, or creating more intricate logarithmic topologies, this paper proposes a balanced combination that trades queue buffers for simplicity. Preliminary analysis of nine benchmarks from PARSEC and SPLASH using execution-driven simulation shows that utilization rises from 2% when connecting a single core per mesh port to at least 50%, as slack for delay in concentrator and router queues is around 6x higher compared to the ideal latency of just 20 cycles. That is, a 16-port mesh suffices because queueing is the uncommon case for system performance. In this way, the mesh hop count is bounded to three, as load becomes uniform via extended…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInterconnection Networks and Systems · Parallel Computing and Optimization Techniques · Advancements in Battery Materials
