Understanding the Impact of On-chip Communication on DNN Accelerator Performance
Robert Guirado, Hyoukjun Kwon, Eduard Alarc\'on, Sergi Abadal and, Tushar Krishna

TL;DR
This paper analyzes how on-chip communication affects the performance of CNN accelerators, emphasizing the importance of dataflow and exploring the potential of wireless on-chip networks for future improvements.
Contribution
It provides a detailed analysis of on-chip communication flows in CNN accelerators and discusses the potential benefits of wireless on-chip networks.
Findings
On-chip communication is critical for CNN accelerator performance.
Current dataflows heavily depend on efficient on-chip data movement.
Wireless on-chip networks could offer significant future advantages.
Abstract
Deep Neural Networks have flourished at an unprecedented pace in recent years. They have achieved outstanding accuracy in fields such as computer vision, natural language processing, medicine or economics. Specifically, Convolutional Neural Networks (CNN) are particularly suited to object recognition or identification tasks. This, however, comes at a high computational cost, prompting the use of specialized GPU architectures or even ASICs to achieve high speeds and energy efficiency. ASIC accelerators streamline the execution of certain dataflows amenable to CNN computation that imply the constant movement of large amounts of data, thereby turning on-chip communication into a critical function within the accelerator. This paper studies the communication flows within CNN inference accelerators of edge devices, with the aim to justify current and future decisions in the design of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
