Domino: A Tailored Network-on-Chip Architecture to Enable Highly Localized Inter- and Intra-Memory DNN Computing
Kaining Zhou, Yangshuo He, Rui Xiao, Kejie Huang

TL;DR
Domino is a novel Network-on-Chip architecture designed for highly localized DNN computing, significantly reducing data movement energy and improving power efficiency and throughput in CIM-based neural network accelerators.
Contribution
The paper introduces Domino, a flexible CIM processor architecture with tailored distributed instruction scheduling for inter-memory computing and enhanced mapping flexibility.
Findings
Achieves 1.15 to 9.49 times power efficiency improvements.
Improves throughput by 1.57 to 12.96 times.
Reduces data movement energy in DNN accelerators.
Abstract
The ever-increasing computation complexity of fast-growing Deep Neural Networks (DNNs) has requested new computing paradigms to overcome the memory wall in conventional Von Neumann computing architectures. The emerging Computing-In-Memory (CIM) architecture has been a promising candidate to accelerate neural network computing. However, the data movement between CIM arrays may still dominate the total power consumption in conventional designs. This paper proposes a flexible CIM processor architecture named Domino to enable stream computing and local data access to significantly reduce the data movement energy. Meanwhile, Domino employs tailored distributed instruction scheduling within Network-on-Chip (NoC) to implement inter-memory-computing and attain mapping flexibility. The evaluation with prevailing CNN models shows that Domino achieves 1.15-to-9.49 power efficiency over…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Advanced Neural Network Applications
