YOLoC: DeploY Large-Scale Neural Network by ROM-based Computing-in-Memory using ResiduaL Branch on a Chip
Yiming Chen, Guodong Yin, Zhanhong Tan, Mingyen Lee, Zekun Yang,, Yongpan Liu, Huazhong Yang, Kaisheng Ma, Xueqing Li

TL;DR
This paper introduces YOLoC, a novel ROM-based computing-in-memory framework with a residual branch technique, significantly improving energy efficiency for large-scale neural network deployment in object detection tasks.
Contribution
It proposes the first computing-in-ROM design with a residual branch method, enabling higher on-chip memory capacity and energy efficiency for neural networks.
Findings
Achieves 14.8x energy efficiency improvement for YOLO
Achieves 4.8x energy efficiency improvement for ResNet-18
Maintains less than 8% latency overhead and minimal accuracy loss
Abstract
Computing-in-memory (CiM) is a promising technique to achieve high energy efficiency in data-intensive matrix-vector multiplication (MVM) by relieving the memory bottleneck. Unfortunately, due to the limited SRAM capacity, existing SRAM-based CiM needs to reload the weights from DRAM in large-scale networks. This undesired fact weakens the energy efficiency significantly. This work, for the first time, proposes the concept, design, and optimization of computing-in-ROM to achieve much higher on-chip memory capacity, and thus less DRAM access and lower energy consumption. Furthermore, to support different computing scenarios with varying weights, a weight fine-tune technique, namely Residual Branch (ReBranch), is also proposed. ReBranch combines ROM-CiM and assisting SRAM-CiM to ahieve high versatility. YOLoC, a ReBranch-assisted ROM-CiM framework for object detection is presented and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
