Hardware-Robust In-RRAM-Computing for Object Detection
Yu-Hsiang Chiang, Cheng En Ni, Yun Sung, Tuo-Hung Hou, Tian-Sheuan, Chang, and Shyh Jye Jou

TL;DR
This paper presents a hardware-robust in-memory computing macro for object detection that mitigates device variation effects through joint hardware-software optimization, enabling accurate detection despite hardware nonidealities.
Contribution
It introduces a novel joint hardware-software design approach for in-RRAM computing that enhances robustness for complex object detection tasks.
Findings
Achieved only 3.85% mAP drop under hardware nonidealities
Implemented ternary weight mapping and removed batch normalization for robustness
Successfully applied to complex object detection with minimal accuracy loss
Abstract
In-memory computing is becoming a popular architecture for deep-learning hardware accelerators recently due to its highly parallel computing, low power, and low area cost. However, in-RRAM computing (IRC) suffered from large device variation and numerous nonideal effects in hardware. Although previous approaches including these effects in model training successfully improved variation tolerance, they only considered part of the nonideal effects and relatively simple classification tasks. This paper proposes a joint hardware and software optimization strategy to design a hardware-robust IRC macro for object detection. We lower the cell current by using a low word-line voltage to enable a complete convolution calculation in one operation that minimizes the impact of nonlinear addition. We also implement ternary weight mapping and remove batch normalization for better tolerance against…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsBatch Normalization · Convolution
