Drop-Connect as a Fault-Tolerance Approach for RRAM-based Deep Neural Network Accelerators
Mingyuan Xiang, Xuhan Xie, Pedro Savarese, Xin Yuan, Michael Maire and, Yanjing Li

TL;DR
This paper introduces a drop-connect training method that enables RRAM-based DNN accelerators to maintain high accuracy despite high defect rates, without hardware modifications or retraining.
Contribution
It proposes a defect-aware training approach that simulates RRAM faults during training, improving fault tolerance without hardware changes or additional circuitry.
Findings
Maintains less than 1% accuracy degradation at 30% defect rates.
Requires no hardware modifications or retraining.
Minimal impact on runtime and energy efficiency.
Abstract
Resistive random-access memory (RRAM) is widely recognized as a promising emerging hardware platform for deep neural networks (DNNs). Yet, due to manufacturing limitations, current RRAM devices are highly susceptible to hardware defects, which poses a significant challenge to their practical applicability. In this paper, we present a machine learning technique that enables the deployment of defect-prone RRAM accelerators for DNN applications, without necessitating modifying the hardware, retraining of the neural network, or implementing additional detection circuitry/logic. The key idea involves incorporating a drop-connect inspired approach during the training phase of a DNN, where random subsets of weights are selected to emulate fault effects (e.g., set to zero to mimic stuck-at-1 faults), thereby equipping the DNN with the ability to learn and adapt to RRAM defects with the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiation Effects in Electronics · Advanced Memory and Neural Computing · Integrated Circuits and Semiconductor Failure Analysis
