FTT-NAS: Discovering Fault-Tolerant Convolutional Neural Architecture
Xuefei Ning, Guangjun Ge, Wenshuo Li, Zhenhua Zhu, Yin Zheng, Xiaoming, Chen, Zhen Gao, Yu Wang, Huazhong Yang

TL;DR
This paper introduces FTT-NAS, a method for automatically discovering convolutional neural network architectures that are robust to various faults in edge devices, enhancing reliability in fault-prone environments.
Contribution
It proposes a fault-tolerant neural architecture search (FTT-NAS) framework that incorporates fault models and fault-tolerant training to improve neural network robustness.
Findings
Discovered architectures outperform baseline models significantly.
F-FTT-Net achieves 86.2% accuracy under fault conditions, surpassing MobileNet-V2.
W-FTT-Net achieves 69.6% accuracy, outperforming ResNet-20.
Abstract
With the fast evolvement of embedded deep-learning computing systems, applications powered by deep learning are moving from the cloud to the edge. When deploying neural networks (NNs) onto the devices under complex environments, there are various types of possible faults: soft errors caused by cosmic radiation and radioactive impurities, voltage instability, aging, temperature variations, and malicious attackers. Thus the safety risk of deploying NNs is now drawing much attention. In this paper, after the analysis of the possible faults in various types of NN accelerators, we formalize and implement various fault models from the algorithmic perspective. We propose Fault-Tolerant Neural Architecture Search (FT-NAS) to automatically discover convolutional neural network (CNN) architectures that are reliable to various faults in nowadays devices. Then we incorporate fault-tolerant training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Anomaly Detection Techniques and Applications · Software System Performance and Reliability
MethodsSigmoid Activation · Tanh Activation · Softmax · Long Short-Term Memory
