A Systematic Literature Review on Hardware Reliability Assessment Methods for Deep Neural Networks
Mohammad Hasan Ahmadilivani, Mahdi Taheri, Jaan Raik, Masoud, Daneshtalab, Maksim Jenihhin

TL;DR
This paper systematically reviews various methods for assessing the reliability of deep neural networks, categorizing approaches like fault injection, analytical, and hybrid methods, and discusses their advantages, disadvantages, and open challenges.
Contribution
It provides a comprehensive categorization and analysis of existing DNN reliability assessment methods, highlighting research gaps and future challenges.
Findings
Fault Injection is the most common assessment method.
Different platforms and metrics are used for reliability evaluation.
Open challenges include standardization and scalability of methods.
Abstract
Artificial Intelligence (AI) and, in particular, Machine Learning (ML) have emerged to be utilized in various applications due to their capability to learn how to solve complex problems. Over the last decade, rapid advances in ML have presented Deep Neural Networks (DNNs) consisting of a large number of neurons and layers. DNN Hardware Accelerators (DHAs) are leveraged to deploy DNNs in the target applications. Safety-critical applications, where hardware faults/errors would result in catastrophic consequences, also benefit from DHAs. Therefore, the reliability of DNNs is an essential subject of research. In recent years, several studies have been published accordingly to assess the reliability of DNNs. In this regard, various reliability assessment methods have been proposed on a variety of platforms and applications. Hence, there is a need to summarize the state of the art to identify…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Reliability and Analysis Research · Reliability and Maintenance Optimization · Radiation Effects in Electronics
MethodsSurrogate Lagrangian Relaxation
