Saliency Assisted Quantization for Neural Networks
Elmira Mousa Rezabeyk, Salar Beigzad, Yasin Hamzavi, Mohsen, Bagheritabar, Seyedeh Sogol Mirikhoozani

TL;DR
This paper introduces a saliency-assisted quantization approach for neural networks that enhances interpretability and efficiency, analyzing the trade-offs between model accuracy and transparency across different quantization levels on benchmark datasets.
Contribution
It proposes integrating real-time saliency explanations with quantization during training, exploring how different bit-widths affect interpretability and accuracy in neural networks.
Findings
Lower bit-width quantization reduces model accuracy and interpretability.
Quantization impacts saliency map clarity, requiring careful parameter tuning.
Trade-offs exist between resource efficiency and transparency in neural network deployment.
Abstract
Deep learning methods have established a significant place in image classification. While prior research has focused on enhancing final outcomes, the opaque nature of the decision-making process in these models remains a concern for experts. Additionally, the deployment of these methods can be problematic in resource-limited environments. This paper tackles the inherent black-box nature of these models by providing real-time explanations during the training phase, compelling the model to concentrate on the most distinctive and crucial aspects of the input. Furthermore, we employ established quantization techniques to address resource constraints. To assess the effectiveness of our approach, we explore how quantization influences the interpretability and accuracy of Convolutional Neural Networks through a comparative analysis of saliency maps from standard and quantized models.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsFocus
