Attention Consistency Regularization for Interpretable Early-Exit Neural Networks
Yanhua Zhao

TL;DR
This paper introduces Explanation-Guided Training (EGT), a method that enhances interpretability and consistency in early-exit neural networks by aligning attention maps, achieving high accuracy and faster inference while improving explainability.
Contribution
The paper proposes an attention consistency regularization framework for early-exit neural networks, improving interpretability and trustworthiness without sacrificing accuracy.
Findings
Achieves up to 98.97% accuracy with early exits
Provides 1.97x faster inference speed
Improves attention consistency by up to 18.5%
Abstract
Early-exit neural networks enable adaptive inference by allowing predictions at intermediate layers, reducing computational cost. However, early exits often lack interpretability and may focus on different features than deeper layers, limiting trust and explainability. This paper presents Explanation-Guided Training (EGT), a multi-objective framework that improves interpretability and consistency in early-exit networks through attention-based regularization. EGT introduces an attention consistency loss that aligns early-exit attention maps with the final exit. The framework jointly optimizes classification accuracy and attention consistency through a weighted combination of losses. Experiments on a real-world image classification dataset demonstrate that EGT achieves up to 98.97% overall accuracy (matching baseline performance) with a 1.97x inference speedup through early exits, while…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Advanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis
