Attention Consistency Regularization for Interpretable Early-Exit Neural Networks

Yanhua Zhao

arXiv:2601.08891·cs.LG·February 5, 2026

Attention Consistency Regularization for Interpretable Early-Exit Neural Networks

Yanhua Zhao

PDF

Open Access

TL;DR

This paper introduces Explanation-Guided Training (EGT), a method that enhances interpretability and consistency in early-exit neural networks by aligning attention maps, achieving high accuracy and faster inference while improving explainability.

Contribution

The paper proposes an attention consistency regularization framework for early-exit neural networks, improving interpretability and trustworthiness without sacrificing accuracy.

Findings

01

Achieves up to 98.97% accuracy with early exits

02

Provides 1.97x faster inference speed

03

Improves attention consistency by up to 18.5%

Abstract

Early-exit neural networks enable adaptive inference by allowing predictions at intermediate layers, reducing computational cost. However, early exits often lack interpretability and may focus on different features than deeper layers, limiting trust and explainability. This paper presents Explanation-Guided Training (EGT), a multi-objective framework that improves interpretability and consistency in early-exit networks through attention-based regularization. EGT introduces an attention consistency loss that aligns early-exit attention maps with the final exit. The framework jointly optimizes classification accuracy and attention consistency through a weighted combination of losses. Experiments on a real-world image classification dataset demonstrate that EGT achieves up to 98.97% overall accuracy (matching baseline performance) with a 1.97x inference speedup through early exits, while…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Advanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis