Balancing Stability and Plasticity in Sequentially Trained Early-Exiting Neural Networks
Alaa Zniber, Ouassim Karrakchou, Mounir Ghogho

TL;DR
This paper introduces methods to improve sequential training of early-exiting neural networks by balancing stability and plasticity, leading to better accuracy and efficiency.
Contribution
It proposes two novel approaches, Elastic Weight Consolidation and Learning without Forgetting, to mitigate interference in sequentially trained early-exit neural networks.
Findings
Enhanced accuracy of early exits over existing methods
Achieved significant speedups at low computational budgets
Consistent performance improvements across benchmarks
Abstract
Early-exiting neural networks enable adaptive inference by allowing inputs to exit at intermediate classifiers, reducing computation for easy samples while maintaining high accuracy. In practice, exits can be trained sequentially by incrementally adding them to a shared backbone; however, this sequential training can cause newly introduced exits to interfere with previously learned ones, degrading the performance of earlier classifiers. We address this problem by retaining the knowledge embedded in existing exits while allowing new ones to specialize. We propose two alternative approaches that operate at different levels of the model. The first constrains learning by protecting parameters that are important for previously trained exits, while the second preserves the output distributions of earlier exits as the network adapts. These alternatives directly reflect the stability-plasticity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
