Balancing Stability and Plasticity in Sequentially Trained Early-Exiting Neural Networks

Alaa Zniber; Ouassim Karrakchou; Mounir Ghogho

arXiv:2605.05358·cs.LG·May 8, 2026

Balancing Stability and Plasticity in Sequentially Trained Early-Exiting Neural Networks

Alaa Zniber, Ouassim Karrakchou, Mounir Ghogho

PDF

TL;DR

This paper introduces methods to improve sequential training of early-exiting neural networks by balancing stability and plasticity, leading to better accuracy and efficiency.

Contribution

It proposes two novel approaches, Elastic Weight Consolidation and Learning without Forgetting, to mitigate interference in sequentially trained early-exit neural networks.

Findings

01

Enhanced accuracy of early exits over existing methods

02

Achieved significant speedups at low computational budgets

03

Consistent performance improvements across benchmarks

Abstract

Early-exiting neural networks enable adaptive inference by allowing inputs to exit at intermediate classifiers, reducing computation for easy samples while maintaining high accuracy. In practice, exits can be trained sequentially by incrementally adding them to a shared backbone; however, this sequential training can cause newly introduced exits to interfere with previously learned ones, degrading the performance of earlier classifiers. We address this problem by retaining the knowledge embedded in existing exits while allowing new ones to specialize. We propose two alternative approaches that operate at different levels of the model. The first constrains learning by protecting parameters that are important for previously trained exits, while the second preserves the output distributions of earlier exits as the network adapts. These alternatives directly reflect the stability-plasticity…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.