E2-AEN: End-to-End Incremental Learning with Adaptively Expandable Network
Guimei Cao, Zhanzhan Cheng, Yunlu Xu, Duo Li, Shiliang Pu, Yi Niu and, Fei Wu

TL;DR
E2-AEN is an end-to-end trainable adaptive network that dynamically generates lightweight structures for new tasks in incremental learning, effectively avoiding catastrophic forgetting and reducing computational costs.
Contribution
The paper introduces E2-AEN, a novel end-to-end framework with adaptive gating and regularization for incremental learning, eliminating the need for multi-stage structure search.
Findings
Achieves state-of-the-art results on CIFAR, VDD, COCO, VOC, and SSLAD benchmarks.
Effectively prevents catastrophic forgetting while maintaining high accuracy.
Reduces computational cost through adaptive pruning strategies.
Abstract
Expandable networks have demonstrated their advantages in dealing with catastrophic forgetting problem in incremental learning. Considering that different tasks may need different structures, recent methods design dynamic structures adapted to different tasks via sophisticated skills. Their routine is to search expandable structures first and then train on the new tasks, which, however, breaks tasks into multiple training stages, leading to suboptimal or overmuch computational cost. In this paper, we propose an end-to-end trainable adaptively expandable network named E2-AEN, which dynamically generates lightweight structures for new tasks without any accuracy drop in previous tasks. Specifically, the network contains a serial of powerful feature adapters for augmenting the previously learned representations to new tasks, and avoiding task interference. These adapters are controlled via…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Machine Learning and ELM
MethodsPruning
