TL;DR
This paper introduces Adaptive Aggregation Networks (AANets), a novel architecture for class-incremental learning that dynamically balances stability and plasticity by aggregating residual blocks, improving performance across multiple benchmarks.
Contribution
The paper proposes a new network architecture with adaptive residual block aggregation to better handle stability-plasticity trade-offs in CIL.
Findings
AANets improve performance on CIFAR-100, ImageNet-Subset, and ImageNet benchmarks.
Existing CIL methods can be enhanced by integrating AANets architecture.
Dynamic aggregation weights effectively balance stability and plasticity.
Abstract
Class-Incremental Learning (CIL) aims to learn a classification model with the number of classes increasing phase-by-phase. An inherent problem in CIL is the stability-plasticity dilemma between the learning of old and new classes, i.e., high-plasticity models easily forget old classes, but high-stability models are weak to learn new classes. We alleviate this issue by proposing a novel network architecture called Adaptive Aggregation Networks (AANets), in which we explicitly build two types of residual blocks at each residual level (taking ResNet as the baseline architecture): a stable block and a plastic block. We aggregate the output feature maps from these two blocks and then feed the results to the next-level blocks. We adapt the aggregation weights in order to balance these two types of blocks, i.e., to balance stability and plasticity, dynamically. We conduct extensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methods1x1 Convolution · Average Pooling · Batch Normalization · Residual Connection · *Communicated@Fast*How Do I Communicate to Expedia? · Max Pooling · Convolution · Bottleneck Residual Block · Residual Block · Global Average Pooling
