Adaptive Adapter Routing for Long-Tailed Class-Incremental Learning

Zhi-Hong Qi; Da-Wei Zhou; Yiran Yao; Han-Jia Ye; De-Chuan Zhan

arXiv:2409.07446·cs.LG·September 12, 2024

Adaptive Adapter Routing for Long-Tailed Class-Incremental Learning

Zhi-Hong Qi, Da-Wei Zhou, Yiran Yao, Han-Jia Ye, De-Chuan Zhan

PDF

Open Access 1 Repo

TL;DR

This paper introduces APART, a novel exemplar-free method leveraging adaptive adapter routing in pre-trained models to address long-tailed class-incremental learning, effectively mitigating forgetting and class imbalance without retraining classifiers.

Contribution

The paper proposes a new adapter-based framework with adaptive routing and adapter pools for LTCIL, avoiding retraining classifiers and improving handling of minority classes.

Findings

01

APART outperforms existing methods on benchmark datasets.

02

It effectively mitigates catastrophic forgetting.

03

The approach enhances minority class recognition.

Abstract

In our ever-evolving world, new data exhibits a long-tailed distribution, such as e-commerce platform reviews. This necessitates continuous model learning imbalanced data without forgetting, addressing the challenge of long-tailed class-incremental learning (LTCIL). Existing methods often rely on retraining linear classifiers with former data, which is impractical in real-world settings. In this paper, we harness the potent representation capabilities of pre-trained models and introduce AdaPtive Adapter RouTing (APART) as an exemplar-free solution for LTCIL. To counteract forgetting, we train inserted adapters with frozen pre-trained weights for deeper adaptation and maintain a pool of adapters for selection during sequential model updates. Additionally, we present an auxiliary adapter pool designed for effective generalization, especially on minority classes. Adaptive instance routing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vita-qzh/apart
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText and Document Classification Technologies · Domain Adaptation and Few-Shot Learning · Machine Learning and ELM

MethodsAdapter