PANDA -- Patch And Distribution-Aware Augmentation for Long-Tailed Exemplar-Free Continual Learning
Siddeshwar Raghavan, Jiangpeng He, Fengqing Zhu

TL;DR
PANDA is a novel augmentation framework designed for exemplar-free continual learning that addresses data imbalance issues by enhancing low-frequency classes and balancing task distributions, thereby improving model accuracy and reducing forgetting.
Contribution
PANDA introduces a patch-based augmentation and adaptive balancing strategy that effectively mitigates intra-task and inter-task data imbalances in PTM-based EFCL.
Findings
PANDA improves accuracy across multiple EFCL benchmarks.
PANDA reduces catastrophic forgetting in continual learning.
PANDA seamlessly integrates with existing PTM-based methods.
Abstract
Exemplar-Free Continual Learning (EFCL) restricts the storage of previous task data and is highly susceptible to catastrophic forgetting. While pre-trained models (PTMs) are increasingly leveraged for EFCL, existing methods often overlook the inherent imbalance of real-world data distributions. We discovered that real-world data streams commonly exhibit dual-level imbalances, dataset-level distributions combined with extreme or reversed skews within individual tasks, creating both intra-task and inter-task disparities that hinder effective learning and generalization. To address these challenges, we propose PANDA, a Patch-and-Distribution-Aware Augmentation framework that integrates seamlessly with existing PTM-based EFCL methods. PANDA amplifies low-frequency classes by using a CLIP encoder to identify representative regions and transplanting those into frequent-class samples within…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning in Healthcare · Generative Adversarial Networks and Image Synthesis
