Language-Inspired Relation Transfer for Few-shot Class-Incremental Learning
Yifan Zhao, Jia Li, Zeyin Song, Yonghong Tian

TL;DR
This paper introduces a novel language-inspired relation transfer method for few-shot class-incremental learning, leveraging joint visual and textual information to improve model performance in open-world scenarios.
Contribution
It proposes a new LRT paradigm that transfers text knowledge to visual domains and uses prompt learning and contrastive learning for domain alignment, outperforming existing methods.
Findings
LRT outperforms state-of-the-art models by over 13% on mini-ImageNet.
LRT achieves over 7% improvement on CIFAR-100.
The method effectively mitigates domain gaps and enhances incremental learning performance.
Abstract
Depicting novel classes with language descriptions by observing few-shot samples is inherent in human-learning systems. This lifelong learning capability helps to distinguish new knowledge from old ones through the increase of open-world learning, namely Few-Shot Class-Incremental Learning (FSCIL). Existing works to solve this problem mainly rely on the careful tuning of visual encoders, which shows an evident trade-off between the base knowledge and incremental ones. Motivated by human learning systems, we propose a new Language-inspired Relation Transfer (LRT) paradigm to understand objects by joint visual clues and text depictions, composed of two major steps. We first transfer the pretrained text knowledge to the visual domains by proposing a graph relation transformation module and then fuse the visual and language embedding by a text-vision prototypical fusion module. Second, to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsContrastive Learning · Balanced Selection
