Mutual-Information Based Few-Shot Classification
Malik Boudiaf, Ziko Imtiaz Masud, J\'er\^ome Rony, Jose Dolz, Ismail, Ben Ayed, Pablo Piantanida

TL;DR
This paper presents TIM, a transductive method for few-shot learning that maximizes mutual information between query features and labels, achieving significant improvements over state-of-the-art methods across various datasets.
Contribution
TIM introduces a mutual-information maximization approach with a fast alternating-direction solver for transductive few-shot classification, compatible with any feature extractor.
Findings
TIM outperforms state-of-the-art methods by 2-5% accuracy.
It is effective across multiple datasets and challenging scenarios.
The method is modular and can be applied on top of existing feature extractors.
Abstract
We introduce Transductive Infomation Maximization (TIM) for few-shot learning. Our method maximizes the mutual information between the query features and their label predictions for a given few-shot task, in conjunction with a supervision loss based on the support set. We motivate our transductive loss by deriving a formal relation between the classification accuracy and mutual-information maximization. Furthermore, we propose a new alternating-direction solver, which substantially speeds up transductive inference over gradient-based optimization, while yielding competitive accuracy. We also provide a convergence analysis of our solver based on Zangwill's theory and bound-optimization arguments. TIM inference is modular: it can be used on top of any base-training feature extractor. Following standard transductive few-shot settings, our comprehensive experiments demonstrate that TIM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Machine Learning and Data Classification
MethodsTransductive Inference
