Memory Efficient Meta-Learning with Large Images
John Bronskill, Daniela Massiceti, Massimiliano Patacchiola, Katja, Hofmann, Sebastian Nowozin, Richard E. Turner

TL;DR
This paper introduces LITE, a memory-efficient meta-training scheme that enables large-image meta-learning on a single GPU by approximating gradients with a subset of images, achieving state-of-the-art results.
Contribution
LITE allows memory-efficient meta-training on large images using gradient subset approximation, enabling high performance on real-world benchmarks with minimal hardware.
Findings
Achieves state-of-the-art accuracy on ORBIT benchmark.
Outperforms existing meta-learners on VTAB+MD benchmarks.
Enables training on large tasks with a single GPU.
Abstract
Meta learning approaches to few-shot classification are computationally efficient at test time, requiring just a few optimization steps or single forward pass to learn a new task, but they remain highly memory-intensive to train. This limitation arises because a task's entire support set, which can contain up to 1000 images, must be processed before an optimization step can be taken. Harnessing the performance gains offered by large images thus requires either parallelizing the meta-learner across multiple GPUs, which may not be available, or trade-offs between task and image size when memory constraints apply. We improve on both options by proposing LITE, a general and memory efficient episodic training scheme that enables meta-training on large tasks composed of large images on a single GPU. We achieve this by observing that the gradients for a task can be decomposed into a sum of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications
