Improving Few-Shot Generalization by Exploring and Exploiting Auxiliary Data
Alon Albalak, Colin Raffel, William Yang Wang

TL;DR
This paper introduces scalable algorithms for few-shot learning with auxiliary data, leveraging explore-exploit strategies to improve generalization and outperform existing methods, including large language models like GPT-3.
Contribution
We develop two algorithms, EXP3-FLAD and UCB1-FLAD, that scale independently of auxiliary dataset count, enhancing few-shot learning performance.
Findings
Our methods outperform prior FLAD approaches by 4%.
Achieved superior results with 3 billion parameter models over GPT-3.
Algorithms scale efficiently to 100x more auxiliary datasets.
Abstract
Few-shot learning is valuable in many real-world applications, but learning a generalizable model without overfitting to the few labeled datapoints is challenging. In this work, we focus on Few-shot Learning with Auxiliary Data (FLAD), a training paradigm that assumes access to auxiliary data during few-shot learning in hopes of improving generalization. Previous works have proposed automated methods for mixing auxiliary and target data, but these methods typically scale linearly (or worse) with the number of auxiliary datasets, limiting their practicality. In this work we relate FLAD to the explore-exploit dilemma that is central to the multi-armed bandit setting and derive algorithms whose computational complexity is independent of the number of auxiliary datasets, allowing us to scale to 100x more auxiliary datasets than prior methods. We propose two algorithms -- EXP3-FLAD and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsCOVID-19 diagnosis using AI · Multimodal Machine Learning Applications · Artificial Intelligence in Healthcare and Education
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · {Dispute@FaQ-s}How to file a dispute with Expedia? · Multi-Head Attention · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Linear Layer · Cosine Annealing · Adam · Weight Decay · Residual Connection
