CTRL Your Shift: Clustered Transfer Residual Learning for Many Small Datasets
Gauri Jain, Dominik Rothenh\"ausler, Kirk Bansak, Elisabeth Paulson

TL;DR
This paper presents CTRL, a meta-learning approach designed to improve predictive accuracy and source-specific differentiation in machine learning tasks involving many small, heterogeneous datasets with distributional shifts.
Contribution
The paper introduces CTRL, a novel meta-learning method that effectively clusters and transfers residuals across sources, enabling better accuracy and heterogeneity preservation in small, diverse datasets.
Findings
CTRL outperforms state-of-the-art benchmarks on large-scale datasets.
High-quality clusters can be learned efficiently without repeated refitting.
CTRL maintains source-level heterogeneity while improving overall accuracy.
Abstract
Machine learning (ML) tasks often utilize large-scale data that is drawn from several distinct sources, such as different locations, treatment arms, or groups. In such settings, practitioners often desire predictions that not only exhibit good overall accuracy, but also remain reliable within each source and preserve the differences that matter across sources. For instance, several asylum and refugee resettlement programs now use ML-based employment predictions to guide where newly arriving families are placed within a host country, which requires generating informative and differentiated predictions for many and often small source locations. However, this task is made challenging by several common characteristics of the data in these settings: the presence of numerous distinct data sources, distributional shifts between them, and substantial variation in sample sizes across sources.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMigration, Health and Trauma · Domain Adaptation and Few-Shot Learning · Topic Modeling
