Cumulative Meta-Learning from Active Learning Queries for Robustness to Spurious Correlations
Kin Whye Chew, Jingxian Wang

TL;DR
This paper introduces CAML, a meta-learning framework for active learning that cumulatively refines the model's inductive bias to improve robustness against spurious correlations in datasets.
Contribution
CAML uniquely leverages sequential dependence in active learning rounds to meta-learn a cumulative inductive bias, enhancing model robustness and minority-group accuracy.
Findings
CAML improves minority-group accuracy by up to 29.9% on Waterbirds.
CAML achieves up to 27.8% gains on Dominoes.
CAML enhances robustness across various benchmarks and acquisition strategies.
Abstract
Spurious correlations in real-world datasets cause machine learning models to rely on irrelevant patterns, undermining reliability, generalization, and fairness. Active learning offers a promising way to address this failure mode by querying informative samples that distinguish core features from spurious ones. However, standard active-learning methods simply append queried examples to the labeled set, effectively updating only the likelihood term. In deep learning regimes, the influence of these informative samples can be diluted by the larger labeled set and memorized by overparameterized models. We propose Cumulative Active Meta-Learning (CAML), an active-learning framework that uses queried examples to meta-learn the prior, or inductive bias, governing how the model adapts. CAML casts each active-learning round as a meta-learning task: the current labeled set serves as meta-train…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
