Understanding new tasks through the lens of training data via exponential tilting

Subha Maity; Mikhail Yurochkin; Moulinath Banerjee; Yuekai Sun

arXiv:2205.13577 · cs.LG · February 22, 2023 · 1 citation

TL;DR

This paper introduces a method that reweights training data using exponential tilting so that it better represents a new target task. The learned weights can then be used to evaluate, fine-tune, and select models for that task.

Contribution

It proposes an exponential-tilt-based reweighting approach that estimates the target task distribution from labeled training data and unlabeled target data, aiding performance evaluation and model adaptation.

Findings

01 Effective reweighting on the Waterbirds benchmark

02 Improved target performance estimation

03 Facilitates model fine-tuning and selection

Abstract

Deploying machine learning models to new tasks is a major challenge despite the large size of modern training datasets. However, it is conceivable that the training data can be reweighted to be more representative of the new (target) task. We consider the problem of reweighting the training samples to gain insights into the distribution of the target task. Specifically, we formulate a distribution shift model based on the exponential tilt assumption and learn training-data importance weights that minimize the KL divergence between the labeled training and unlabeled target datasets. The learned training-data weights can then be used for downstream tasks such as target performance evaluation, fine-tuning, and model selection. We demonstrate the efficacy of our method on the Waterbirds and Breeds benchmarks.
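To make the reweighting idea concrete, here is a minimal sketch of how exponential-tilt importance weights might be computed and then used to estimate target accuracy from labeled training data. It is not the authors' implementation. The class-conditional weight form exp(theta_y · T(x) + alpha_y), the names `tilt_weights` and `weighted_accuracy`, and the feature map `features` are all assumptions made for illustration. The tilt parameters are taken as given; the fitting step that minimizes the KL divergence is not shown.

```python
import numpy as np

def tilt_weights(features, labels, theta, alpha):
    """Self-normalized exponential-tilt weights for labeled training samples.

    Assumed (hypothetical) weight form: w_i proportional to
    exp(theta[y_i] . T(x_i) + alpha[y_i]).

    features: (n, d) array holding T(x_i) for each training sample
    labels:   (n,) integer class labels in [0, K)
    theta:    (K, d) per-class tilt parameters (assumed already fitted)
    alpha:    (K,) per-class offsets (assumed already fitted)
    """
    # Row-wise dot product between each sample's features and its class's theta.
    logits = np.einsum("nd,nd->n", features, theta[labels]) + alpha[labels]
    logits = logits - logits.max()  # subtract the max for numerical stability
    w = np.exp(logits)
    return w / w.sum()              # normalize so the weights sum to 1

def weighted_accuracy(weights, y_true, y_pred):
    """Importance-weighted accuracy, used as an estimate of target accuracy."""
    return float(np.sum(weights * (y_true == y_pred)))

# Toy usage: no feature tilt, class 1 upweighted by a factor of 3.
T = np.ones((6, 2))
y = np.array([0, 1, 0, 1, 0, 1])
theta = np.zeros((2, 2))
alpha = np.array([0.0, np.log(3.0)])
w = tilt_weights(T, y, theta, alpha)
acc = weighted_accuracy(w, y, np.ones(6, dtype=int))  # classifier that always predicts 1
```

In this toy setup the weights only shift the class balance, so a classifier that always predicts class 1 scores higher under the weights than its unweighted training accuracy of 0.5. The same weights could be passed as per-sample loss weights for fine-tuning, or used to compare candidate models by weighted accuracy.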

Code & Models

Repositories

smaityumich/exponential-tilting (PyTorch, official)

Videos

Understanding new tasks through the lens of training data via exponential tilting · SlidesLive

Taxonomy

Topics: Gaussian Processes and Bayesian Inference · Neural Networks and Applications · Domain Adaptation and Few-Shot Learning