Mitigating covariate shift in non-colocated data with learned parameter   priors

Behraj Khan; Behroz Mirza; Nouman Durrani; Tahir Syed

arXiv:2411.06499·cs.LG·November 12, 2024

Mitigating covariate shift in non-colocated data with learned parameter priors

Behraj Khan, Behroz Mirza, Nouman Durrani, Tahir Syed

PDF

Open Access

TL;DR

This paper introduces FIcsR, a method to reduce covariate shift in distributed data by learning parameter priors, improving model accuracy across multiple datasets and cross-validation settings.

Contribution

The paper proposes FIcsR, a novel approach that minimizes f-divergence using learned priors to mitigate covariate shift in distributed data scenarios.

Findings

01

Improves accuracy by over 5% compared to batch state-of-the-art.

02

Reduces degradation under covariate shift in extensive experiments.

03

Effective across multiple datasets and cross-validation configurations.

Abstract

When training data are distributed across{ time or space,} covariate shift across fragments of training data biases cross-validation, compromising model selection and assessment. We present \textit{Fragmentation-Induced covariate-shift Remediation} ( $F I cs R$ ), which minimizes an $f$ -divergence between a fragment's covariate distribution and that of the standard cross-validation baseline. We s{how} an equivalence with popular importance-weighting methods. {The method}'s numerical solution poses a computational challenge owing to the overparametrized nature of a neural network, and we derive a Fisher Information approximation. When accumulated over fragments, this provides a global estimate of the amount of shift remediation thus far needed, and we incorporate that as a prior via the minimization objective. In the paper, we run extensive classification experiments on multiple data classes,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Gaussian Processes and Bayesian Inference

MethodsSparse Evolutionary Training