Mitigating covariate shift in non-colocated data with learned parameter priors
Behraj Khan, Behroz Mirza, Nouman Durrani, Tahir Syed

TL;DR
This paper introduces FIcsR, a method to reduce covariate shift in distributed data by learning parameter priors, improving model accuracy across multiple datasets and cross-validation settings.
Contribution
The paper proposes FIcsR, a novel approach that minimizes f-divergence using learned priors to mitigate covariate shift in distributed data scenarios.
Findings
Improves accuracy by over 5% compared to batch state-of-the-art.
Reduces degradation under covariate shift in extensive experiments.
Effective across multiple datasets and cross-validation configurations.
Abstract
When training data are distributed across{ time or space,} covariate shift across fragments of training data biases cross-validation, compromising model selection and assessment. We present \textit{Fragmentation-Induced covariate-shift Remediation} (), which minimizes an -divergence between a fragment's covariate distribution and that of the standard cross-validation baseline. We s{how} an equivalence with popular importance-weighting methods. {The method}'s numerical solution poses a computational challenge owing to the overparametrized nature of a neural network, and we derive a Fisher Information approximation. When accumulated over fragments, this provides a global estimate of the amount of shift remediation thus far needed, and we incorporate that as a prior via the minimization objective. In the paper, we run extensive classification experiments on multiple data classes,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Gaussian Processes and Bayesian Inference
MethodsSparse Evolutionary Training
