Stable Learning via Sample Reweighting

Zheyan Shen; Peng Cui; Tong Zhang; Kun Kuang

arXiv:1911.12580·cs.LG·December 2, 2019·6 cites

Stable Learning via Sample Reweighting

Zheyan Shen, Peng Cui, Tong Zhang, Kun Kuang

PDF

Open Access

TL;DR

This paper introduces a sample reweighting technique to reduce collinearity in linear models, enhancing stability and robustness when training and test data distributions differ.

Contribution

The paper proposes a novel data pretreatment method that improves the condition of the design matrix, applicable with any standard learning algorithm.

Findings

01

Improved stability of predictions across different data distributions.

02

Effective reduction of collinearity among input variables.

03

Demonstrated success on both simulated and real datasets.

Abstract

We consider the problem of learning linear prediction models with model misspecification bias. In such case, the collinearity among input variables may inflate the error of parameter estimation, resulting in instability of prediction results when training and test distributions do not match. In this paper we theoretically analyze this fundamental problem and propose a sample reweighting method that reduces collinearity among input variables. Our method can be seen as a pretreatment of data to improve the condition of design matrix, and it can then be combined with any standard learning method for parameter estimation and variable selection. Empirical studies on both simulation and real datasets demonstrate the effectiveness of our method in terms of more stable performance across different distributed data.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Neural Networks and Applications · Machine Learning and ELM

MethodsTest