Environment Invariant Linear Least Squares
Jianqing Fan, Cong Fang, Yihong Gu, Tong Zhang

TL;DR
This paper introduces a novel environment invariant linear least squares (EILLS) method for multi-environment linear regression that leverages invariance across environments to accurately estimate true parameters and perform variable selection.
Contribution
It proposes the first statistically efficient invariance learning approach in the general linear model, applicable without structural assumptions, and provides non-asymptotic error bounds and variable selection consistency.
Findings
EILLS achieves near-minimal identification conditions.
The method provides non-asymptotic $\,\ell_2$ error bounds.
The $\,\ell_0$ penalized EILLS attains variable selection consistency.
Abstract
This paper considers a multi-environment linear regression model in which data from multiple experimental settings are collected. The joint distribution of the response variable and covariates may vary across different environments, yet the conditional expectations of given the unknown set of important variables are invariant. Such a statistical model is related to the problem of endogeneity, causal inference, and transfer learning. The motivation behind it is illustrated by how the goals of prediction and attribution are inherent in estimating the true parameter and the important variable set. We construct a novel environment invariant linear least squares (EILLS) objective function, a multi-environment version of linear least-squares regression that leverages the above conditional expectation invariance structure and heterogeneity among different environments to determine the true…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Spectroscopy and Chemometric Analyses · Machine Learning and ELM
MethodsLinear Regression
