Modular Regression: Improving Linear Models by Incorporating Auxiliary Data
Ying Jin, Dominik Rothenh\"ausler

TL;DR
This paper introduces modular regression, a framework that enhances linear models by decomposing tasks and integrating auxiliary data, leading to improved estimation and prediction accuracy in various settings.
Contribution
The paper proposes a novel modular regression framework that leverages auxiliary information to improve linear model estimation, extending to high-dimensional and missing-data scenarios.
Findings
Improves estimation efficiency and prediction accuracy over standard linear regression and Lasso.
Effective in both low-dimensional and high-dimensional settings, including missing-data cases.
Demonstrated success on simulated and real datasets.
Abstract
This paper develops a new framework, called modular regression, to utilize auxiliary information -- such as variables other than the original features or additional data sets -- in the training process of linear models. At a high level, our method follows the routine: (i) decomposing the regression task into several sub-tasks, (ii) fitting the sub-task models, and (iii) using the sub-task models to provide an improved estimate for the original regression problem. This routine applies to widely-used low-dimensional (generalized) linear models and high-dimensional regularized linear regression. It also naturally extends to missing-data settings where only partial observations are available. By incorporating auxiliary information, our approach improves the estimation efficiency and prediction accuracy upon linear regression or the Lasso under a conditional independence assumption for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Statistical Methods and Inference · Gaussian Processes and Bayesian Inference
