Multi-Platform Multivariate Regression with Group Sparsity for High-Dimensional Data Integration
Shanshan Qin, Guanlin Zhang, Xin Gao, Yuehua Wu

TL;DR
This paper introduces a new regression model that integrates data from multiple platforms to improve prediction accuracy and interpretability in high-dimensional settings.
Contribution
The novel MM-HLR model combines Lasso and group Lasso penalties to handle multivariate responses and cross-platform data integration.
Findings
The MM-HLR model achieves low bias and small variance in simulations.
The method is robust across various data dimensions and performs well on real financial data.
Theoretical guarantees include oracle bounds on prediction error and support recovery.
Abstract
High-dimensional regression with multivariate responses poses significant challenges when data are collected across multiple platforms, each with potentially correlated outcomes. In this paper, we introduce a multi-platform multivariate high-dimensional linear regression (MM-HLR) model for simultaneously modeling within-platform correlation and cross-platform information fusion. Our approach incorporates a mixture of Lasso and group Lasso penalties to promote both individual predictor sparsity and cross-platform group sparsity, thereby enhancing interpretability and estimation stability. We develop an efficient computational algorithm based on iteratively reweighted least squares and block coordinate descent to solve the resulting regularized optimization problem. We establish theoretical guarantees for our estimator, including oracle bounds on prediction error, estimation accuracy, and…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Advanced Causal Inference Techniques · Imbalanced Data Classification Techniques
