Structured Input-Output Lasso, with Application to eQTL Mapping, and a Thresholding Algorithm for Fast Estimation
Seunghak Lee, Eric P. Xing

TL;DR
This paper introduces a structured input-output lasso model for high-dimensional multi-task regression, specifically applied to eQTL mapping, along with a fast hierarchical group thresholding algorithm for efficient estimation, improving the discovery of genetic influences on gene expression.
Contribution
The paper presents a novel structured input-output lasso model and an efficient hierarchical group thresholding algorithm tailored for high-dimensional eQTL mapping problems.
Findings
Model outperforms existing methods in recovering structured sparse relationships.
Algorithm significantly faster and more effective than other optimization techniques.
Demonstrated success on both simulated data and real yeast eQTL dataset.
Abstract
We consider the problem of learning a high-dimensional multi-task regression model, under sparsity constraints induced by presence of grouping structures on the input covariates and on the output predictors. This problem is primarily motivated by expression quantitative trait locus (eQTL) mapping, of which the goal is to discover genetic variations in the genome (inputs) that influence the expression levels of multiple co-expressed genes (outputs), either epistatically, or pleiotropically, or both. A structured input-output lasso (SIOL) model based on an intricate l1/l2-norm penalty over the regression coefficient matrix is employed to enable discovery of complex sparse input/output relationships; and a highly efficient new optimization algorithm called hierarchical group thresholding (HiGT) is developed to solve the resultant non-differentiable, non-separable, and ultra…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsControl Systems and Identification · Fault Detection and Control Systems · Statistical Methods and Inference
