Sparse matrix linear models for structured high-throughput data

Jane W. Liang; Saunak Sen

arXiv:1712.05767·stat.CO·March 1, 2021

Sparse matrix linear models for structured high-throughput data

Jane W. Liang, Saunak Sen

PDF

Open Access 2 Repos

TL;DR

This paper introduces fast, scalable methods for fitting sparse matrix linear models to high-throughput biological data, enabling efficient analysis of large structured datasets with covariates on both rows and columns.

Contribution

It develops novel algorithms leveraging matrix properties for sparse estimation in large matrix linear models, addressing computational challenges in high-dimensional data.

Findings

01

Algorithms perform well on simulated data.

02

Effective analysis of biological datasets demonstrated.

03

Methods are implemented in Julia for accessibility.

Abstract

Recent technological advancements have led to the rapid generation of high-throughput biological data, which can be used to address novel scientific questions in broad areas of research. These data can be thought of as a large matrix with covariates annotating both rows and columns of this matrix. Matrix linear models provide a convenient way for modeling such data. In many situations, sparse estimation of these models is desired. We present fast, general methods for fitting sparse matrix linear models to structured high-throughput data. We induce model sparsity using an L $_{1}$ penalty and consider the case when the response matrix and the covariate matrices are large. Due to data size, standard methods for estimation of these penalized regression models fail if the problem is converted to the corresponding univariate regression scenario. By leveraging matrix properties in the structure…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGene expression and cancer classification · Face and Expression Recognition · Genetic Mapping and Diversity in Plants and Animals