Finding Groups of Cross-Correlated Features in Bi-View Data
Miheer Dewaskar, John Palowitch, Mark He, Michael I. Love, Andrew B., Nobel

TL;DR
This paper introduces a new iterative-testing method called BSP for identifying stable groups of cross-correlated features across different data types, effectively revealing biologically meaningful subnetworks in complex datasets.
Contribution
The paper presents BSP, a novel iterative-testing approach that improves detection of stable bimodules of cross-correlated features, outperforming existing methods in accuracy and false discovery control.
Findings
BSP outperforms existing methods in recovering true bimodules.
BSP identified biologically meaningful SNP-gene subnetworks.
Application to GTEx data revealed significant genomic subnetworks.
Abstract
Datasets in which measurements of two (or more) types are obtained from a common set of samples arise in many scientific applications. A common problem in the exploratory analysis of such data is to identify groups of features of different data types that are strongly associated. A bimodule is a pair (A,B) of feature sets from two data types such that the aggregate cross-correlation between the features in A and those in B is large. A bimodule (A,B) is stable if A coincides with the set of features that have significant aggregate correlation with the features in B, and vice-versa. This paper proposes an iterative-testing based bimodule search procedure (BSP) to identify stable bimodules. Compared to existing methods for detecting cross-correlated features, BSP was the best at recovering true bimodules with sufficient signal, while limiting the false discoveries. In addition, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Genetic Mapping and Diversity in Plants and Animals · Genetic and phenotypic traits in livestock
