A Hierarchical Framework for State Space Matrix Inference and Clustering
Chandler Zuo, Kailei Chen, Kyle Hewitt, Emery Bresnick, Sunduz Keles

TL;DR
This paper introduces MBASIC, a hierarchical framework that simultaneously infers hidden activity states and clusters units based on their state profiles across multiple genomic datasets, improving analysis accuracy.
Contribution
MBASIC is a novel integrated framework combining state-space inference and clustering, adaptable to various data distributions and experimental heterogeneity.
Findings
MBASIC accurately recovers underlying states and clusters in simulations.
MBASIC outperforms two-step approaches in genome data analysis.
Application to ENCODE data demonstrates higher data fidelity.
Abstract
In recent years, a large number of genomic and epigenomic studies have been focusing on the integrative analysis of multiple experimental datasets measured over a large number of observational units. The objectives of such studies include not only inferring a hidden state of activity for each unit over individual experiments, but also detecting highly associated clusters of units based on their inferred states. In this paper, we develop the MBASIC (Matrix Based Analysis for State-space Inference and Clustering) framework. MBASIC consists of two parts: state-space mapping and state-space clustering. In state-space mapping, it maps observations onto a finite state-space, representing the activation states of units across conditions. In state-space clustering, MBASIC incorporates a finite mixture model to cluster the units based on their inferred state-space profiles across all conditions.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
