bioSBM: a random graph model to integrate epigenomic data in chromatin structure prediction
Alex Chen Yi Zhang, Angelo Rosa, Guido Sanguinetti

TL;DR
bioSBM is a graph-based model that captures chromatin interaction patterns from Hi-C data and links them to biochemical features, enabling accurate genome-wide predictions of chromatin contacts across different cell lines.
Contribution
The paper introduces bioSBM, a novel generative graph model that integrates epigenomic data with chromatin structure prediction, revealing biological communities and enabling cross-cell-line predictions.
Findings
Identified 7 chromatin communities aligning with biological annotations.
Mapped biochemical features to graph parameters for genome-wide predictions.
Successfully predicted chromatin contact maps in unseen cell lines.
Abstract
The spatial organization of chromatin within the nucleus plays a crucial role in gene expression and genome function. However, the quantitative relationship between this organization and nuclear biochemical processes remains under debate. In this study, we present a graph-based generative model, bioSBM, designed to capture long-range chromatin interaction patterns from Hi-C data and, importantly, simultaneously link these patterns to biochemical features. Applying bioSBM to Hi-C maps of the GM12878 lymphoblastoid cell line, we identified a latent structure of chromatin interactions, revealing 7 distinct communities that strongly align with known biological annotations. Additionally, we infer a linear transformation that maps biochemical observables, such as histone marks, to the parameters of the generative graph model, enabling accurate genome-wide predictions of chromatin contact maps…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Chromatin Dynamics · Bioinformatics and Genomic Networks · Gene expression and cancer classification
MethodsALIGN
