TL;DR
This paper introduces a novel network model for detecting topologically associating domains (TADs) in Hi-C data, accounting for non-exchangeability of genomic positions and incorporating biological covariates, leading to improved TAD identification.
Contribution
The authors propose a new non-exchangeable network model for TAD detection that integrates cell-type specific CTCF binding sites and provides an efficient likelihood optimization method.
Findings
Model accurately detects TADs with high probability when properly initialized.
Method outperforms spectral clustering in simulated data.
Identified TADs show relevant epigenetic features across cell types.
Abstract
Chromosome conformation capture experiments such as Hi-C are used to map the three-dimensional spatial organization of genomes. One specific feature of the 3D organization is known as topologically associating domains (TADs), which are densely interacting, contiguous chromatin regions playing important roles in regulating gene expression. A few algorithms have been proposed to detect TADs. In particular, the structure of Hi-C data naturally inspires application of community detection methods. However, one of the drawbacks of community detection is that most methods take exchangeability of the nodes in the network for granted; whereas the nodes in this case, i.e. the positions on the chromosomes, are not exchangeable. We propose a network model for detecting TADs using Hi-C data that takes into account this non-exchangeability. In addition, our model explicitly makes use of cell-type…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
