Computational tools for the multiscale analysis of Hi-C data in bacterial chromosomes
Nelle Varoquaux, Virginia S. Lioy, Fr\'ed\'eric Boccard, Ivan, Junier

TL;DR
This paper introduces a multiscale analysis framework for bacterial Hi-C data, combining standard and novel tools, with accessible code and statistical assessment, demonstrated on Pseudomonas aeruginosa.
Contribution
It provides a comprehensive Python-based toolkit for multiscale bacterial chromosome analysis, including novel indices and statistical validation methods.
Findings
Identification of nested interaction domains in bacterial chromosomes
Development of a scale-independent locus involvement index
Accessible tools and statistical methods for Hi-C data analysis
Abstract
Just as in eukaryotes, high-throughput chromosome conformation capture (Hi-C) data have revealed nested organizations of bacterial chromosomes into overlapping interaction domains. In this chapter, we present a multiscale analysis framework aiming at capturing and quantifying these properties. These include both standard tools (e.g. contact laws) and novel ones such as an index that allows identifying loci involved in domain formation independently of the structuring scale at play. Our objective is two-fold. On the one hand, we aim at providing a full, understandable Python/Jupyter-based code which can be used by both computer scientists as well as biologists with no advanced computational background. On the other hand, we discuss statistical issues inherent to Hi-C data analysis, focusing more particularly on how to properly assess the statistical significance of results. As a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
