jahmm: A tool for discretizing multiple ChIP seq profiles
Guillaume Filion, Pol Cusc\'o

TL;DR
jahmm is an R package that uses a Hidden Markov Model with mixture negative multinomial emissions to discretize multiple ChIP-seq profiles, enabling consistent comparison and merging of datasets from different experiments.
Contribution
It introduces a novel HMM-based method for discretizing multiple ChIP-seq profiles simultaneously, resolving conflicts among replicates and datasets.
Findings
Effective discretization across diverse features
Enables merging datasets from different sources
Includes a quality control step for discretization validation
Abstract
Chromatin immunoprecipitation and high throughput sequencing (ChIP-seq) is the de facto standard method to map chromatin features on genomes. The output of ChIP-seq is quantitative within a single genome-wide profile, but there is no natural way to compare experiments, which is why the data is often discretized as present/absent calls. Many tools perform this task efficiently, however they process a single input at a time, which produces discretization conflicts among replicates. Here we present the implementation of a Hidden Markov Model (HMM) using mixture negative multinomial emissions to discretize ChIP-seq profiles. The method gives meaningful discretization for a wide range of features and allows to merge datasets from different origins into a single discretized profile, which resolves discretization conflicts. A quality control step performed after the discretization accepts or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Chromatin Dynamics · Genetic and phenotypic traits in livestock · Genomics and Phylogenetic Studies
