# Accurate chromatin marks peak calling with Omnipeak

**Authors:** Oleg Shpynov, Maxim N Artyomov

PMC · DOI: 10.1093/nar/gkaf1454 · Nucleic Acids Research · 2026-01-09

## TL;DR

Omnipeak is a new tool for accurately identifying chromatin marks in sequencing data, handling various peak lengths and data qualities better than existing methods.

## Contribution

Omnipeak introduces a universal peak-calling algorithm using a hidden Markov model to handle diverse ChIP-seq and ATAC-seq data.

## Key findings

- Omnipeak consistently identifies narrow and broad peaks across different data types and qualities.
- It outperforms existing tools in replicate agreement and robustness to noise and missing control tracks.
- The tool supports multiple input formats and integrates with genome browsers for practical use.

## Abstract

Chromatin immunoprecipitation sequencing (ChIP-seq) is a widely used technique for identifying transcription-factor binding and histone modifications across the genome. Peaks in ChIP-seq data vary in length depending on the biological context, from narrow transcription factor binding sites to broad histone modification domains. However, commonly used peak-calling tools are tailored to the specific types of data and struggle to consistently handle various peak lengths, datasets of varying quality, and missing control tracks—common issues in comparative or meta-analyses. These limitations are addressed with Omnipeak, a universal unsupervised peak-calling algorithm based on a constrained three-state hidden Markov model. Omnipeak accurately models global genomic read coverage, capturing structure patterns in the data of all length scales and variable quality. We benchmarked Omnipeak versus eight different peak calling methods using over 550 public and 300 synthetic datasets, including conventional, ultra-low-input ChIP-seq, and ATAC-seq. Omnipeak produced consistent peaks across narrow, broad, and variable mark lengths, with the best agreement between replicates and robustness against noise and lack of control tracks. Together with a variety of supported input formats and peak calling capabilities within the genome browser, Omnipeak is well-positioned for processing various ChIP-seq and ATAC-seq datasets.

Graphical Abstract

## Full-text entities

- **Genes:** LGR5 (leucine rich repeat containing G protein-coupled receptor 5) [NCBI Gene 8549] {aka FEX, GPR49, GPR67, GRP49, HG38}, CTCF (CCCTC-binding factor) [NCBI Gene 10664] {aka CFAP108, FAP108, MRD21}, XCL1 (X-C motif chemokine ligand 1) [NCBI Gene 6375] {aka ATAC, LPTN, LTN, SCM-1, SCM-1a, SCM1}, F3 (coagulation factor III, tissue factor) [NCBI Gene 2152] {aka CD142, TF, TFA}, Dnase1 (deoxyribonuclease I) [NCBI Gene 13419] {aka DNaseI, Dnl1}, CD14 (CD14 molecule) [NCBI Gene 929]
- **Diseases:** ENCODE (MESH:C564021)
- **Chemicals:** DHS (-)
- **Species:** Homo sapiens (human, species) [taxon 9606], Mus musculus (house mouse, species) [taxon 10090]
- **Cell lines:** GM12878 — Homo sapiens (Human), Transformed cell line (CVCL_7526), K562 — Homo sapiens (Human), Blast phase chronic myelogenous leukemia, BCR-ABL1 positive, Cancer cell line (CVCL_0004), H1 — Homo sapiens (Human), Induced pluripotent stem cell (CVCL_HA53)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12784980/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12784980/full.md

## References

53 references — full list in the complete paper: https://tomesphere.com/paper/PMC12784980/full.md

---
Source: https://tomesphere.com/paper/PMC12784980