# StripePy: fast and robust characterization of architectural stripes

**Authors:** Andrea Raffo, Roberto Rossini, Jonas Paulsen

PMC · DOI: 10.1093/bioinformatics/btaf351 · 2025-06-13

## TL;DR

StripePy is a new tool for identifying architectural stripes in genomic data, offering faster and more reliable analysis than existing methods.

## Contribution

StripePy introduces a computational geometry-based method for stripe detection and includes a new simulated benchmark called StripeBench.

## Key findings

- StripePy outperforms existing tools in detecting architectural stripes in Hi-C and Micro-C data.
- StripePy includes a simulated benchmark called StripeBench for testing and validation.

## Abstract

Architectural stripes in Hi-C and related data are crucial for gene regulation, development, and DNA repair. Despite their importance, few tools exist for automatic stripe detection.

We introduce StripePy, which leverages computational geometry methods to identify and analyze architectural stripes in contact maps from Chromosome Conformation Capture experiments like Hi-C and Micro-C. StripePy outperforms existing tools, as shown through tests on various datasets and a newly developed simulated benchmark, StripeBench, providing a valuable resource for the community.

StripePy is released to the public as an open-source, MIT-licensed Python application. StripePy source code is hosted on GitHub at https://github.com/paulsengroup/StripePy and is archived on Zenodo. StripePy can be easily installed from source or PyPI using pip and from Bioconda using conda. Containerized versions of StripePy are regularly published on DockerHub.

## Full-text entities

- **Genes:** CTCF (CCCTC-binding factor) [NCBI Gene 10664] {aka CFAP108, FAP108, MRD21}, AHR (aryl hydrocarbon receptor) [NCBI Gene 196] {aka FVH3, RP85, bHLHe76}, TNR (tenascin R) [NCBI Gene 7143] {aka NEDSTO, TN-R}, RAD21 (RAD21 cohesin complex component) [NCBI Gene 5885] {aka CDLS4, HR21, HRAD21, MCD1, MGS, NXP1}
- **Diseases:** 's CLI (MESH:D010300), Hi-C (OMIM:211750)
- **Chemicals:** MNase (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** C — Mus musculus (Mouse), Finite cell line (CVCL_S361), GM12878 — Homo sapiens (Human), Transformed cell line (CVCL_7526), H1-hESC — Gallus gallus (Chicken), Somatic stem cell (CVCL_JE75), H1 — Homo sapiens (Human), Induced pluripotent stem cell (CVCL_HA53)

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12215313/full.md

---
Source: https://tomesphere.com/paper/PMC12215313