# ProOvErlap: Assessing feature proximity/overlap and testing statistical significance from genomic intervals

**Authors:** Nicolò Gualandi, Alessio Bertozzo, Claudio Brancolini

PMC · DOI: 10.1016/j.jbc.2025.110209 · 2025-05-08

## TL;DR

This paper introduces a computational method to assess and visualize the overlap and proximity of genomic features, aiding in the understanding of biological processes.

## Contribution

A new computational method for analyzing genomic feature overlap and proximity with statistical significance testing is introduced.

## Key findings

- The method quantitatively assesses proximity or overlap between genomic features.
- It determines statistical significance using a nonparametric randomization test.
- The method provides clear visualizations and is easy to use via a single command line.

## Abstract

Feature overlap is a critical concept in bioinformatics and occurs when two genomic intervals, usually represented as BED files, are located in the same genomic regions. Instead, feature proximity refers to the spatial proximity of genomic elements. For example, promoters typically overlap or are close to the genes they regulate. Overlap and proximity are also important in epigenetic studies. Here, the overlap of regions enriched for specific epigenetic modifications or accessible chromatin can elucidate complex molecular phenotypes. Consequently, the ability to analyze and interpret feature overlap and proximity is essential for understanding the biological processes that contribute to a given phenotype. To address this need, we present a computational method capable of analyzing data represented in the BED format. This method aims to quantitatively assess the degree of proximity or overlap between genomic features and to determine the statistical significance of these events in the context of a nonparametric randomization test. The aim is to understand whether the observed state differs from what would be expected by chance. The method is designed to be easy to use, requiring only a single command line to run, allowing straightforward overlap and proximity analysis. It also provides clear visualizations and publication-quality figures. In conclusion, this study highlights the importance of feature overlap and proximity in epigenetic studies and presents a method to improve the systematic assessment and interpretation of these features. We propose a new resource for identifying biologically significant interactions between genomic features in both healthy and disease states.

## Full-text entities

- **Genes:** SP1 (Sp1 transcription factor) [NCBI Gene 6667], CCNC (cyclin C) [NCBI Gene 892] {aka CycC, SRB11, hSRB11}, MED12 (mediator complex subunit 12) [NCBI Gene 9968] {aka ARC240, CAGH45, FGS1, HDKR, HOPA, Kto}, CDK8 (cyclin dependent kinase 8) [NCBI Gene 1024] {aka IDDHBA, K35}, MED13 (mediator complex subunit 13) [NCBI Gene 9969] {aka ARC250, DRIP250, HSPC221, MRD61, THRAP1, TRAP240}, XCL1 (X-C motif chemokine ligand 1) [NCBI Gene 6375] {aka ATAC, LPTN, LTN, SCM-1, SCM-1a, SCM1}, BRD4 (bromodomain containing 4) [NCBI Gene 23476] {aka CAP, CDLS6, FSHRG4, HUNK1, HUNKI, MCAP}
- **Diseases:** BED (MESH:C564092), fibroid tumors (MESH:D007889), Uterine leiomyomas (OMIM:150699), benign tumors (MESH:D009369)
- **Chemicals:** H3K27Ac (-), polyA (MESH:D011061), LEIO (MESH:D019304)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** K562 — Homo sapiens (Human), Blast phase chronic myelogenous leukemia, BCR-ABL1 positive, Cancer cell line (CVCL_0004)

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12172997/full.md

---
Source: https://tomesphere.com/paper/PMC12172997