# Adversarial attack of sequence-free enhancer prediction identifies chromatin architecture

**Authors:** Jamil Gafur, Olivia W Lang, William K M Lai

PMC · DOI: 10.1093/bioinformatics/btaf371 · 2025-06-24

## TL;DR

This paper shows that enhancers can be accurately predicted using chromatin data alone, and introduces a new method to understand how AI models make these predictions.

## Contribution

The novel contribution is a cell-type invariant enhancer prediction platform and the first use of adversarial particle swarm optimization for genomic neural networks.

## Key findings

- Chromatin datasets alone can accurately identify enhancers genome-wide.
- A multi-cell-type neural network platform was developed for enhancer prediction without DNA sequence data.
- Adversarial particle swarm optimization (APSO) was used to explain genomic neural network predictions.

## Abstract

The wide range of cellular complexity created by multicellular organisms is due in large part to the intricate and synergistic interplay of regulatory complexes throughout the eukaryotic genome. These regulatory elements “enhance” specific gene programs and have been shown to operate in diverse networks that are distinct across cell states of the same organism. Attempts to characterize and predict enhancers have typically focused on leveraging information-dense DNA sequence in parallel with epigenomic assays. We examined the viability of enhancer prediction using only a minimal set of epigenomic datasets without direct DNA information.

We demonstrate that chromatin datasets are sufficient to identify enhancers genome-wide with high accuracy. By training networks leveraging data from multiple cell types simultaneously, we generated a cell-type invariant enhancer prediction platform that utilized only the patterns of protein binding for inference. We also showed the utility of swarm-based adversarial attacks [adversarial particle swarm optimization (APSO)] to deconvolute trained genomic neural networks for the first time. Critically, unlike saliency mapping or other game-theory based approaches, APSO is completely network-architecture independent and can be applied to any prediction engine to derive the features that drive inference.

All software and code for data downloading, processing, enhancer inference, eXplainable AI (XAI), and complete figure generation are publicly available on GitHub at https://github.com/EpiGenomicsCode/ChromEnhancer and Zenodo at https://doi.org/10.5281/zenodo.15652797.

## Full-text entities

- **Genes:** CEBPA (CCAAT enhancer binding protein alpha) [NCBI Gene 1050] {aka C/EBP-alpha, CEBP}, EP300 (EP300 lysine acetyltransferase) [NCBI Gene 2033] {aka KAT3B, MKHK2, RSTS2, p300}, HNRNPLL (heterogeneous nuclear ribonucleoprotein L like) [NCBI Gene 92906] {aka HNRPLL, SRRF}, GATA1 (GATA binding protein 1) [NCBI Gene 2623] {aka CNSHA9, ERYF1, GATA-1, GF-1, GF1, HAEADA}, LGR5 (leucine rich repeat containing G protein-coupled receptor 5) [NCBI Gene 8549] {aka FEX, GPR49, GPR67, GRP49, HG38}, CTCF (CCCTC-binding factor) [NCBI Gene 10664] {aka CFAP108, FAP108, MRD21}, SNRNP70 (small nuclear ribonucleoprotein U1 subunit 70) [NCBI Gene 6625] {aka RNPU1Z, RPU1, SNRP70, Snp1, U1-70K, U170K}, YY1 (YY1 transcription factor) [NCBI Gene 7528] {aka DELTA, GADEVS, INO80S, NF-E1, UCRBP, YIN-YANG-1}, HNRNPUL1 (heterogeneous nuclear ribonucleoprotein U like 1) [NCBI Gene 11100] {aka E1B-AP5, E1BAP5, HNRPUL1}, POLR2A (RNA polymerase II subunit A) [NCBI Gene 5430] {aka NEDHIB, POLR2, POLRA, RPB1, RPBh1, RPO2}, MYC (MYC proto-oncogene, bHLH transcription factor) [NCBI Gene 4609] {aka MRTL, MYCC, bHLHe39, c-Myc}, CHIA (chitinase acidic) [NCBI Gene 27159] {aka AMCASE, CHIT2, TSA1902}
- **Diseases:** APSO (MESH:D012513), cancer (MESH:D009369)
- **Chemicals:** APSO (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** A549 — Homo sapiens (Human), Lung adenocarcinoma, Cancer cell line (CVCL_0023), HepG2 — Homo sapiens (Human), Hepatoblastoma, Cancer cell line (CVCL_0027), MCF-7 — Homo sapiens (Human), Invasive breast carcinoma of no special type, Cancer cell line (CVCL_0031), K562 — Homo sapiens (Human), Blast phase chronic myelogenous leukemia, BCR-ABL1 positive, Cancer cell line (CVCL_0004)

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12240468/full.md

---
Source: https://tomesphere.com/paper/PMC12240468