# Modeling nascent transcription from chromatin landscape and structure with CLASTER

**Authors:** Marc Pielies Avellí, Arnór Ingi Sigurdsson, Joaquim Ollé López, Takeo Narita, Nils Krietenstein, Chunaram Choudhary, Simon Rasmussen

PMC · DOI: 10.1186/s13059-026-03992-5 · Genome Biology · 2026-02-14

## TL;DR

CLASTER is a deep learning model that predicts nascent transcription levels using chromatin landscape and 3D structure data.

## Contribution

CLASTER introduces a novel deep neural network integrating chromatin data to predict transcription.

## Key findings

- CLASTER effectively translates chromatin data into kilobasepair-resolution transcription levels.
- The model reveals genomic organization as a key factor in machine learning approaches for transcription.
- CLASTER enables prediction of the impact of epigenetic perturbations in silico.

## Abstract

We present the Chromatin Landscape and Structure to Expression Regressor (CLASTER), an epigenetic-based deep neural network that can integrate different data modalities describing the chromatin landscape and its 3D structure. CLASTER effectively translates them into nascent transcription levels measured at a kilobasepair resolution. The model provides a platform to understand the epigenetic drivers and learned rules of nascent transcription, and to predict the impact of in silico epigenetic perturbations. We conclude that the predominant locality of current machine learning approaches emerges as a signature of genomic organization, having broad implications for future modeling approaches.

The online version contains supplementary material available at 10.1186/s13059-026-03992-5.

## Full-text entities

- **Genes:** AKIRIN2 (akirin 2) [NCBI Gene 55122] {aka C6orf166, FBI1, dJ486L4.2}, POLR2A (RNA polymerase II subunit A) [NCBI Gene 5430] {aka NEDHIB, POLR2, POLRA, RPB1, RPBh1, RPO2}, TMEM222 (transmembrane protein 222) [NCBI Gene 84065] {aka C1orf160, NEDMOSBA}, KLF4 (KLF transcription factor 4) [NCBI Gene 9314] {aka EZF, GKLF}
- **Diseases:** erythroleukemia (MESH:D004915), CLASTER (MESH:D001039)
- **Chemicals:** EU (MESH:D005063), 5-EU (-)
- **Species:** Mus musculus (house mouse, species) [taxon 10090], Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** K562 — Homo sapiens (Human), Blast phase chronic myelogenous leukemia, BCR-ABL1 positive, Cancer cell line (CVCL_0004)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13011747/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13011747/full.md

## References

25 references — full list in the complete paper: https://tomesphere.com/paper/PMC13011747/full.md

---
Source: https://tomesphere.com/paper/PMC13011747