# ChromaFactor: Deconvolution of single-molecule chromatin organization with non-negative matrix factorization

**Authors:** Laura M. Gunsalus, Michael J. Keiser, Katherine S. Pollard, Jian Ma, Jie Liu, Jian Ma, Jie Liu, Jian Ma, Jie Liu

PMC · DOI: 10.1371/journal.pcbi.1012841 · PLOS Computational Biology · 2025-02-18

## TL;DR

ChromaFactor is a new computational tool that helps analyze single-cell chromatin structures to better understand how genome organization relates to function.

## Contribution

ChromaFactor introduces a non-negative matrix factorization approach to deconvolve single-molecule chromatin data into primary components linked to functional phenotypes.

## Key findings

- Primary components identified by ChromaFactor correlate with active transcription, enhancer-promoter distance, and genomic compartment.
- Some bulk trends are observed in only a small fraction of cells, indicating rare subpopulations may drive genome organization changes.
- ChromaFactor's components are associated with bulk genomic measurements of transcription factor and structural protein binding.

## Abstract

The investigation of chromatin organization in single cells holds great promise for identifying causal relationships between genome structure and function. However, analysis of single-molecule data is hampered by extreme yet inherent heterogeneity, making it challenging to determine the contributions of individual chromatin fibers to bulk trends. To address this challenge, we propose ChromaFactor, a novel computational approach based on non-negative matrix factorization that deconvolves single-molecule chromatin organization datasets into their most salient primary components. ChromaFactor provides the ability to identify trends accounting for the maximum variance in the dataset while simultaneously describing the contribution of individual molecules to each component. Applying our approach to two single-molecule imaging datasets across different genomic scales, we find that these primary components demonstrate significant correlation with key functional phenotypes, including active transcription, enhancer-promoter distance, and genomic compartment. Also, we find that some bulk trends exist at the single-cell level, but only in a small fraction of cells, suggesting that critical changes in genome organization may be driven by specific rare subpopulations rather than occurring uniformly across all cells. ChromaFactor offers a robust tool for understanding the complex interplay between chromatin structure and function on individual DNA molecules, pinpointing which subpopulations drive functional changes and fostering new insights into cellular heterogeneity and its implications for bulk genomic phenomena.

Emerging imaging technologies are generating high-resolution, single-molecule chromatin interaction data for individual loci, providing an opportunity to identify the chromatin structures that contribute to patterns seen in bulk genomics assays or that correlate with variability in gene regulation. However, chromatin conformation varies dramatically between cells, making it challenging to determine which patterns are important for transcriptional activity and which represent inherent heterogeneity. We present ChromaFactor, a new method to address this problem based on non-negative matrix factorization. ChromaFactor identifies the major interaction patterns across chromatin fibers as well as the individual molecules contributing to each pattern. We analyzed human and fly single-molecule imaging data, finding significant correlations between ChromaFactor’s primary components and paired molecular phenotypes measured in the same cells, including active transcription, enhancer-promoter distance, and genomic compartment. Interaction patterns in primary components are also associated with bulk genomic measurements of transcription factor and structural protein binding. These findings demonstrate that ChromaFactor provides a robust framework for understanding the complex interplay between chromatin structure and function at the single-molecule level by treating single-cell chromatin measurements not as independent instances, but as snapshots of continuous, dynamic states.

## Linked entities

- **Species:** Homo sapiens (taxon 9606)

## Full-text entities

- **Genes:** BACE2 (beta-secretase 2) [NCBI Gene 25825] {aka AEPLC, ALP56, ASP1, ASP21, BAE2, CDA13}, HLCS (holocarboxylase synthetase) [NCBI Gene 3141] {aka HCS}, HOXA9 (homeobox A9) [NCBI Gene 3205] {aka ABD-B, HOX1, HOX1.7, HOX1G}, RAD21 (RAD21 cohesin complex component) [NCBI Gene 5885] {aka CDLS4, HR21, HRAD21, MCD1, MGS, NXP1}, SON (SON DNA and RNA binding protein) [NCBI Gene 6651] {aka BASS1, C21orf50, DBP-5, NREBP, SON3, TOKIMS}, CTCF (CCCTC-binding factor) [NCBI Gene 10664] {aka CFAP108, FAP108, MRD21}, Ubx (Ultrabithorax) [NCBI Gene 42034] {aka BX-C, Bxl, CG10388, Cbx, DUbx, Dm Ubx}, abd-A (abdominal A) [NCBI Gene 42037] {aka Abd A, AbdA, Abda, BX-C, CG10325, Cbxd}
- **Diseases:** NMF (MESH:C538347)
- **Species:** Drosophila melanogaster (fruit fly, species) [taxon 7227], Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** IMR-90 — Homo sapiens (Human), Finite cell line (CVCL_0347)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11849981/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11849981/full.md

## References

36 references — full list in the complete paper: https://tomesphere.com/paper/PMC11849981/full.md

---
Source: https://tomesphere.com/paper/PMC11849981