# Unsupervised Machine Learning Reveals Temporal Components of Gene Expression in HeLa Cells Following Release from Cell Cycle Arrest

**Authors:** Tom Maimon, Yaron Trink, Jacob Goldberger, Tomer Kalisky

PMC · DOI: 10.3390/ijms26199491 · International Journal of Molecular Sciences · 2025-09-28

## TL;DR

This study uses unsupervised machine learning to uncover hidden patterns in gene expression over time in HeLa cells after cell cycle arrest.

## Contribution

The novel use of unsupervised machine learning to deconvolve gene expression into temporal components reveals dynamic biological processes.

## Key findings

- Two oscillatory gene expression components correspond to G1-S and G2-M cell cycle phases.
- A third transient component is linked to early response genes and cervical cancer.
- Unsupervised methods can reveal hidden temporal dynamics in biological systems.

## Abstract

Gene expression measurements of tissues, tumors, or cell lines taken over multiple time points are valuable for describing dynamic biological phenomena such as the response to growth factors. However, such phenomena typically involve multiple biological processes occurring in parallel, making it difficult to identify and discern their respective contributions at any time point. Here, we demonstrate the use of unsupervised machine learning to deconvolve a series of time-dependent gene expression measurements into its underlying temporal components. We first downloaded publicly available RNAseq data obtained from synchronized HeLa cells at consecutive time points following release from cell cycle arrest. Then, we used Fourier analysis and Topic modeling to reveal three underlying components and their relative contributions at each time point. We identified two temporal components with oscillatory behavior, corresponding to the G1-S and G2-M phases of the cell cycle, and a third component with a transient expression pattern, associated with the immediate early response gene program, regulation of cell proliferation, and cervical cancer. This study demonstrates the use of unsupervised machine learning to identify hidden temporal components in biological systems, with potential applications to early detection and monitoring of diseases and recovery processes.

## Linked entities

- **Diseases:** cervical cancer (MONDO:0002974)

## Full-text entities

- **Diseases:** tumors (MESH:D009369), cervical cancer (MESH:D002583)
- **Cell lines:** HeLa — Homo sapiens (Human), Human papillomavirus-related endocervical adenocarcinoma, Cancer cell line (CVCL_0030)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12524476/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12524476/full.md

## References

52 references — full list in the complete paper: https://tomesphere.com/paper/PMC12524476/full.md

---
Source: https://tomesphere.com/paper/PMC12524476