# Incorporating temporal information during feature engineering bolsters emulation of spatio-temporal emergence

**Authors:** Jason Y Cain, Jacob I Evarts, Jessica S Yu, Neda Bagheri

PMC · DOI: 10.1093/bioinformatics/btae131 · Bioinformatics · 2024-03-05

## TL;DR

Including temporal data improves predictions of tumor behavior from complex models, highlighting the role of time in biological emergence.

## Contribution

Incorporating intermediate temporal states significantly enhances machine learning emulation of spatio-temporal biological systems.

## Key findings

- Emulators using temporal information outperform those using only initial conditions.
- Including intermediate simulation states improves predictive performance of tumor property emulation.
- Emulators show inconsistent performance, indicating complex cell dynamics are hard to replace.

## Abstract

Emergent biological dynamics derive from the evolution of lower-level spatial and temporal processes. A long-standing challenge for scientists and engineers is identifying simple low-level rules that give rise to complex higher-level dynamics. High-resolution biological data acquisition enables this identification and has evolved at a rapid pace for both experimental and computational approaches. Simultaneously harnessing the resolution and managing the expense of emerging technologies—e.g. live cell imaging, scRNAseq, agent-based models—requires a deeper understanding of how spatial and temporal axes impact biological systems. Effective emulation is a promising solution to manage the expense of increasingly complex high-resolution computational models. In this research, we focus on the emulation of a tumor microenvironment agent-based model to examine the relationship between spatial and temporal environment features, and emergent tumor properties.

Despite significant feature engineering, we find limited predictive capacity of tumor properties from initial system representations. However, incorporating temporal information derived from intermediate simulation states dramatically improves the predictive performance of machine learning models. We train a deep-learning emulator on intermediate simulation states and observe promising enhancements over emulators trained solely on initial conditions. Our results underscore the importance of incorporating temporal information in the evaluation of spatio-temporal emergent behavior. Nevertheless, the emulators exhibit inconsistent performance, suggesting that the underlying model characterizes unique cell populations dynamics that are not easily replaced.

All source codes for the agent-based model, emulation, and analyses are publicly available at the corresponding DOIs: 10.5281/zenodo.10622155, 10.5281/zenodo.10611675, 10.5281/zenodo.10621244, respectively.

## Linked entities

- **Diseases:** tumor (MONDO:0005070)

## Full-text entities

- **Diseases:** tumor (MESH:D009369)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC10957516/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC10957516/full.md

## References

67 references — full list in the complete paper: https://tomesphere.com/paper/PMC10957516/full.md

---
Source: https://tomesphere.com/paper/PMC10957516