# Connecting the dots: deep learning-based automated model building methods in cryo-EM

**Authors:** Harsh Bansia, Amedee des Georges

PMC · DOI: 10.3389/fmolb.2025.1613399 · Frontiers in Molecular Biosciences · 2026-02-11

## TL;DR

This paper reviews how deep learning is being used to automate model building in cryo-EM, helping scientists determine biomolecular structures more efficiently.

## Contribution

The paper introduces a classification of deep learning methods for cryo-EM model building based on structural hierarchy and model derivation.

## Key findings

- Deep learning methods can automate model building at different structural levels in cryo-EM.
- Hybrid methods combine learned features with structural templates for improved modeling.
- Current limitations include dataset diversity and capturing conformational heterogeneity.

## Abstract

The resolution revolution in single particle cryo-electron microscopy (cryo-EM) has dramatically expanded our structural knowledge of large biomolecular complexes. While high-resolution cryo-EM density maps enable atomic model building, lower-resolution maps can still reveal secondary structures, folds, and domains. When combined with integrative modeling approaches, such data can provide meaningful insights into biomolecular structure and function. Constructing accurate models, however, remains challenging: at low resolutions it is difficult to interpret density maps features reliably, and at high resolutions traditional model-building workflows can become a time-consuming bottleneck. Deep learning, which is transforming problem-solving across scientific domains, offers powerful new tools to automate and accelerate this process. In this review, we discuss deep learning-based methods developed to automate model building in cryo-EM density maps, assessing their impact on streamlining structure determination. Recognizing that biomacromolecular structures exhibit hierarchical organization, we classify these methods according to their ability to model primary, secondary, tertiary, and quaternary structures of biomolecules. Deep learning tools for building atomic models in cryo-EM density maps are further grouped as de novo, where the model is predicted directly from features learned from the cryo-EM density, or hybrid, where it is derived by integrating structural templates with these features. We outline current limitations, including the challenge of obtaining sufficiently large and diverse datasets for training networks to model different types of biomolecules in the cryo-EM density maps, and the open challenge of constructing training sets that capture the conformational heterogeneity often observed in the cryo-EM maps. We conclude by highlighting emerging directions for this rapidly advancing field, which promise to make automated, data-driven model building an integral part of structural biology.

## Full-text entities

- **Genes:** MICA (MHC class I polypeptide-related sequence A) [NCBI Gene 100507436] {aka MIC-A, PERB11.1}, BCAR1 (BCAR1 scaffold protein, Cas family member) [NCBI Gene 9564] {aka CAS, CAS1, CASS1, CRKAS, P130Cas}, RYR1 (ryanodine receptor 1) [NCBI Gene 6261] {aka CCO, CMYO1A, CMYO1B, CMYP1A, CMYP1B, KDS}
- **Diseases:** TSP (MESH:D000076082)
- **Chemicals:** Cytosine (MESH:D003596), Adenine (MESH:D000225), carbohydrates (MESH:D002241), Uracil (MESH:D014498), D (MESH:D003903), P (MESH:D010758), sugar (MESH:D000073893), Guanine (MESH:D006147), N (MESH:D009584), nucleotide (MESH:D009711), hydrogen (MESH:D006859), pyrimidine (MESH:C030986), Thymine (MESH:D013941), DAQ (-), phosphate (MESH:D010710), T (MESH:D014316), C (MESH:D002244), D-ribose (MESH:D012266), acid (MESH:D000143), Amino Acid (MESH:D000596), sugar-phosphate (MESH:D013403), U (MESH:D014501), disulfide (MESH:D004220)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12932156/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12932156/full.md

## References

203 references — full list in the complete paper: https://tomesphere.com/paper/PMC12932156/full.md

---
Source: https://tomesphere.com/paper/PMC12932156