# Knowledge transfer from macro-world to micro-world: enhancing 3D Cryo-ET classification through fine-tuning video-based deep models

**Authors:** Sabhay Jain, Xingjian Li, Min Xu

PMC · DOI: 10.1093/bioinformatics/btae368 · 2024-06-18

## TL;DR

This paper explores using pre-trained video models to improve 3D Cryo-ET classification, achieving better accuracy and lower training costs.

## Contribution

The novel approach transfers knowledge from video-based models to Cryo-ET classification, demonstrating cross-domain effectiveness.

## Key findings

- Video initialization improves classification accuracy in Cryo-ET subtomograms.
- Training costs are significantly reduced with video-based pre-training.
- The approach also benefits medical 3D classification tasks.

## Abstract

Deep learning models have achieved remarkable success in a wide range of natural-world tasks, such as vision, language, and speech recognition. These accomplishments are largely attributed to the availability of open-source large-scale datasets. More importantly, pre-trained foundational model learnings exhibit a surprising degree of transferability to downstream tasks, enabling efficient learning even with limited training examples. However, the application of such natural-domain models to the domain of tiny Cryo-Electron Tomography (Cryo-ET) images has been a relatively unexplored frontier. This research is motivated by the intuition that 3D Cryo-ET voxel data can be conceptually viewed as a sequence of progressively evolving video frames.

Leveraging the above insight, we propose a novel approach that involves the utilization of 3D models pre-trained on large-scale video datasets to enhance Cryo-ET subtomogram classification. Our experiments, conducted on both simulated and real Cryo-ET datasets, reveal compelling results. The use of video initialization not only demonstrates improvements in classification accuracy but also substantially reduces training costs. Further analyses provide additional evidence of the value of video initialization in enhancing subtomogram feature extraction. Additionally, we observe that video initialization yields similar positive effects when applied to medical 3D classification tasks, underscoring the potential of cross-domain knowledge transfer from video-based models to advance the state-of-the-art in a wide range of biological and medical data types.

https://github.com/xulabs/aitom.

## Full-text entities

- **Genes:** HFM1 (helicase for meiosis 1) [NCBI Gene 164045] {aka MER3, POF9, SEC63D1, Si-11, Si-11-6, helicase}
- **Diseases:** Cryo-ET (MESH:D028361), Alzheimer's disease (MESH:D000544), infection (MESH:D007239)
- **Chemicals:** Cryo-ET (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Mutations:** T20S, DELTA

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11269433/full.md

---
Source: https://tomesphere.com/paper/PMC11269433