# Rethinking DeepVariant: Efficient Neural Architectures for Intelligent Variant Calling

**Authors:** Anastasiia Gurianova, Anastasiia Pestruilova, Aleksandra Beliaeva, Artem Kasianov, Liudmila Mikhailova, Egor Guguchkin, Evgeny Karpulevich

PMC · DOI: 10.3390/ijms27010513 · International Journal of Molecular Sciences · 2026-01-04

## TL;DR

This paper improves the DeepVariant tool by replacing its old neural network with a more efficient one, resulting in better accuracy and faster performance in identifying genetic variants.

## Contribution

The paper introduces a modernized DeepVariant prototype with alternative neural network backbones, demonstrating improved efficiency and accuracy.

## Key findings

- Replacing Inception V3 with EfficientNet reduced parameters by twofold and improved convergence speed.
- The updated workflow achieved a +0.1% increase in SNP F1-score, detecting hundreds more true variants per genome.
- Optimizing neural architecture enhances variant calling accuracy, robustness, and efficiency.

## Abstract

DeepVariant has revolutionized the field of genetic variant identification by reframing variant detection as an image classification problem. However, despite its wide adoption in bioinformatics workflows, the tool continues to evolve mainly through the expansion of training datasets, while its core neural network architecture—Inception V3—has remained unchanged. In this study, we revisited the DeepVariant design and presented a prototype of a modernized version that supports alternative neural network backbones. As a proof of concept, we replaced the legacy Inception V3 model with a mid-sized EfficientNet model and evaluated its performance using the benchmark dataset from the Genome in a Bottle (GIAB) project. Alternative architecture demonstrated faster convergence, a twofold reduction in the number of parameters, and improved accuracy in variant identification. On the test dataset, updated workflow achieved consistent improvements of +0.1% in SNP F1-score, enabling the detection of up to several hundred additional true variants per genome. These results show that optimizing the neural architecture alone can enhance the accuracy, robustness, and efficiency of variant calling, thereby improving the overall quality of sequencing data analysis.

## Full-text entities

- **Diseases:** injury to (MESH:D014947), GIAB (MESH:D042822)
- **Chemicals:** WhatsHap (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12786509/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12786509/full.md

## References

38 references — full list in the complete paper: https://tomesphere.com/paper/PMC12786509/full.md

---
Source: https://tomesphere.com/paper/PMC12786509