# Accurate and Efficient Phylogenetic Inference through End-To-End Deep Learning

**Authors:** Xinru Zhang, Shizhe Ding, Chungong Yu, Jianquan Zhao, Dongbo Bu

PMC · DOI: 10.1093/molbev/msaf260 · Molecular Biology and Evolution · 2025-10-23

## TL;DR

This paper introduces NeuralNJ, a deep learning method that improves the accuracy and efficiency of phylogenetic tree reconstruction.

## Contribution

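NeuralNJ is an end-to-end framework that constructs phylogenetic trees directly from the input taxa. It combines a learnable neighbor-joining mechanism, which iteratively joins neighbors according to learned priority scores, with a reinforcement learning-based tree search.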

## Key findings

- On both simulated and empirical data, NeuralNJ infers phylogenetic trees with improved reconstruction accuracy and computational efficiency.
- The approach is a step toward accurate and efficient inference for hundreds of taxa in complex evolutionary scenarios.
- Reinforcement learning-based tree search enhances inference accuracy.

## Abstract

Accurate phylogenetic inference is crucial for understanding evolutionary relationships among species. Deep learning techniques have been introduced for phylogenetic inference; however, existing deep learning-based approaches either suffer from limited accuracy because they split inference into several disjoint stages, or exhibit low efficiency and can hardly be applied to cases with more than 20 species. Here, we present an accurate and efficient approach to phylogenetic inference. Our approach, called NeuralNJ, employs an end-to-end framework that directly constructs phylogenetic trees from the input taxa, thus avoiding the inaccuracy incurred by splitting inference into stages. The key innovation of NeuralNJ lies in its learnable neighbor-joining mechanism, which iteratively joins neighbors guided by learned priority scores and thereby achieves accurate tree reconstruction. The inference accuracy is further enhanced by incorporating reinforcement learning-based tree search. Using both simulated and empirical data, we demonstrate that NeuralNJ can effectively infer phylogenetic trees with improved computational efficiency and reconstruction accuracy. The study paves the way to accurate and efficient phylogenetic inference for hundreds of taxa in complex evolutionary scenarios.
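The score-guided joining loop described in the abstract can be illustrated with a minimal sketch. This is not the NeuralNJ model: the names `greedy_join`, `hamming_score`, and `consensus` are hypothetical, the priority score is a hand-written placeholder (negative Hamming distance) standing in for a learned scoring network, and the sketch returns only a rooted topology with no branch lengths or tree search.

```python
from itertools import combinations
from typing import Callable, Dict


def hamming_score(a: str, b: str) -> float:
    """Placeholder priority score (assumption): fewer mismatched sites = higher priority.
    In the paper this role is played by learned scores, not by this function."""
    return -float(sum(x != y for x, y in zip(a, b)))


def consensus(a: str, b: str) -> str:
    """Placeholder representation of a merged node (assumption):
    keep sites where both children agree, mark the rest as 'N'."""
    return "".join(x if x == y else "N" for x, y in zip(a, b))


def greedy_join(
    seqs: Dict[str, str],
    score: Callable[[str, str], float] = hamming_score,
) -> str:
    """Repeatedly join the highest-scoring pair of active nodes.
    Returns the resulting topology as a Newick string."""
    nodes = dict(seqs)  # active nodes: Newick label -> representation
    while len(nodes) > 1:
        # Pick the pair with the highest priority score (ties: first in sorted order).
        a, b = max(
            combinations(sorted(nodes), 2),
            key=lambda pair: score(nodes[pair[0]], nodes[pair[1]]),
        )
        # Replace the two joined nodes with their parent.
        merged = consensus(nodes.pop(a), nodes.pop(b))
        nodes[f"({a},{b})"] = merged
    (newick,) = nodes
    return newick + ";"
```

For example, `greedy_join({"A": "AAAA", "B": "AAAT", "C": "GGGG", "D": "GGGC"})` joins `A` with `B`, then `C` with `D`, and returns `((A,B),(C,D));`. Passing a different `score` callable changes which pairs are joined first, which is the slot a learned scorer would occupy in this simplified picture.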

## Full-text entities

- **Diseases:** COVID-19 (MESH:D000086382)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12622301/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12622301/full.md

## References

60 references — full list in the complete paper: https://tomesphere.com/paper/PMC12622301/full.md

---
Source: https://tomesphere.com/paper/PMC12622301