# TreeHub: a comprehensive dataset of phylogenetic trees

**Authors:** Ping Wu, Yawei Cao, Jiajie Yang, Hui Wu

PMC · DOI: 10.1038/s41597-025-05282-4 · Scientific Data · 2025-06-02

## TL;DR

TreeHub is a new dataset containing thousands of phylogenetic trees from scientific papers, helping researchers study evolutionary relationships more effectively.

## Contribution

TreeHub introduces an automatically extracted and integrated dataset of phylogenetic trees from published research.

## Key findings

- TreeHub includes 135,502 phylogenetic trees from 7,879 research articles.
- The dataset spans 609 academic journals and integrates species information.
- TreeHub aims to support biodiversity and evolutionary research with high-density data.

## Abstract

Phylogenetic relationships are crucial for solving various biological questions, serving as a fundamental knowledge in biology. However, the application of phylogenetic trees has been limited by inadequate coverage of updated published phylogenies and the scarcity of reliable comprehensive datasets. In this study, we present a novel approach for automatically extracting phylogenetic data and integrating relevant species information from scientific papers and public databases. On this basis, we constructed a dataset TreeHub, including 135,502 corresponding phylogenetic trees from 7,879 phylogenetic research articles across 609 academic journals. This database will serve as a reliable and accessible resource for the scientific community, accelerating innovations in biodiversity studies and evolutionary theory based on high-density data.

## Full-text entities

- **Diseases:** PK (MESH:C564858)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12130454/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12130454/full.md

## References

5 references — full list in the complete paper: https://tomesphere.com/paper/PMC12130454/full.md

---
Source: https://tomesphere.com/paper/PMC12130454