# Volcano: a pipeline to characterize long terminal repeat-retrotransposons families in plants

**Authors:** Hao He, Fei Shen, Yong Hou, Xiaozeng Yang

PMC · DOI: 10.1093/bioadv/vbaf162 · 2025-07-04

## TL;DR

Volcano is a new pipeline that helps scientists study and classify LTR-RTs in plant genomes, which are important for understanding genome evolution.

## Contribution

Volcano introduces an improved algorithm for classifying LTR-RTs and quantifying their expression in plants.

## Key findings

- Larger plant genomes contain more LTR-RTs.
- Volcano effectively categorizes LTR-RTs at the clade level.
- The pipeline can quantify LTR-RT expression using RNA-seq data.

## Abstract

Long Terminal Repeat Retrotransposons (LTR-RTs) comprise a significant portion of repetitive sequences in numerous plant species. LTR-RTs hold considerable functional significance, as they can impact gene family functionality and contribute to the formation of new genes. Investigating the quantities and activities of LTR-RTs is essential for understanding species’ evolutionary dynamics and the foundational mechanisms driving genome evolution. While current softwares can predict and initially classify LTR-RTs, there is a high need for more comprehensive and efficient software to fully characterize and quantify LTR-RTs during burst events and in subsequent detailed classification and quantification, especially given the surged demands of genome annotation.

In this study, we have developed a pipeline called Volcano to accurately classify LTR-RTs and characterize burst families in plants. To distinguish different clades of LTR-RTs, we have implemented an improved depth-first search algorithm. Volcano can also quantify LTR-RT expression using RNA-seq data. By analyzing LTR-RTs in three genomes from the Asteraceae family, we observed that larger genomes tend to contain a greater number of LTR-RTs, and our software effectively categorizes them at the clade level.

The proposed Volcano compressor can be downloaded from https://github.com/Suosihe/volcano_LTR.

## Linked entities

- **Species:** Asteraceae (taxon 4210)

## Full-text entities

- **Diseases:** LTR (MESH:D000094024), EC (MESH:D005955), CL (MESH:D002971)
- **Chemicals:** EDTA (-)
- **Species:** Erigeron canadensis (horseweed, species) [taxon 72917], Oryza australiensis (species) [taxon 4532], Zea mays (maize, species) [taxon 4577], Chrysanthemum lavandulifolium (species) [taxon 146996], Cichorium intybus (chicory, species) [taxon 13427], Scaevola taccada (beach naupaka, species) [taxon 16481], Carthamus tinctorius (safflower, species) [taxon 4222]

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12349922/full.md

---
Source: https://tomesphere.com/paper/PMC12349922