# Chloroplast Genome Diversity and Marker Potentials of Diverse Ensete ventricosum Accessions

**Authors:** Manosh Kumar Biswas, Bulbul Ahmed, Mohamed Hijri, Trude Schwarzacher, J. S. (Pat) Heslop-Harrison

PMC · DOI: 10.3390/ijms26199561 · 2025-09-30

## TL;DR

This paper analyzes the chloroplast genomes of 15 Ensete ventricosum landraces to study genetic diversity and identify molecular markers for conservation and breeding.

## Contribution

The study provides the first comparative analysis of chloroplast genomes in E. ventricosum and identifies SSR markers for genetic improvement.

## Key findings

- The chloroplast genomes of E. ventricosum landraces are highly conserved in structure and gene content.
- Simple sequence repeats (SSRs) are predominantly located in the large single-copy region and enriched in A/T nucleotides.
- Phylogenetic analysis reveals a common origin for most landraces, with notable genetic divergence in a China sample.

## Abstract

Ensete ventricosum is a morphologically gigantic, monocot, diploid sister to the banana plant species. It is commercially cultivated as a starch source, only in Ethiopia, where it feeds twenty million people. Here, the complete chloroplast (CP) genomes of 15 diverse landraces of E. ventricosum were assembled and annotated, for comparative genomics, genetic diversity analysis, and molecular marker development. The assembled E. ventricosum CP genomes ranged between 168,388 and 168,806 bp. The sampled CP genomes were quadripartite in structure and had two single-copy regions, a large single-copy region (LSC, average length 88,657 bp), and a small single-copy region (SSC, average length 11,098 bp) separated by inverted repeat regions (IR, average length 34,437 bp). The total number of annotated genes varies between 135 and 138, including 89–92 protein-coding genes, 38 tRNA genes, and 4 rRNA genes. All CP genes, including non-functional ones and intergenic regions, were transcribed with the transcriptome, covering almost 92% of the E. ventricosum CP genome. Codon usage, amino acid frequency, GC contents, and repeat nucleotides were similar among the 15 landraces. Mono- and tetranucleotide simple sequence repeats (SSRs) were found more frequently than other SSRs. An average of 71% of these SSRs were located in the LSC region, and the majority of the SSR motifs were composed of A/T nucleotides. A phylogenetic analysis of the 15 Ensete landraces indicated a common evolutionary origin, while the China sample was positioned separately, suggesting notable genetic differences. This study presents a comparative analysis of the chloroplast genomes of 15 E. ventricosum landraces, providing valuable insights into their genetic diversity and evolution. The identified SSR markers and conserved genomic features offer essential resources for future research and an improvement in Ensete conservation and breeding.

## Linked entities

- **Species:** Ensete ventricosum (taxon 4639)

## Full-text entities

- **Chemicals:** starch (MESH:D013213)
- **Species:** Ensete ventricosum (species) [taxon 4639], Homo sapiens (human, species) [taxon 9606], Ensete (genus) [taxon 4638]

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12525447/full.md

---
Source: https://tomesphere.com/paper/PMC12525447