# First genome and transcription factor profile for Asimina triloba, a native North American fruit tree

**Authors:** Gabdiel E. Yulfo‐Soto, Hannah Toth, Sarah E. Francino, Jason Leung, Lyndel W. Meinhardt, G. Matt Davies, Jonathan M. Jacobs, Stephen P. Cohen

PMC · DOI: 10.1002/tpg2.70181 · 2026-03-04

## TL;DR

This paper presents the first high-quality genome and transcription factor profile for pawpaw, a native North American fruit tree, to support future breeding and commercial development.

## Contribution

The first publicly available pawpaw genome assembly and identification of transcription factors linked to agronomic traits.

## Key findings

- The pawpaw genome is 851.7 Mbp with high contiguity and completeness scores.
- 47 WRKY and 43 NAC transcription factors were identified and linked to traits like disease resistance and fruit development.
- The genome resource supports targeted breeding for improved pawpaw varieties.

## Abstract

Pawpaw (Asimina triloba) is the only fruit‐producing tree of the soursop (custard apple) family Annonaceae that is native to temperate North America. Pawpaws are extensively cultivated in the northeast United States, but to date, there are few genetic resources and no publicly available genome assemblies. Here, we present the first high‐quality genome assembly and annotation of pawpaw (cultivar Mango), derived from high‐fidelity third‐generation sequencing. The 851.7‐Mbp assembly consists of 68 contigs with a scaffold N50 of 28.5 Mbp, guanine‐cytosine content of 37%, and a 96.1% benchmarking universal single‐copy ortholog completeness score (eudicots). We profiled agronomically relevant transcription factors in the transcription factor family with the DNA‐binding WRKY amino acid domain and no apical meristem/Arabidopsis transcription activation factor/cup‐shaped cotyledon transcription factor families, which have functions related to environmental and pathogen immunity responses and regulation of fruit traits. Our resource facilitates future genetic and breeding research for this culturally important fruiting tree, expanding its economic and commercial potential.

Third‐generation genome sequencing facilitates the generation of a high‐quality pawpaw genome assembly.The assembly was 851.7 Mbp, with 37.06% guanine‐cytosine content, 98.7% (embryophyta), and 96.1% (eudicotyledons) benchmarking universal single‐copy ortholog (BUSCO) completeness, and was highly contiguous (68 contigs, N50 = 28.6 Mbp).Forty‐seven transcription factor family with the DNA‐binding WRKY amino acid domain (WRKY) and 43 no apical meristem/Arabidopsis transcription activation factor/cup‐shaped cotyledon transcription factor family (NAC) transcriptions factors (TFs) were identified in pawpaw and compared to known TFs to predict pawpaw TFs involved in desirable agronomic traits.

Third‐generation genome sequencing facilitates the generation of a high‐quality pawpaw genome assembly.

The assembly was 851.7 Mbp, with 37.06% guanine‐cytosine content, 98.7% (embryophyta), and 96.1% (eudicotyledons) benchmarking universal single‐copy ortholog (BUSCO) completeness, and was highly contiguous (68 contigs, N50 = 28.6 Mbp).

Forty‐seven transcription factor family with the DNA‐binding WRKY amino acid domain (WRKY) and 43 no apical meristem/Arabidopsis transcription activation factor/cup‐shaped cotyledon transcription factor family (NAC) transcriptions factors (TFs) were identified in pawpaw and compared to known TFs to predict pawpaw TFs involved in desirable agronomic traits.

Pawpaw is a North American fruit tree that has significant cultural importance to Indigenous people and traditional communities in Appalachia. Most fruit are collected from the wild, but dozens of cultivated pawpaw varieties are also grown in the northeast United States. Despite this, there are no publicly available genomic resources for pawpaw. Here, we used advanced DNA sequencing techniques to assemble the first pawpaw genome. We used computational tools to identify genes that are involved in valuable agronomic traits such as stress tolerance, disease resistance, flower development, and fruiting. This research contributes to the modernization of pawpaw breeding, enabling targeted efforts to develop new varieties that could bring pawpaw closer to its market potential.

## Linked entities

- **Proteins:** WRKY (probable WRKY transcription factor 33), XK (X-linked Kx blood group antigen, Kell and VPS13A binding protein)
- **Species:** Asimina triloba (taxon 12953)

## Full-text entities

- **Genes:** NAC3 (NAC domain containing protein 3) [NCBI Gene 820789] {aka ANAC055, ATNAC3, NAC domain containing protein 3, NAC domain containing protein 55, NAC-DOMAIN ENCODING GENE, NAC055}, WRKY19 (protein kinase family protein) [NCBI Gene 826810] {aka ATWRKY19, F16J13.90, F16J13_90, MAPK/ERK KINASE KINASE 4, MAPKKK11, MEKK4}, WRKY [NCBI Gene 100191122], WRKY72 (WRKY DNA-binding protein 72) [NCBI Gene 831365] {aka ARABIDOPSIS THALIANA WRKY DNA-BINDING PROTEIN 72, ATWRKY72, F8M21.20, F8M21_20, WRKY DNA-binding protein 72}, SlWRKY72b [NCBI Gene 100529133], NAC1 (NAC domain protein) [NCBI Gene 543917] {aka NC1, SlNAC1}
- **Diseases:** viral disease (MESH:D014777), bruising (MESH:D003288), leaf necrosis (MESH:D009336), fungal spot (MESH:D009181), pawpaw diseases (MESH:D004194), root-knot nematodes (MESH:D009349), fruit rot (MESH:D005535)
- **Chemicals:** NAC (-), auxin (MESH:D007210), jasmonic acid (MESH:C011006), anthocyanins (MESH:D000872), salicylic acid (MESH:D020156), sugar (MESH:D000073893), salt (MESH:D012492), abscisic acid (MESH:D000040), nitrogen (MESH:D009584), ethylene (MESH:C036216), carotenoids (MESH:D002338), brassinosteroids (MESH:D060406)
- **Species:** Asimina incana (species) [taxon 690726], Arabidopsis thaliana (mouse-ear cress, species) [taxon 3702], Annona reticulata (anona corazon, species) [taxon 301862], Solanum lycopersicum (tomato, species) [taxon 4081], Mycocentrospora (genus) [taxon 227733], Homo sapiens (human, species) [taxon 9606], Botrytis (genus) [taxon 33196], eudicotyledons (eudicots, clade) [taxon 71240], Ananas comosus (pineapple, species) [taxon 4615], Phytophthora citrophthora (species) [taxon 4793], Malus domestica (apple, species) [taxon 3750], Pseudomonas syringae pv. tomato (no rank) [taxon 323], Magnoliidae (clade) [taxon 232347], Tobacco ringspot virus (no rank) [taxon 12282], Annona (genus) [taxon 13336], Asimina reticulata (species) [taxon 690729], Pyrus communis (pear, species) [taxon 23211], Squamosa (genus) [taxon 704607], Annona squamosa (sugar-apple, species) [taxon 301693], Annona muricata (guanabana, species) [taxon 13337], Musa acuminata (banana, species) [taxon 4641], Asimina tetramera (species) [taxon 690730], Annona cherimola (cherimoya, species) [taxon 49314], Asimina longifolia (polecat bush, species) [taxon 251264], Asimina x nashii (species) [taxon 690724], Asimina pygmaea (species) [taxon 690728], Mangifera indica (mango, species) [taxon 29780], Pseudomonas syringae (species) [taxon 317], Asimina triloba (pawpaw, species) [taxon 12953], Asimina parviflora (dwarf pawpaw, species) [taxon 137664]

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12961255/full.md

---
Source: https://tomesphere.com/paper/PMC12961255