# A trio-binning approach for genome assembly reveals extensive structural variation between two Cannabis cultivars: Punto Rojo and Cherry Pie

**Authors:** Brett Pike, Alexander Kozik, Wilson Terán

PMC · DOI: 10.1093/g3journal/jkaf286 · G3: Genes | Genomes | Genetics · 2025-12-30

## TL;DR

This paper introduces a cost-effective method to assemble and phase Cannabis genomes, revealing genetic differences that may affect traits like disease resistance and chemical production.

## Contribution

The study demonstrates a trio-binning approach for high-quality haplotype phasing in Cannabis with low sequencing coverage.

## Key findings

- The trio-binning approach achieved good contiguity and gene completeness with only 18 × coverage per haplotype.
- Structural and copy number variations were identified that may influence disease resistance and secondary metabolite synthesis in Cannabis.
- The method is shown to be cost-effective for genome-wide haplotype phasing in Cannabis cultivars.

## Abstract

With the advent of long-read DNA sequencing technologies, assembling eukaryotic genomes has become routine; however, properly phasing the maternal and paternal contributions, which is of great value for breeding programs, remains technically challenging. Here, we use the trio-binning approach to separate Oxford Nanopore reads derived from a Cannabis F1 wide cross, made between the Colombian landrace Punto Rojo and the Colorado CBD clone Cherry Pie #16. Reads were obtained from a single PromethION flow cell, generating assemblies with coverage of just 18 × per haplotype, but with good contiguity and gene completeness, demonstrating that it is a cost-effective approach for genome-wide and high-quality haplotype phasing. Evaluated through the lenses of disease resistance and secondary metabolite synthesis, both being traits of interest for the Cannabis industry, we report copy number and structural variation that, as has recently been shown for other major crops, may contribute to phenotypic variation along several relevant dimensions.

## Linked entities

- **Species:** Cannabis (taxon 3482)

## Full-text entities

- **Genes:** LOC115697901 (cannabichromenic acid synthase) [NCBI Gene 115697901] {aka CBCAS, CBCAS1, CBCAS2, THCA1, THCA2, THCAS}, CsTPS1 [NCBI Gene 115716064]
- **Diseases:** CP (MESH:D003025), PR (MESH:C537153), MTC (MESH:C536911)
- **Chemicals:** phenol (MESH:D019800), (-)-Limonene (MESH:D000077222), LIM (MESH:C033762), chloroform (MESH:D002725), EDTA (MESH:D004492), ethanol (MESH:D000431), Terpene (MESH:D013729), carbohydrate (MESH:D002241), humulene (MESH:C042686), silver nitrate (MESH:D012835), CP (-), himachalene (MESH:C412292), CBD (MESH:D002185), nerolidol (MESH:C037055), silica (MESH:D012822), cannabinoid (MESH:D002186), geraniol (MESH:C007836), TER (MESH:C027009), farnesene (MESH:D012717), THC (MESH:D013759), anthocyanin (MESH:D000872), caryophyllene (MESH:C024714), MYR (MESH:C509595), linalool (MESH:C018584)
- **Species:** Humulus (genus) [taxon 3484], Homo sapiens (human, species) [taxon 9606], Golovinomyces ambrosiae (species) [taxon 1101212], Escherichia coli (E. coli, species) [taxon 562], Pseudocercospora fijiensis (species) [taxon 1873960], Pseudocercospora musae (species) [taxon 113226], Limosilactobacillus fermentum (species) [taxon 1613], Sclerotinia sclerotiorum (species) [taxon 5180], Helianthus annuus (common sunflower, species) [taxon 4232], Solanum lycopersicum (tomato, species) [taxon 4081], Fusarium oxysporum (species) [taxon 5507], Solanum pimpinellifolium (currant tomato, species) [taxon 4084], Fusarium oxysporum f. sp. cannabis (forma specialis) [taxon 654386], Botrytis cinerea (gray fruit mold, species) [taxon 40559], Cercospora beticola (species) [taxon 122368], Arabidopsis thaliana (mouse-ear cress, species) [taxon 3702], Aspergillus fumigatus (species) [taxon 746128]
- **Cell lines:** -67 — Mus musculus (Mouse), Hybridoma (CVCL_B7D2), Cannbio-2 — Homo sapiens (Human), Colon carcinoma, Cancer cell line (CVCL_A628)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12869074/full.md

## Figures

11 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12869074/full.md

## References

108 references — full list in the complete paper: https://tomesphere.com/paper/PMC12869074/full.md

---
Source: https://tomesphere.com/paper/PMC12869074