# Population Structure of Genotypes and Genome-Wide Association Studies of Cannabinoids and Terpenes Synthesis in Hemp (Cannabis sativa L.)

**Authors:** Marjeta Eržen, Andreja Čerenak, Tjaša Cesar, Jernej Jakše

PMC · DOI: 10.3390/plants15020202 · 2026-01-08

## TL;DR

This study analyzes the genetic structure and associations in hemp varieties to identify SNPs linked to cannabinoid and terpene synthesis.

## Contribution

The study identifies significant SNPs associated with delta-9-THC, CBG, and myrcene in hemp using GWAS and population structure analysis.

## Key findings

- CS formed a single genetic cluster, while TS and FS each formed two clusters.
- 14 significant SNPs were found for delta-9-THC, 12 for CBG, and 1 for myrcene.
- Plausible genes near significant SNPs were identified for all detected associations.

## Abstract

Hemp (Cannabis sativa L.) is one of the oldest cultivated plants in the world. It is a wind-pollinated and heterozygous species, and diverse phenotypes can occur within population varieties. In our study, three different hemp varieties—(‘Carmagnola Selected’ (CS), ‘Tiborszallasi’ (TS) and ‘Finola selection’ (FS))—were grown. Based on visual characteristics, two, five and four phenotypes were identified within CS, TS and FS, respectively. According to Cannabis sativa L. transcriptome data from the Sequence Read Archive (SRA), 4631 single-nucleotide polymorphism (SNP) positions were identified to develop capture probes. DNA was isolated from 171 plants representing selected phenotypes of three cultivars. Next-generation sequencing (NGS) libraries were constructed and hybridized with capture probes for target enrichment. The population structure of the samples was analyzed using SNP data for each genotype. Based on genotype profiles, CS formed a single cluster, while TS and FS were each grouped into two clusters, with phenotypes randomly distributed among them. The GWAS results were visualized using Manhattan plots. Fourteen significant SNPs surpassing the false discovery rate (FDR) of 0.01 were identified for delta-9-tetrahydrocannabinol (delta-9-THC). For cannabigerol (CBG), 12 significant SNPs were detected, and for myrcene, one SNP exceeded the 0.01 FDR threshold. However, plausible genes located 1000 bp to the left and right of the SNP position were identified for all significant SNPs.

## Linked entities

- **Chemicals:** delta-9-tetrahydrocannabinol (PubChem CID 2978), cannabigerol (PubChem CID 5315659), myrcene (PubChem CID 31253)

## Full-text entities

- **Diseases:** FS (MESH:D009155), CS (MESH:D006223)
- **Chemicals:** CBG (MESH:C037036), delta-9-THC (MESH:D013759), Cannabinoids (MESH:D002186), myrcene (MESH:C509595), Terpenes (MESH:D013729)
- **Species:** Cannabis sativa (species) [taxon 3483]

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12845109/full.md

---
Source: https://tomesphere.com/paper/PMC12845109