# Telomere-to-telomere genome assembly and multiomics analyses illustrate the high accumulation of quercetin glucosides in tetraploid Descurainia sophia

**Authors:** Weifeng Wu, Jianyong Wang, Chengcheng Cai, Xiaoyu Song, Hua Li, Tao Zhang, Meixin Xiong, Ying Wang, Jie Zhang, Bingbing Li, Lei Zhang, Feng Li, Mingkun Huang, Wei Li, Feng Cheng, Danyu Kong, Yi Liu

PMC · DOI: 10.1093/hr/uhaf335 · 2025-12-03

## TL;DR

This study identifies the genetic basis for high quercetin glucoside accumulation in tetraploid Descurainia sophia seeds and clarifies the origin of the diploid D. sophia genome.

## Contribution

The study shows that duplication and functional evolution of a UGT gene, Dscd6AG01520, lead to high quercetin glucoside levels in the seeds of tetraploid D. sophia.

## Key findings

- Gene duplication and functional evolution of the UGT gene Dscd6AG01520 cause high quercetin glucoside accumulation in tetraploid D. sophia seeds.
- Amino acid S213 in Dscd6AG01520 is critical for enzymatic activity.
- Diploid D. sophia evolved from an ancestral crucifer karyotype through chromosome fusion and rearrangement.

## Abstract

Quercetin glucosides are important phytopharmaceutical metabolites in Descurainia sophia seeds, which are widely used in traditional herbal medicine. However, the key genes involved in quercetin glucoside biosynthesis in D. sophia have not been characterized. Herein, we present the telomere-to-telomere genomes of a tetraploid D. sophia, which accumulates high levels of quercetin glucosides, and a diploid D. sophia, which accumulates only trace amounts. Multiomics analyses and uridine diphosphate glucosyltransferase (UGT) enzyme assays revealed that the gene duplication and functional evolution of Dscd6AG01520, a UGT gene, led to high quercetin-3-O-β-d-glucoside and quercetin-3,7-O-β-d-diglucoside accumulation in tetraploid D. sophia seeds. Further UGT enzyme assays with point mutants of Dscd6AG01520 showed that S213 is a critical amino acid for the enzymatic activity of Dscd6AG01520. In addition, we found that diploid D. sophia evolved from an ancestral crucifer karyotype through chromosome fusion and rearrangement. Collectively, our findings illuminate the mechanism of high quercetin glucoside accumulation in tetraploid D. sophia, clarify the origin of the diploid D. sophia genome, and provide valuable genomic resources for comparative genomics and research into polyploid evolution.

## Linked entities

- **Proteins:** SLC35A2 (solute carrier family 35 member A2)
- **Chemicals:** quercetin-3-O-β-d-glucoside (PubChem CID 5280804)
- **Species:** Descurainia sophia (taxon 89411)

## Full-text entities

- **Genes:** KRP6 (KIP-related protein 6) [NCBI Gene 821449] {aka ACK1, ARABIDOPSIS CDK INHIBITOR 1, AtKRP6, ICK4, KIP-RELATED PROTEIN 6, KIP-related protein 6}, cytochrome P450 [NCBI Gene 107819388], AT2G32710 (Cyclin-dependent kinase inhibitor family protein) [NCBI Gene 817831] {aka ACK2, CYCLIN-DEPENDENT KINASE INHIBITOR 2, F24L7.15, F24L7_15, ICK7, INTERACTORS OF CDC2 KINASE 7}, UGT1A (UDP glucuronosyltransferase family 1 member A complex locus) [NCBI Gene 7361] {aka GNT1, UGT, UGT1, UGT1A@}, GH9A1 (glycosyl hydrolase 9A1) [NCBI Gene 835035] {aka ATGH9A1, CELLULASE, DEC, DEFECTIVE CYTOKINESIS, IRREGULAR XYLEM 2, IRX2}
- **Diseases:** TE (MESH:C565217), fevers (MESH:D005334), edema (MESH:D004487), asthma (MESH:D001249), cancer (MESH:D009369), diabetes (MESH:D003920), inflammatory (MESH:D007249), allergic reactions (MESH:D004342), osteoporosis (MESH:D010024), infertility (MESH:D007246), bronchitis (MESH:D001991), cardiovascular disease (MESH:D002318), dysentery (MESH:D004403)
- **Chemicals:** UDP-arabinose (MESH:C040817), quercimeritrin (MESH:C089996), acetic acid (MESH:D019342), alkaloids (MESH:D000470), DTT (MESH:D004229), carbenicillin (MESH:D002228), UDP-xylose (MESH:D014540), QS-21 (MESH:C078785), apigenin (MESH:D047310), TRIzol (MESH:C411644), nucleotides (MESH:D009711), phenolic acids (MESH:C017616), terpenoid indole alkaloids (MESH:D046948), kaempferol (MESH:C006552), dihydromyricetin (MESH:C472036), water (MESH:D014867), tanshinones (MESH:C021751), UDP-galactose (MESH:D014531), Quercetin (MESH:D011794), vincristine (MESH:D014750), baccatin III (MESH:C073950), formic acid (MESH:C030544), Sugar (MESH:D000073893), NaCl (MESH:D012965), methanol (MESH:D000432), amylose (MESH:D000688), UDP-glucuronic acid (MESH:D014535), Flavonoids (MESH:D005419), 4',6-diamidino-2-phenylindole (MESH:C007293), chlorogenic acid (MESH:D002726), anthocyanins (MESH:D000872), camptothecin (MESH:D002166), coumarins (MESH:D003374), naringenin (MESH:C005273), ADP-glucose (MESH:D000245), nitrous oxide (MESH:D009609), lipids (MESH:D008055), leonurine (MESH:C013587), Q (MESH:D005973), UDP-rhamnose (MESH:C022183), DHM (MESH:D004090), amino acids (MESH:D000596), UDP-glucose (MESH:D014532), Quercetin glucosides (MESH:C016527), proanthocyanidins (MESH:D044945), lignans (MESH:D017705), HEPES (MESH:D006531), quercetin-3-O-glucuronide (MESH:C443401), CD37001 (-), wogonin (MESH:C085514)
- **Species:** Isatis tinctoria (woad, species) [taxon 161756], Carica papaya (mamon, species) [taxon 3649], Raphanus sativus (radish, species) [taxon 3726], Nicotiana tabacum (American tobacco, species) [taxon 4097], Paraleonurus japonicus (Chinese motherwort, species) [taxon 4138], Capsella rubella (species) [taxon 81985], Descurainia sophia (species) [taxon 89411], Brassica oleracea (wild cabbage, species) [taxon 3712], Aethionema arabicum (species) [taxon 228871], Arabis alpina (alpine rockcress, species) [taxon 50452], Eutrema halophilum (species) [taxon 98038], Saccharomyces cerevisiae (baker's yeast, species) [taxon 4932], Scutellaria baicalensis (Baikal skullcap, species) [taxon 65409], Agaricomycetes sp. CK (species) [taxon 1134416], Schrenkiella parvula (species) [taxon 98039], Catharanthus roseus (chatas, species) [taxon 4058], Sisymbrium irio (species) [taxon 3730], Descurainia bourgeauana (species) [taxon 358665], Leavenworthia alabamica (species) [taxon 310722], Cichorium intybus (chicory, species) [taxon 13427], Anereuthina renosa (species) [taxon 2752825], Homo sapiens (human, species) [taxon 9606], Brassica rapa (field mustard, species) [taxon 3711], Camptotheca acuminata (species) [taxon 16922], Arabidopsis thaliana (mouse-ear cress, species) [taxon 3702], Paraleonurus sibiricus (Siberian motherwort, species) [taxon 405945]
- **Mutations:** F213S, S213F, V31A, A64P, A31V, A64, S213, P64, A31, F213, V31, P64A
- **Cell lines:** IR1-002 — Homo sapiens (Human), Transformed cell line (CVCL_ZD54), CD37001 — Mus musculus (Mouse), Spontaneously immortalized cell line (CVCL_5731)

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12966014/full.md

---
Source: https://tomesphere.com/paper/PMC12966014