Genome-Wide Analysis Unveils the Evolutionary Impact of Allopolyploidization on the 14-3-3 Gene Family in Rapeseed (Brassica napus L.)
Shengxing Duan, Jing Wang

TL;DR
This study explores how the 14-3-3 gene family in rapeseed evolved after polyploidization, revealing changes in gene structure, regulation, and expression.
Contribution
The study provides a comprehensive analysis of the evolutionary impact of allopolyploidization on the 14-3-3 gene family in Brassica napus.
Findings
Allopolyploidization led to the loss of some collinear 14-3-3 genes in B. napus.
The 14-3-3 gene family in B. napus shows increased regulatory complexity due to additional cis-acting elements.
Gene expression patterns of GRF2 homologs changed after allopolyploidization.
Abstract
Background: Polyploidization drives the formation and evolution of angiosperms, profoundly reshaping genomic architecture and function. The 14-3-3 proteins (also known as G-box binding regulators, GRFs) are conserved signaling molecules involved in a range of physiological processes, including developmental signaling and stress responses. Elucidating the evolutionary trajectories of 14-3-3 genes in Brassica napus following allopolyploidization is critical for understanding polyploid crop evolution and developing molecular breeding strategies for improved stress resistance and yield. Results: In this study, forty-eight orthologous 14-3-3 genes were identified in the genome of B. napus, and twenty-two orthologous 14-3-3 genes were found in the genomes of both Brassica rapa and Brassica oleracea. Gene mapping analysis indicated that 14-3-3 genes were broadly distributed across all…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6- —National Key Research and Development Program of China
- —Key Research and Development Program of Hubei Province
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics14-3-3 protein interactions · Genomics and Phylogenetic Studies · Lipid metabolism and biosynthesis
1. Introduction
Polyploidization is a widespread evolutionary phenomenon in angiosperms [1,2,3,4] and plays a significant role in promoting speciation and evolutionary processes [1,2,3,4]. The formation of new polyploids is frequently accompanied by extensive genomic restructuring [5,6], which can lead to the subfunctionalization or neofunctionalization of homologous genes and concomitant alterations in their expression patterns [5,6]. Polyploidization substantially increases genomic complexity in plants and enhances trait diversity and environmental adaptability [7,8]. As a result, it is frequently utilized in crop domestication [8].
The 14-3-3 proteins were initially identified through separation from bovine brain proteins using column chromatography and electrophoretic mobility analysis [9]. As a component of nearly all eukaryotes, these proteins exhibit broad expression in diverse tissues [10]. 14-3-3 proteins were originally identified in the plant species Arabidopsis thaliana and became known as G-box binding regulators (GRFs) because their genes contain G-box elements [10]. Based on gene structure characteristics, 14-3-3 proteins can be classified into two distinct groups: epsilon and non-epsilon proteins [11]. These proteins play key regulatory roles as activators, repressors, or adaptors in numerous biological processes, such as hormone signaling [12,13,14,15,16], growth and development regulation [17,18,19,20], and stress responses [21,22,23,24,25,26,27]. Genome-wide identification of the 14-3-3 gene family has been conducted in various plant species, such as A. thaliana [10], rice [21], soybean [22], and alfalfa [28]. However, the evolutionary impact of allopolyploidization on this gene family remains poorly understood.
Globally, Brassica napus L. (AACC, 2n = 38) is an important oil crop in agriculture and economics [29]. This species originated approximately 7500 years ago from a natural hybridization event between the diploid progenitors Brassica rapa (AA, 2n = 20) and Brassica oleracea (CC, 2n = 18), which resulted in allotetraploidization [29,30]. This evolutionary event conferred considerable genetic diversity B. napus, inheriting stress resistance and high-yield traits from both parents, and established it as an ideal research model for evolutionary processes driven by allopolyploidization [30]. Recent advances in genome sequencing and high-quality assembly for B. rapa [31], B. oleracea [32], and B. napus [29] have provided deeper insights into its genetic architecture and breeding potential. In this study, we identified and compared 14-3-3 gene families across these three species. Comprehensive analyses were performed on their chromosomal locations, phylogenetic relationships, synteny, gene structures, selection pressures, cis-regulatory elements, and expression profiles to elucidate their evolutionary trajectories following allopolyploidization. Our findings provide important insights into the molecular evolution of 14-3-3 proteins and the identification of target genes to enhance stress resistance and yield in crops. Moreover, this study fills a critical gap in understanding how allopolyploidization shapes functional gene family evolution in Brassica crops and provides candidate genes for molecular breeding.
2. Materials and Methods
2.1. Identification of 14-3-3 Members
To identify putative 14-3-3 gene family members, the 13 known 14-3-3 protein sequences in A. thaliana [10] were used as queries for a BLASTP search against protein datasets of B. napus (v5.0), B. rapa (v1.5), and B. oleracea (v1.1) (available at http://brassicadb.cn, accessed on 26 August 2025) [33] using TBtools-II [34] with an E-value < 1E-5. The conserved domains of all candidate proteins were further verified using CDD (https://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi, accessed on 26 August 2025) [35], Pfam (http://pfam.xfam.org/, accessed on 26 August 2025) [36], and SMART (http://smart.embl.de/, accessed on 26 August 2025) [37]. The identified genes were subsequently renamed according to their homology with corresponding A. thaliana orthologs (Table S1).
2.2. Chromosomal Location, Gene Duplication, and Syntenic Analysis
The locations of the 14-3-3 genes were retrieved from the BRAD database. The chromosomal locations of the 14-3-3 genes were mapped using TBtools-II [34]. Gene pairs exhibiting >80% sequence length coverage and >80% identity according to BLASTN alignments were classified as duplicated genes [38]. The Ka/Ks ratios were calculated in TBtools-II via the Nei–Gojobori method, following codon-based MAFFT alignment. Sequences with gaps, stop codons, or Ks > 1 were excluded from the analysis. The time at which duplication events occurred was calculated as follows: T = Ks/2λ (λ = 1.5 × 10^−8^). Syntenic relationships were obtained from the BRAD database and visualized using Circos software (V0.69-9, available at http://circos.ca/) to illustrate the syntenic gene pairs [39].
2.3. Characterization of 14-3-3 Proteins
The ProtParam tool, available on the ExPASy server (https://www.expasy.org/, accessed on 26 August 2025) [40], was employed to analyze various physicochemical parameters of the 14-3-3 proteins, such as the molecular weight (MW), grand average of hydropathicity (GRAVY), instability index (II), and isoelectric point (pI).
2.4. Phylogenetic Tree Construction and Analysis
To reconstruct the evolutionary relationships among the 14-3-3 proteins, sequences from A. thaliana, B. napus, B. rapa, and B. oleracea were first aligned with ClustalW, implemented in MEGA7 [41]. From this alignment, a neighbor-joining phylogenetic tree was generated using the same software, with branch support assessed through 1000 bootstrap replications. Then, this tree was visualized and annotated with iTOL (https://ngphylogeny.fr/, accessed on 26 August 2025) [42].
2.5. Gene Structure and Motif Identification
Gene structures and conserved motifs were visualized using TBtools-II [34] and the MEME suite [43], respectively.
2.6. Prediction of Cis-Acting Elements
Cis-acting elements in the 14-3-3 genes’ 2000 bp upstream sequences at the transcription start site (TSS) were predicted using the PlantCARE database (https://bioinformatics.psb.ugent.be/webtools/plantcare/html/, accessed on 26 August 2025) [44].
2.7. Analysis of Gene Expression
RNA-seq data were obtained from a previous study [45]. The raw reads, comprising three biological replicates for each of the four major tissues (flowers, stems, leaves, and siliques), are publicly available in the NCBI database under accession numbers SRR7816633 to SRR7816668. Clean sequence reads were aligned to the reference genome using HISAT2 (v2.1.0) [46] under default settings. Following this alignment, only uniquely mapped reads were considered for subsequent quantification. The gene expression was quantified with the software featureCounts (v1.6.1) [47] by counting reads mapped to the reference genomes. Subsequently, the read counts for each gene were normalized to TPM (transcripts per million). Visualization of the expression profiling data was performed using Microsoft Excel.
3. Results
3.1. Identification of 14-3-3 Genes
To identify members of the 14-3-3 gene family, the A. thaliana 14-3-3 protein sequences were used as queries to perform a BLASTP search against the genomes of B. napus, B. rapa, and B. oleracea, respectively. After screening for conserved domains, a total of 48 candidate proteins were identified in B. napus, and 22 each were identified in B. rapa and B. oleracea. All identified genes were systematically renamed according to their homology with A. thaliana genes (Table S1). Chromosome localization analysis indicated that the 14-3-3 genes were unevenly distributed across nearly all chromosomes in B. napus, B. rapa, and B. oleracea (Figure 1), suggesting an extended 14-3-3 gene family in these genomes compared to Arabidopsis.
3.2. Phylogenetic Analysis of 14-3-3 Gene Families
To elucidate the phylogenetic and evolutionary relationships between 14-3 and 3 genes, a phylogenetic tree was constructed using protein sequences from five plant species: A. thaliana, Oryza sativa, B. napus, B. rapa, and B. oleracea. A. thaliana and O. sativa were designated as the outgroup to ensure accurate phylogenetic inference. The resulting tree demonstrated that the 14-3-3 genes were divided into two groups: epsilon and non-epsilon groups (Figure 2). This classification aligns with previously established groupings in A. thaliana [10,11] and rice [21], supporting the high conservation of the 14-3-3 gene family classification across plant species.
3.3. Gene Structure and Motif Analysis
Phylogenetic trees of the 14-3-3 gene families were generated using the neighbor-joining method. The exon–intron structures and conserved motifs were subsequently analyzed separately for the non-epsilon group (Figure 3A) and epsilon group (Figure 3B) in B. napus, B. rapa, and B. oleracea, respectively. Most genes contained 3 or 4 exons in the non-epsilon group (Figure 3A) and 5-7 exons (Figure 3B) in the epsilon group. Genes clustered on the same phylogenetic branch exhibited similar exon–intron architectures, implying evolutionary conservation. The motif analysis revealed 25 motifs in the non-epsilon group (Figure 3A) and 12 motifs in the epsilon group (Figure 3B). Notably, genes on the same branch shared similar motif compositions, although the number and type of motifs differed considerably between the two groups.
3.4. Synteny and Duplicated Gene Analysis of 14-3-3 Genes
The synteny analysis of 14-3-3 genes was performed using genomic location information from B. napus (A_n_ and C_n_ subgenomes), B. rapa (A_r_), and B. oleracea (C_r_). As a result, 67 paralogous gene pairs (within A_n_ and C_n_) and 172 orthologous gene pairs (between A_n_ and A_r_ and C_n_ and C_r_) (Figure 4) were identified, suggesting the potential loss of some syntenic 14-3-3 genes during polyploidization. Additionally, 12, 10, and 39 duplicated gene pairs were detected in the genomes of B. rapa, B. oleracea, and B. napus, respectively (Table 1). These duplication events were estimated to have occurred 1.74–16.72 million years ago (MYA). The Ka/Ks analysis revealed that all 61 duplicated gene pairs exhibited Ka/Ks ratios less than 1, suggesting that they had undergone purifying selection. Furthermore, all duplicated gene pairs except BnAGRF13a-BnCGRF13b were under strong purifying selection (Ka/Ks < 0.5). These results imply that the duplicated 14-3-3 gene pairs in B. napus and its two diploid progenitors were subject to purifying selection pressure following duplication.
3.5. Prediction of Physicochemical Properties of 14-3-3 Proteins
The molecular weight (MW) of 14-3-3 proteins in B. napus and its two diploid progenitors varied from 22,690 Da (BoGRF7c, BnCGRF7e, BnCGRF7f) to 65,584 Da (BnCGRF7g) (Table S1). On average, the MW of B. napus 14-3-3 proteins (30,190 Da) was slightly lower than its diploid progenitors (30,635 Da), which was attributed to polyploidization. The isoelectric point (pI) of 14-3-3 proteins ranged from 4.61 to 9.66. Aside from BoGRF7b/c and BnCGRF7e/f/g, all 14-3-3 proteins were classified as acidic (with pI < 7). Furthermore, all 14-3-3 proteins except BrGRF12c, BrGRF13, BnCGRF10e, and BnAGRF13a exhibited considerable instability indices greater than 40. Conversely, the aliphatic index of all 14-3-3 proteins exceeded 70, suggesting higher thermal stability. Additionally, the grand average of hydropathicity (GRAVY) was negative for all 14-3-3 proteins, consistent with their hydrophilicity.
3.6. Analysis of Cis-Acting Elements in 14-3-3 Genes
To investigate whether polyploidization influences the potential regulatory functions of 14-3-3 genes, we identified cis-acting elements within the 2000 bp promoter regions upstream of their transcription start sites. Three functional categories of cis-elements were examined: plant development and growth, phytohormone response, and stress responses (Figure 5). On average, the promoter of B. napus 14-3-3 genes contained 26 cis-acting elements, 1 more than those in B. rapa and B. oleracea, which was attributed to polyploidization. Among the plant development and growth-related elements, the GCN4_motif (associated with endosperm expression) and CAT-box (involved in meristem activity) were detected. The 14-3-3 gene promoters were also enriched in phytohormone response-related elements; many genes contained over six ABREs (abscisic acid-responsive elements) and more than four EREs (ethylene-responsive elements), indicating a potential role in hormone regulation. Additionally, cis-elements with light responsiveness such as Box-4 and G-box were abundant in these gene promoters, suggesting their role in light-mediated regulation. Interestingly, multiple ARE elements linked to anaerobic stress induction were also identified. The prevalence of these elements suggests the involvement of 14-3-3 genes in stress response mechanisms.
3.7. Expression Patterns of 14-3-3 Genes in Different Tissues
To further investigate the expression and potential biological functions of all identified 14-3-3 genes, their expression profiles across four tissues (leaves, stems, flowers, and siliques) were analyzed using previously published RNA-seq data [45]. Aside from BoGRF11 in B. oleracea and BnAGRF13a in B. napus, which were not expressed in any of the four tissues, all other genes showed detectable expression. As illustrated in Figure 6, BrGRF6b, BoGRF6a, and BnAGRF6b showed notably high expression in flowers and siliques, implying functionally conserved roles. Overall, the average expression level of 14-3-3 genes was higher in flowers, siliques, and stems than in leaves in both B. rapa and B. oleracea. In contrast, B. napus showed higher expression in flowers and stems compared to siliques and leaves (Figure 6). In B. rapa and B. oleracea, the expression of several genes, including GRF2 homologs, was especially low in leaves but significantly higher (1.7- to 2.0-fold) in flowers, siliques, and stems. In B. napus, however, the expression of these homologs was lower in siliques than in other tissues, indicating that polyploidization may have altered the expression patterns.
4. Discussion
Polyploidization is a significant evolutionary driving force of angiosperm diversity, promoting genome reorganization, functional differentiation, and adaptive innovation [48]. In newly formed allotetraploid crops such as B. napus [29], it is important to understand how polyploidy reshapes gene families to decipher the molecular basis of its advantages and unlock genetic potential for crop improvement. This study focuses on the 14-3-3 gene family, a conserved signaling hub, to dissect the multifaceted effects of allopolyploidization. By integrating genomic, structural, regulatory, and expression data, we reveal that polyploidy drives gene family evolution through the coordinated interplay of gene copy number modulation, regulatory network integration, functional conservation, and adaptive divergence. These mechanisms are broadly applicable to fundamental gene families and thus provide potential targets for improving polyploid crops.
4.1. Allopolyploidization as a Driver of Gene Family Expansion
In this study, the increased 14-3-3 gene copy number in B. napus resulted from the combined effects of multiple duplication mechanisms. The genus Brassica shares a common ancestor with A. thaliana [29]. Following phylogenetic divergence, the Brassica lineage underwent a whole-genome triplication event and subsequent genomic rearrangements, ultimately leading to the differentiation of B. rapa and B. oleracea [31,32]. Subsequently, B. napus originated from interspecific hybridization between B. rapa and B. oleracea, followed by natural chromosome doubling [29]. Polyploidization, or whole-genome duplication, significantly drives gene family expansion, while tandem duplication and segmental duplication at the gene level are also common drivers of copy number evolution and family expansion [48]. Notably, all 14-3-3 homologous genes identified in this study were classified as products of segmental duplication, with no tandem duplication events detected, indicating that this is the dominant mechanism for the expansion of this gene family in B. napus. Furthermore, the Ka/Ks ratios of duplicated gene pairs were all less than 1, indicating strong purifying selection, which is attributed to the functional integrity of these genes across species while permitting subtle functional innovations [49,50].
In summary, allopolyploidization is the primary factor driving the numerical expansion of the 14-3-3 gene family in B. napus, further reinforced by frequent segmental duplications. Similar duplication patterns have also been observed in other multigene families in B. napus, such as the WOX [51] and EIL [52] families, reflecting a certain degree of commonality in its genomic evolutionary pathways.
4.2. Polyploidization Drives Regulatory Pattern Diversification
Polyploidization not only influences gene copy number but also profoundly reshapes gene regulatory networks (Figure 5). Promoter analysis revealed that B. napus possesses, on average, one more cis-acting element per 14-3-3 gene compared to its diploid progenitors (Figure 5). This increase suggests that polyploidization may enhance the regulatory complexity of these genes, potentially facilitating their involvement in diverse biological processes. Elements such as ABRE, ERE, G-box, and ARE were abundant, underscoring the roles of 14-3-3 genes in abiotic stress adaptation and developmental regulation [45,53]. This enrichment and diversification of regulatory elements likely stem from the integration and innovation of regulatory modules from two distinct ancestral genomes during polyploid formation. Although the coding sequences of the 14-3-3 genes themselves were conserved under purifying selection, this implies that their expression “switches” became more refined and diverse. Such diversification in regulatory patterns likely provides B. napus with broader flexibility in regulating expression, enabling it to more effectively connect endogenous hormonal signals to external environmental stresses, thereby enhancing the environmental adaptability and plasticity of polyploid plants. This represents an important advantage of polyploids at the regulatory level [54].
4.3. Polyploidization Drives Structural and Biochemical Adaptation of 14-3-3 Proteins
In addition to changes in regulation and expression, the structural and physicochemical properties of 14-3-3 proteins also exhibit notable evolutionary trends. The reduced average molecular weight of B. napus compared to its progenitors may reflect structural optimization, potentially influencing protein interactions or stability (Table S1). The prevalence of acidic isoelectric points and large instability indices (above 40) in most 14-3-3 proteins suggests that they are generally unstable but hydrophilic, which may facilitate flexible signaling mediation in dynamic cellular environments. The high aliphatic index indicates robust thermal stability, a feature likely conserved to ensure functionality under stress conditions [55]. These biochemical characteristics underline the adaptive evolution of 14-3-3 proteins in balancing functional versatility with structural resilience.
4.4. Polyploidization Drives Diversification of Expression Patterns
The ultimate function of genes is reflected in their expression patterns. Tissue-specific expression profiling revealed that allopolyploidization has modified the expression patterns of 14-3-3 genes in B. napus (Figure 6). While genes in diploid species showed high expression in flowers, siliques, and stems, their homologs in B. napus exhibited reduced expression in siliques, indicating that polyploidization may have rewired their transcriptional regulatory networks. Notably, certain genes, such as BrGRF6b, BoGRF6a, and BnAGRF6b, maintained high expression in reproductive tissues across species, suggesting functional conservation. This rewiring of expression patterns is likely closely associated with the changes in regulatory sequences. This diversification in expression patterns provides a foundation for functional innovation in polyploid plants. In B. napus, this may enable optimized growth and developmental programs and potential trait advantages absent in its progenitors. This is crucial for its successful adaptation and domestication [45,56].
4.5. Evolutionary Implications and Future Perspectives
This study elucidates how allopolyploidization has shaped the evolution of the 14-3-3 gene family in B. napus, primarily through driving gene family expansion and reorganization, enhancing regulatory complexity, and diversifying expression patterns. Together, these mechanisms have conferred novel flexibility to the regulation and expression of this otherwise functionally conserved gene family in the context of polyploids. Our findings further provide broader insights into the evolutionary strategies employed by polyploid plants. The coexistence of gene loss, purifying selection, regulatory diversification, and expression reprogramming illustrates the multifaceted impact of allopolyploidization on gene family evolution. In B. napus, the 14-3-3 genes have evolved through a balanced process of genetic conservation and regulatory innovation, enhancing adaptive capacity without compromising core functions. Future research on the functional validation of specific genes, such as those with altered expression or unique regulatory elements, will be crucial for elucidating the mechanistic links between genetic evolution and phenotypic adaptation in polyploid crops. Moreover, the candidate genes identified, particularly those associated with stress and hormone responses, hold the potential to improve crop resilience and productivity through molecular breeding.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Otto S.P. Whitton J. Polyploid incidence and evolution Annu. Rev. Genet.20003440143710.1146/annurev.genet.34.1.40111092833 · doi ↗ · pubmed ↗
- 2Mayrose I. Barker M.S. Otto S.P. Probabilistic models of chromosome number evolution and the inference of polyploidy Syst. Biol.20105913214410.1093/sysbio/syp 08320525626 · doi ↗ · pubmed ↗
- 3Van de Peer Y. Mizrachi E. Marchal K. The evolutionary significance of polyploidy Nat. Rev. Genet.20171841142410.1038/nrg.2017.2628502977 · doi ↗ · pubmed ↗
- 4Wu Y. Lin F. Zhou Y. Wang J. Sun S. Wang B. Zhang Z. Li G. Lin X. Wang X. Genomic mosaicism due to homoeologous exchange generates extensive phenotypic diversity in nascent allopolyploids Natl. Sci. Rev.20218 nwaa 27710.1093/nsr/nwaa 27734691642 PMC 8288387 · doi ↗ · pubmed ↗
- 5Soltis P.S. Marchant D.B. Van de Peer Y. Soltis D.E. Polyploidy and genome evolution in plants Curr. Opin. Genet. Dev.20153511912510.1016/j.gde.2015.11.00326656231 · doi ↗ · pubmed ↗
- 6Tayale A. Parisod C. Natural pathways to polyploidy in plants and consequences for genome reorganization Cytogenet. Genome Res.2013140799610.1159/00035131823751271 · doi ↗ · pubmed ↗
- 7Salman-Minkov A. Sabath N. Mayrose I. Whole-genome duplication as a key factor in crop domestication Nat. Plants 201621611510.1038/nplants.2016.11527479829 · doi ↗ · pubmed ↗
- 8Scarrow M. Wang Y. Sun G. Molecular regulatory mechanisms underlying the adaptability of polyploid plants Biol. Rev. Camb. Philos. Soc.20219639440710.1111/brv.1266133098261 · doi ↗ · pubmed ↗
