Identification and Analysis of DUF506 Gene Family in Peanut (Arachis hypogaea)
Qing Song, Gideon Asare Aboagye, Ming Liu, Ying Lan, Minghong Hu, Yanbin Hong, Renfeng Wang, Miao Chen

TL;DR
This study identifies and analyzes the DUF506 gene family in peanuts, revealing their potential roles in plant development and stress responses.
Contribution
The study is the first to identify and characterize the DUF506 gene family in cultivated peanuts.
Findings
23 AhDUF506 genes were identified and grouped into 4 subfamilies across 15 peanut chromosomes.
Seven AhDUF506 genes showed significantly higher expression in flowers compared to other tissues.
All AhDUF506 genes were up-regulated in roots under abiotic stress conditions.
Abstract
The Domain of Unknown Function 506 (DUF506) family, part of the PD-(D/E)XK nuclease superfamily, has been shown to play a vital role in plant development and responses to abiotic stresses. However, the function of the DUF506 family in cultivated peanuts remains unknown. This study identified 23 AhDUF506 genes using bioinformatics approaches; these genes are spread across 15 chromosomes and grouped into 4 subfamilies. Additionally, by analyzing gene structure, upstream cis-acting elements, and transcriptional expression changes of AhDUF506 genes in different tissues and under various stress conditions, their expression levels and response mechanisms to abiotic stresses were examined. In mature tissues, the expression levels of seven AhDUF506 genes in flowers were significantly higher than those in other tissues. Under abiotic stress, their expression levels were all up-regulated in the…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 7- —Natural Science Foundation of Guangdong Province
- —Scientific Research Start-up Fund of Guangdong Ocean University
- —Foundation of Department of Education of Guangdong Province
- —Modern Seed Industry Project
- —Science and Technology Project of Zhanjiang City
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPlant-Microbe Interactions and Immunity · Peanut Plant Research Studies · Insect-Plant Interactions and Control
1. Introduction
The “Domain of Unknown Function” (DUFs) constitutes a broad category encompassing numerous uncharacterized domains characterized by two distinct features: a relatively conserved amino acid sequence and an as-yet uncharacterized biochemical role [1,2,3]. Each DUF is temporarily designated with the prefix “DUF” followed by a unique identifier (for instance, DUF1, DUF2). Once experimental data reveal a domain’s function, it is either renamed to reflect its biochemical activity or integrated into an existing domain family [4]. Research indicates that DUFs are integral to various physiological processes, including plant growth, development, and adaptive responses to biotic and abiotic stresses [5]. Recent bioinformatic analyses utilizing an enhanced transfer Meta-BASIC search against updated PFAM and PDB databases have identified five novel PD-(D/E) XK nuclease families, one of which corresponds to the DUF506 family. These findings reveal that the DUF506 protein exhibits a predicted conserved core secondary structure pattern of αααβαβα [6,7], although its specific function remains undefined.
Recent studies indicate that members of the DUF506 family are associated with plant stress resistance. In Arabidopsis thaliana, At1g12030 responds clearly to iron deficiency, while At2g39650 contributes to stress resistance by interacting with salt stress transcription factors [8]. In crops, OsDUF50602 in rice actively responds to drought, and TaDUF1.3-D is triggered by salinity [9,10]. Similarly, the cabbage homolog Bra017099 shows increased expression under heat and drought stress [11]. Notably, in Malus domestica, MdDUF506 boosts aluminum stress tolerance by enhancing reactive oxygen species scavenging and regulating aluminum-responsive genes. Its interacting partner, MdCNR8, works with MdDUF506 to further improve aluminum tolerance [12]. Studies on their biological roles suggest that DUF506 proteins may differ across species. The low sequence similarity among DUF506 proteins reveals highly divergent homologs within this family [13]. This diversity makes it challenging to study the specific functions of the DUF506 family.
Peanut (Arachis hypogaea) is a globally significant crop used for the production of essential oils, food, and feed [14]. Its genome has been fully sequenced and annotated, facilitating advanced molecular research [15]. Although several gene families in peanut have been characterized, such as WRKY, PP2C, and SUAR [16,17,18], the domain of unknown function 506 (DUF506) family has received little attention.
This study aimed to identify all DUF506 members in peanut using bioinformatics techniques, predict their subcellular localization, analyze their physicochemical properties, and evaluate their homology with DUF506 members from other species. This approach enables a comprehensive understanding of the characteristics of AhDUF506 members. Additionally, in combination with transcriptomic datasets, quantitative PCR (qPCR) was performed on seven AhDUF506 genes to examine their transcriptional responses to abiotic stress, providing valuable insights for future gene-function studies.
2. Materials and Methods
2.1. Identification of the DUF506 Gene Family in Peanuts and Construction of a Phylogenetic Tree
The genome assembly, annotated gene models, and protein sequences of Arachis hypogaea are from the peanut genome database (http://peanutgr.fafu.edu.cn/Download.php, accessed on 20 March 2025) [19]. Whole-genome datasets for Arabidopsis thaliana, Oryza sativa, Glycine max, and Medicago truncatula were obtained from phytozome v13 (https://phytozome-next.jgi.doe.gov, accessed on 20 March 2025) [20]. 13 AtDUF506 protain sequences served as queries in BLASTP searches against the A. hypogaea proteome (score > 50, E value < 0.01). The Hidden Markov Model (HMM) file for the DUF506 conserved domain (accession number: PF04720) was downloaded from the Pfam database. Genome-wide identification in Arachis hypogaea was conducted using the Simple HMM Search module within TBtools II [21,22]. The structure of the AhDUF506 protein was analyzed by setting the expected value to 0.010000 and the maximum number of hits to 500 through the CDD search function on NCBI (https://www.ncbi.nlm.nih.gov/Structure/cdd, accessed 20 March 2025) [23,24]. The amino acid sequences of 23 identified AhDUF506 proteins, 13 Arabidopsis thaliana DUF506 proteins, 10 Oryza sativa DUF506 proteins, 17 Medicago truncatula DUF506 proteins, and 24 Glycine max DUF506 proteins were aligned using the MUSCLE method in MEGA11 software(MEGA11.0.13). A Maximum Likelihood (ML) phylogenetic tree was constructed with bootstrap set to 1000 replicates and using the JTT model (Jones-Taylor-Thornton model) [25,26]. The phylogenetic tree was visualized and annotated using iTOL v6 software [27].
2.2. Chromosome Localization of AhDUF506 Family Members
The most recent peanut (Arachis hypogaea) genome annotation (GFF3 format) was obtained from the Peanut Genome Resource database. The chromosomal localization of DUF506 family members was mapped utilizing the “Gene Location Visualization” module in TBtools II software [28]. Based on the physical coordinates of previously characterized AhDUF506 genes, each DUF506 locus was renamed to accurately reflect its corresponding chromosome and genomic order, thereby ensuring consistency in gene nomenclature. The amino acid sequences of all identified AhDUF506 proteins were imported into the “Protein Parameter Calculator” module of TBtools II to analyze their physicochemical properties.
2.3. Physical and Chemical Properties and Subcellular Localization of AhDUF506 Protein Analysis
The Protein Parameter Calculator module of TBtools II software analyzed the physicochemical properties of AhDUF506 protein. The primary amino acid sequence was provided in FASTA format to compute the total residue count, theoretical isoelectric point (pI), hydropathicity, and instability index. The probable subcellular compartment of AhDUF506 was predicted using WoLF PSORT (https://wolfpsort.hgc.jp/, accessed on 20 March 2025) and was used to determine the subcellular localization of the AhDUF506 protein [29].
2.4. AhDUF506 Conserved Motif, Conserved Domain, and Gene Structure Analysis
Full-length amino acid sequences of AhDUF506 were obtained from the cultivated peanut (Arachis hypogaea) genome database. Conserved motifs within the AhDUF506 protein family were identified using the MEME website (https://meme-suite.org/meme/tools/meme, accessed on 20 March 2025) [30]. The conserved domain architecture of AhDUF506 proteins was predicted via NCBI’s Conserved Domain Search tool (https://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi, accessed on 20 March 2025). Gene structures (exon–intron organization), conserved motifs, and conserved domains of the AhDUF506 gene family were visualized concurrently using the gene structure visualization module of TBtools II software.
2.5. Analysis of AhDUF506 Gene Promoter
The 2000 bp nucleotide sequences upstream of 23 AhDUF506 genes were submitted to the PlantCare website (https://bioinformatics.psb.ugent.be/webtools/plantcare/html/, accessed on 20 March 2025) for the purpose of predicting stress-related promoter elements and visualizing them using TBtools II.
2.6. Synteny and Ka and Ks Analysis of AhDUF506 Homologous Pair
Genomic assemblies and their associated annotation files for Arabidopsis thaliana, Oryza sativa, Glycine max, and Medicago truncatula were sourced from the Ensembl Plants database. Protein sequences of Arachis hypogaea (AhDUF506) were extracted to facilitate cross-species homology analysis. The collinearity between AhDUF506 genes and their orthologous regions in other species was assessed using MCScanX [31], and syntenic relationships were subsequently visualized using the advanced circular-plot module of TBtools II. To further evaluate the duplication events of the AhDUF506 gene, synonymous (Ks) and non-synonymous (Ka) substitution rates were calculated for each homologous gene pair utilizing the Ka/Ks calculation function within TBtools II.
2.7. Expression Analysis of the AhDUF506 Gene
An overview of the tissue-specific expression data for AhDUF506 was obtained from the Peanut Genome Resources Public Database (http://peanutgr.fafu.edu.cn/Transcriptome.php, accessed on 20 March 2025). The expression profiles of all AhDUF506 genes were quantified as transcripts per kilobase million (TPM) using the exon model, and fold changes were calculated by dividing the TPM values of the treatment group by those of the control group. A heatmap was generated with TBtools II to show the converted logarithms.
2.8. Peanut Plant Treatment and RT-qPCR Analysis
Peanut (Arachis hypogaea) seeds were immersed in 5% hypochlorous acid (HClO) for 2 min [32], then rinsed three times with double-distilled water (ddH_2_O). Sterilized seeds were placed on a moist cloth and incubated in a dark, well-ventilated environment to promote germination [33]. Once the primary roots reached 2 cm in length, seedlings were transferred to hydroponic boxes containing Hoagland nutrient solution. The seedlings were grown in a controlled-environment incubator under a 16 h photoperiod (27 °C, 60% relative humidity), followed by an 8 h dark period (25 °C, 60% relative humidity). Three-week-old seedlings, exhibiting uniform and vigorous growth, were subjected to the subsequent treatments: (i) osmotic stress: nutrient solution supplemented with 150 mM NaCl, (ii) dehydration: seedlings placed on Whatman 3 MM filter paper and dried at 25–27 °C, 60% humidity, (iii) cold stress: transferred to a growth chamber maintained at 4 °C, (iv) ABA treatment: a solution containing 100 µM ABA (KeMing Technology Co., Ltd., Zhanjiang, China) was applied to the seedlings, and (v) phosphorus deficiency stress: plants were supplemented with Hoagland nutrient solution lacking phosphorus. Harvested shoot and root tissues at 0, 3, 6, 12, and 24 h under osmotic, drought, cold stress, and ABA treatment, as well as days 0 and 8 under phosphorus deficiency stress, were immediately frozen in liquid nitrogen and stored at -80 °C for subsequent analysis. For tissue-specific expression analysis, leaves, stems, roots, pods, and dry seeds were collected from mature plants cultivated in soil.
Gene-specific primers were designed using Primer3Plus (https://www.primer3plus.com/, accessed on 20 March 2025) and validated for specificity using NCBI’s Primer-BLAST (https://www.ncbi.nlm.nih.gov/tools/primer-blast/index.cgi?LINK_LOC=BlastHome, accessed on 20 March 2025) (Table S1). Total RNA was extracted from the peanut tissue using the Magen Polysaccharide Polyphenol Plant RNA Extraction Kit. RNA was reverse-transcribed into cDNA with the Takara Reverse Transcription Kit (HiSrip III First-Strand cDNA Synthesis Kit, Shiga, Japan). Quantitative Real-Time Polymerase Chain Reaction (QRT-PCR) was performed using Yi Sheng-related products/platforms. The expression level of the peanut ACT11 gene is used as an internal reference. Each biological sample has three biological replicates, and the relative expression of each gene was calculated using the 2^−∆∆CT^ method.
3. Results
3.1. Identification of AhDUF506 Members and Chromosome Localization of AhDUF506 Gene
Homologous sequence alignment was performed using BLAST, as previously described, which identified 26 potential members of the AhDUF506 gene family. Subsequently, 23 candidate AhDUF506 members were determined using the HMM Search module of TBtools II. Verification via CDD search identified 23 members of the AhDUF506 gene family. In total, these 23 AhDUF506 genes are located on 15 chromosomes (Figure 1). Chromosomal mapping indicated that these genes were spread across 14 chromosomes. Most DUF506 genes in peanut were situated near the edges of chromosomes, with chromosome 14 containing the highest number, a total of four genes.
3.2. Physical and Chemical Properties Analysis and Subcellular Localization Prediction of AhDUF506 Protein
The physicochemical properties of 23 AhDUF506 protein sequences were analyzed using the Protein Parameter Calc tool of TBtools II. Sequence lengths ranged from 262 to 401 amino acids. Predicted molecular weights ranged from 29,729.05 to 43,831.97, and theoretical isoelectric points (pI) varied between 5.29 and 9.06. Instability indices ranged from 42.03 to 62.45, indicating that all proteins are predicted to be unstable. The aliphatic index ranged from 63.38 to 80.05, suggesting moderate thermostability. Hydrophilicity indices (grand average of hydropathicity) ranged from −0.607 to −0.388, consistent with an overall hydrophilic character (Table S2).
3.3. Phylogenetic Analysis of DUF506 Gene in Peanuts and Four Other Plants
A total of 13, 10, 24, and 18 DUF506 genes were identified in Arabidopsis thaliana, Oryza sativa, Glycine max, and Medicago truncatula. To explain their evolutionary relationships, a Maximum Likelihood (ML) phylogenetic tree was constructed using these sequences, together with 23 Arachis hypogaea DUF506 proteins, yielding a total of 88 DUF506 members (Figure 2) [34]. The resulting tree resolved the family into 4 well-supported subfamilies (I–IV) (Table S3), in agreement with previous classifications. Notably, subfamily IIIa was the smallest, comprising only a few sequences, which suggests that this subgroup has undergone lineage-specific retention and potential functional specialization.
3.4. Analysis of Protein Structure and Gene Structure of AhDUF506 Family Members
Members of a protein group are expected to share conserved motifs [35]. To characterize the motif and domain architecture in 23 AhDUF506 proteins, their sequences were submitted to the MEME suite and NCBI’s Conserved Domain Search (Figure 3). This analysis identified 4 conserved motifs and 2 structural domains, the PDDEXK6 superfamily and A_thal-3542. Mapping the positions of these features along each sequence revealed that the A_thal-3542 domain is the most consistently conserved across all members of AhDUF506.
3.5. Analysis of AhDUF506 Cis-Acting Components
The study of promoters, primarily responsible for regulating gene expression at the transcriptional level, is vital for enhancing our fundamental comprehension of gene regulation (Figure 4) [36]. A total of 24 abiotic stress-responsive cis-acting elements were identified and categorized into 5 functional groups: (i) common elements, (ii) plant hormone-responsive elements, (iii) environmental-responsive elements, (iv) transcription factor (TF)-related elements, and (v) tissue-specific elements (Table S4).
3.6. Synteny Analysis of AhDUF506 Gene
Performing a collinearity analysis of AhDUF506 can reveal evolutionary relationships and genome rearrangement events among different DUF506 species. Therefore, using the MCScanX program to study the genes involved in peanut replication events. A total of 18 pairs of segmental repetitive genes were identified, and no tandem gene pairs were found (Figure 5A) (Table S5). Assessing the proportions of tandem and segmental genome duplications within gene families is essential for functional analysis [37]. The Ka/Ks of the AhDUF506 segmental repeat gene pairs were below 1 (ranging from 0.15 to 0.5), indicating that the gene functions of the peanut gene family are conserved during evolution. This discovery demonstrates that AhDUF506 family genes undergo purifying selection after gene duplication. Notably, 10 pairs of AhDUF506 proteins share high sequence similarity, which likely reflects the tetraploid genome of Arachis hypogaea and the concurrent duplication of these genes during chromosomal doubling events (Table S6).
To evaluate the genetic homology and evolutionary relationships among species, a collinearity analysis was performed between AhDUF506 genes and the DUF506 family genes from Oryza sativa, Arabidopsis thaliana, Glycine max and Medicago truncatula, which identified 7, 8, 52 and 32 pairs of homologous DUF506 gene pairs, respectively (Figure 5B) (Table S7). As expected, peanut and soybean share the most significant number of DUF506 orthologous pairs due to their closest genetic relationship. The Ka/Ks analysis of orthologous DUF506 gene pairs between peanut and the other four species revealed that the synonymous substitution rate in Arabidopsis thaliana was only slightly lower than that in Glycine max and Medicago truncatula.
3.7. Transcriptome Analysis of AhDUF506 Gene Tissue
Analysis of publicly available transcriptomic databases revealed distinct tissue-specific expression profiles among AhDUF506 family members (Figure 6) (Table S8). AhDUF506-2, AhDUF506-13 and AhDUF506-11 showed higher transcript levels in multiple tissues. In contrast, the expression of AhDUF506-22 was only relatively high in Testa, and the expression level of AhDUF506-23 was also significantly higher in root tips than in other tissues. Notably, members of subfamily I demonstrated minimal or undetectable expression in all sampled tissues, whereas genes in subfamilies II and IIIa were expressed at relatively elevated levels.
3.8. Quantitative RT-PCR (RT-qPCR) Analysis
Based on the tissue-specific transcriptome data of AhDUF506 and literature references, we finally selected 7 genes from the 23 AhDUF506 genes (including AhDUF506-15 and AhDUF506-5 from subfamily I, AhDUF506-23 from subfamily II, AhDUF506-2 and AhDUF506-13 from subfamily IIIa, as well as AhDUF506-10 and AhDUF506-22 from subfamily IIIb). Their expression profiles across peanut organs and developmental stages, and in response to diverse abiotic stresses, were quantified by qPCR [38].
Tissues from different parts of peanuts at the mature stage were collected, and the expression levels of 7 AhDUF506 genes in these tissues were analyzed (Figure 7). It was found that the expression levels of 6 of these genes in flowers were significantly higher than in other tissues, among which AhDUF506-2, AhDUF506-5, and AhDUF506-22 showed the highest expression levels. This indicates that the AhDUF506 gene family plays a vital role in the biological processes of flowers. In addition, the transcription levels of 6 AhDUF506 genes were the lowest in seeds, except AhDUF506-22.
Transcriptional expression analysis of 7 AhDUF506 genes in different tissues. The transcriptional level of each AhDUF506 gene was normalized against the root as the control. L, R, and St represent leaves, roots, and stems at the mature stage; F represents flowers; P represents pods; Se represents seeds. The data represent the mean values of three biological replicates ± SD. Statistical significance of differences was tested by one-way ANOVA analysis (p < 0.05) and is indicated by lower case letters.
3.9. Expression Patterns of AhDUF506 Genes Under Abiotic Stress
Under 150 mM NaCl exposure, transcript levels of all 7 AhDUF506 genes in peanut roots were significantly upregulated, reaching their highest expression between 12 and 24 h of treatment (Figure 8). In contrast, the expression of five AhDUF506 genes showed no significant changes in leaf tissues, while that of AhDUF506-13 and AhDUF506-23 was significantly downregulated. During drought stress, the 7 AhDUF506 genes exhibited different expression patterns in leaves, while in roots, the expression trends were similar to those under osmotic stress. Interestingly, AhDUF506-13 had a distinct expression pattern from the other six AhDUF506 genes, being downregulated in both leaves and roots, while the other genes were upregulated in roots (Figure 9). Exposure to 4 °C elicited a uniform upregulation of all 7 AhDUF506 genes in both roots and leaves (Figure 10). This coordinated transcriptional response indicates that members of the AhDUF506 family actively respond when peanuts are exposed to cold stress.
Under treatment with 100 μM abscisic acid (ABA), the expression pattern of AhDUF506 exhibited a completely distinct profile compared with that under the three aforementioned stress conditions: the expression of seven AhDUF506 genes in leaves all showed a pattern of initial upregulation followed by a decline, while the corresponding transcripts in roots peaked at the early stage of treatment and then decreased subsequently (Figure 11). In the phosphorus-deficiency test, most AhDUF506 genes were upregulated in both organs (Figure 12). Exceptions included AhDUF506-2, which was downregulated in roots and leaves, and AhDUF506-13 and AhDUF506-22, which were specifically downregulated in leaves.
4. Discussion
In the protein family database (Pfam database) [39], hundreds of protein families remain uncharacterized and poorly annotated, collectively known as DUF [40]. In the updated Pfam database, 5 new PD-(D/E)XK nuclease families have been identified; 4 of these belong to DUF families (DUF506, DUF524, DUF1626, and DUF1703) and are involved in numerous nucleic acid cleavage events that are crucial for various cellular processes [6]. Previous studies have shown that DUF506 is associated with stress resistance across different plants [8,9,10,11,12]. However, a comprehensive analysis of the DUF506 gene family in peanuts remains lacking. Consequently, this study identified 23 AhDUF506 genes, comparable to the 24 reported in Glycine max. Homology alignment analysis further revealed that Arachis hypogaea shares higher homology with soybean than with other species, confirming their evolutionary relationship [41]. Additionally, we found that the 23 AhDUF506 genes in peanut include 10 pairs of highly homologous genes with sequence similarities exceeding 96%. Arachis hypogaea is a tetraploid plant with an AABB genome [42]. These 10 gene pairs display a symmetrical distribution on the chromosomes, supporting Gregory’s view that Arachis duranensis and Arachis cardenasii may be ancestors of cultivated peanuts and aligning with Smartt’s hypothesis that cultivated peanuts originated from the hybridization of two diploid wild species with different chromosome sets, followed by a single polyploidization event [43,44]. The high similarity among these gene pairs may result from gene redundancy caused by tetraploidization. Since peanut is a tetraploid plant, it is hypothesized that this phenomenon results from chromosome doubling events [45,46].
The correlation between the content of protein domains and their functions holds significant importance for the study and comprehension of how domain combinations encode complex functionalities [47,48]. Genetic structure analysis has demonstrated that the sequence and number of conserved motifs in members of the AhDUF506 family differ from the two conserved motifs and one conserved domain identified in Arabidopsis thaliana [8,49]. The family members encompass not only the PDDEXK_6 superfamily domain but also A_thal_3542, which has an unidentified function. Gene duplication constitutes a critical mechanism for the acquisition of new genes and the generation of genetic novelty in plants [50,51]. Through collinearity analysis, we have identified 18 pairs of duplicated genes. Examination of the Ka and Ks ratio indicates significant purifying selection within the DUF506 gene family (Ka/Ks < 1; Table S4), implying a central role for this gene family in the adaptive evolution of Arachis hypogaea. This may suggest the presence of functional redundancy [52].
Most studies indicate that the DUF506 family is associated with plant stress resistance; however, few have examined its expression levels across different tissues [8,9,10,11,12]. To investigate the expression of peanut DUF506 in various tissues and under stress conditions, we selected 7 members based on their transcriptional data and subfamily classification for further analysis. In peanut tissues, the expression level in flowers was observed to be significantly higher than that in other tissues, a pattern distinct from that in Arabidopsis thaliana [8]. This suggests that AhDUF506 may be involved in floral metabolism. Under abiotic stress conditions, transcriptional alterations among the selected AhDUF506 genes are more pronounced in roots than in leaves. The tandemly duplicated gene pair, AhDUF506-2 and AhDUF506-13, is induced by dehydration or drought stress (Figure 7 and Figure 8). Nonetheless, their expression patterns differ, likely due to variations in their cis-regulatory elements (CREs). Under phosphorus deficiency, AhDUF506-22 is strongly induced in roots, consistent with the response observed in its homolog, OsDUF50605.
5. Conclusions
In this study, a comprehensive genome-wide identification and analysis were performed, resulting in the identification of 23 high-confidence AhDUF506 genes in peanut (Arachis hypogaea). These 23 AhDUF506 genes were mapped to 15 chromosomes and categorized into 4 subfamilies based on previous studies. The AhDUF506 family is relatively conserved, with all members containing 4 conserved motifs and 2 identical structural domains. Additionally, analyses of their physicochemical properties, cis-acting elements, and genomic collinearity were conducted to characterize their features. In addition, the transcriptional expression patterns of seven representative AhDUF506 genes were investigated across different tissues and under various environmental stress conditions, and an in-depth analysis of the expression pattern of AhDUF506-22 was performed. Overall, these findings improve our understanding of the role of the AhDUF506 gene family in plant adaptation to environmental stresses and identify potential target genes for breeding stress-tolerant peanut cultivars using genome editing technologies.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Bateman A. Coggill P. Finn R.D. DU Fs: Families in search of function Struct. Biol. Cryst. Commun.2010661148115210.1107/S 1744309110001685 PMC 295419820944204 · doi ↗ · pubmed ↗
- 2Mistry J. Chuguransky S. Williams L. Qureshi M. Salazar G.A. Sonnhammer E.L. Tosatto S.C. Paladin L. Raj S. Richardson L.J. Pfam: The protein families database in 2021 Nucleic Acids Res.202149 D 412D 41910.1093/nar/gkaa 91333125078 PMC 7779014 · doi ↗ · pubmed ↗
- 3Jaroszewski L. Li Z. Krishna S.S. Bakolitsa C. Wooley J. Deacon A.M. Wilson I.A. Godzik A. Exploration of uncharted regions of the protein universe P Lo S Biol.20097 e 100020510.1371/journal.pbio.100020519787035 PMC 2744874 · doi ↗ · pubmed ↗
- 4Luo C. Akhtar M. Min W. Bai X. Ma T. Liu C. Domain of unknown function (DUF) proteins in plants: Function and perspective Protoplasma 202426139741010.1007/s 00709-023-01917-838158398 · doi ↗ · pubmed ↗
- 5Lv P. Wan J. Zhang C. Hina A. Al Amin G.M. Begum N. Zhao T. Unraveling the diverse roles of neglected genes containing domains of unknown function (DU Fs): Progress and perspective Int. J. Mol. Sci.202324418710.3390/ijms 2404418736835600 PMC 9966272 · doi ↗ · pubmed ↗
- 6Knizewski L. Kinch L.N. Grishin N.V. Rychlewski L. Ginalski K. Realm of PD-(D/E) XK nuclease superfamily revisited: Detection of novel families with modified transitive meta profile searches BMC Struct. Biol.200774010.1186/1472-6807-7-4017584917 PMC 1913061 · doi ↗ · pubmed ↗
- 7Schultz J. Milpetz F. Bork P. Ponting C.P. SMART, a simple modular architecture research tool: Identification of signaling domains Proc. Natl. Acad. Sci. USA 1998955857586410.1073/pnas.95.11.58579600884 PMC 34487 · doi ↗ · pubmed ↗
- 8Ying S. Genome-wide identification and transcriptional analysis of Arabidopsis DUF 506 gene family Int. J. Mol. Sci.2021221144210.3390/ijms 22211144234768874 PMC 8583954 · doi ↗ · pubmed ↗
