Transposon-Associated Small RNAs Involved in Plant Defense in Poplar
Cui Long, Yuxin Du, Yumeng Guan, Sijia Liu, Jianbo Xie

TL;DR
This study explores how small RNAs in poplar trees respond to fungal infection, revealing transposon-derived RNAs play a key role in plant defense.
Contribution
The study identifies transposon-derived small RNAs as significant contributors to plant defense mechanisms during fungal infection in poplar.
Findings
Transposon-derived siRNA clusters significantly increase during fungal infection stages in poplar.
Populus-specific miRNAs preferentially target NB-LRR genes, suggesting a unique defense strategy.
sRNAs regulate a large proportion of pathogen genes, especially effector genes during infection phases.
Abstract
Utilizing high-throughput Illumina sequencing, we examined how small RNA (sRNA) profiles vary in Chinese white poplar (Populus tomentosa) across two pivotal infection stages by the rust fungus Melampsora larici-populina: the biotrophic growth phase (T02; 48 h post infection) and the urediniospore development and dispersal phase (T03; 168 h), both essential for plant colonization and prolonged biotrophic engagement. Far exceeding random expectations, siRNA clusters predominantly arose from transposon regions, with pseudogenes also contributing significantly, and infection-stage-specific variations were notably tied to these transposon-derived siRNAs. As the infection advanced, clusters of 24 nt siRNAs in transposon and intergenic regions exhibited pronounced abundance shifts. An analysis of targets indicated that Populus sRNAs potentially regulate 95% of Melampsora larici-populina genes,…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3- —Guangdong Provincial Key Laboratory of Silviculture, Protection, and Utilization
- —Beijing Forestry University “Outstanding Laboratory Technician” Training Special Fund
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPlant Molecular Biology Research · Plant-Microbe Interactions and Immunity · Chromosomal and Genetic Variations
1. Introduction
Regulating gene expression, maintaining genome stability and chromatin structure, and influencing plant development like seed germination, small RNAs (sRNAs) serve as critical mediators in diverse biological processes [1,2,3,4]. Spanning roughly 21 to 24 nucleotides (nt), these sRNAs typically inhibit gene function, fitting within the broader RNA-silencing mechanism [5].
Two major classes of endogenous sRNAs have been identified in plants, namely, microRNAs (miRNAs) and small interfering RNAs (siRNAs) [5,6]. Plant miRNAs are transcribed from imperfect hairpin-shaped precursors and are typically 21 nt long, while siRNAs are 21–24 nt in length and are generated from double-stranded RNA duplexes [7]. miRNAs reduce the protein abundances of their targets through the direct cleavage or posttranscriptional regulation of target mRNAs to reduce their translation [8], which has important functions in a wide range of biological processes [5]. In Arabidopsis, over 100 miRNAs have been shown to be important for plant development [9] and abiotic stress tolerance [10].
siRNAs have been sequenced in many plant species, such as Arabidopsis thaliana, rice (Oryza sativa), and wheat (Triticum aestivum) [11,12,13,14], and have the ability to silence their targets by directly complementing their transcripts to prevent their translation. Studies in maize (Zea mays) have revealed that sRNA populations vary between parents and their progeny, contributing to the dramatic vigor of maize hybrids [15].
Strengthening plant resistance against pathogenic challenges, small RNAs (sRNAs) hold a crucial position, as evidenced by growing studies [16,17,18]. For instance, during Arabidopsis infection by Pseudomonas syringae or following exposure to bacterial flagellin, shifts in levels of specific miRNAs like miR393, miR160, and miR773 have been observed [19,20,21]. Through this innate immunity, a long-established protective mechanism, plants counteract a wide array of pathogens [17,18].
Despite the sequencing of thousands of endogenous siRNAs [22], their defensive roles in plants remain largely unexplored, necessitating deeper investigation. A superfamily of miRNAs, including miR482 and miR2118, engages with plant NB-LRR (nucleotide-binding leucine-rich-repeat) defense genes, as revealed by evolutionary analyses [23]. In rice and other monocots, miR2118 initiates phased secondary siRNAs (phasiRNAs) from noncoding RNAs, bolstering both pathogen resistance and male fertility [24], while, in tomato (Solanum lycopersicum), miR482 variants target the conserved P-loop motif to downregulate NLR transcripts, a suppression lifted by viral infections like cucumber mosaic virus to trigger resistance genes [25]. Contributing to antibacterial defenses, sRNAs also operate within eukaryotic host cells [26]. To uncover the functions of these vital genomic components, association genetics—characterizing polymorphisms and linking them to phenotypes—offers a promising approach.
Notably, understanding sRNA-mediated defense is particularly relevant for woody plants, which face unique challenges due to their distinct biological traits. Woody plants have distinct features that distinguish them from most herbaceous plants, including their large size, long lifespans, and perennial growth [27]. Plant growth and wood formation are complex and dynamic processes requiring the coordinated regulation of diverse metabolic pathways and trade-offs with immune responses [28,29]. It is almost certain that a perennial plant will encounter a pathogen before it can reproduce, and the long generation intervals of trees makes it difficult for them to match the evolutionary rates of microbes [30].
Encompassing over 6000 species, rust fungi (Pucciniales, Basidiomycota) pose significant threats to plants, with Melampsora spp. notably hindering bioenergy initiatives involving domesticated poplars due to insufficient lasting host resistance [31]. Fully sequenced genomes of the Populus genus make it an ideal system for studying sRNA functions in pathogen interactions. This genus includes key woody species such as western balsam poplar (Populus trichocarpa) [30], desert poplar (Populus euphratica) [32], and Chinese white poplar (Populus tomentosa) [33]. Certain Populus species stand out as preferred experimental models, benefiting from rapid growth and compact genomes.
Plants have the capacity to recognize pathogens through strategies involving both conserved and variable defense regulators, while pathogens suppress plant defense responses by delivering virulence effectors [34]. To gain deeper insights into the origins, evolutionary trajectories, and defensive impacts of poplar sRNAs, we sequenced the sRNAome of Chinese white poplar during the two essential stages of rust fungus (Melampsora larici-populina) infection: plant colonization and biotrophic growth. These analyses reveal the evolutionary importance and functional novelty of the sRNAs.
2. Materials and Methods
2.1. Plant Material and Fungal Treatments
Four-month-old Chinese white poplar (Populus tomentosa) plants were cultivated in pots with a soilless approach, housed in a climate-controlled glasshouse at 25 ± 1 °C and 50 ± 1% humidity, receiving 12 h of controlled natural light at 1250 μmol m^−2^ s^−1^ PAR. Infection studies employed Melampsora larici-populina strain G5, initiating infections by administering 5 μL of a 10^6^ conidia mL^−1^ suspension to six spots per leaf across three replicates per condition. Fully expanded leaves were detached from several poplar plants and spray-inoculated on their abaxial surface with 10^6^ urediniospores mL^−1^, with three biological replicates per treatment. Incubated at 25 ± 1 °C and 95% humidity in an LT36VL incubator (Percival Scientific, Inc., Perry, IA, USA), these leaves were sampled at 48 hpi (T02) and 168 hpi (T03). For controls (T01), leaves treated with sterile water were harvested at 0 hpi, and three independent biological replicates were performed. Samples were immediately frozen in liquid nitrogen and stored at −80 °C awaiting RNA analysis.
2.2. Library Preparation and NGS Sequencing
Using an RNeasy Plant Mini kit (Qiagen, Hilden, Germany) as per its instructions, total RNA was extracted, with on-column DNase treatment performed via RNase-Free DNase (Qiagen, Hilden, Germany) to remove DNA during the process. A Small RNA Sample Prep Kit (Illumina, Inc., San Diego, CA, USA) facilitated the attachment of 3′ and 5′ adapters to the RNAs, enabling those bearing both adapters to act as templates for cDNA generation through RT-PCR. Following purification, the cDNA was quantified using a Qubit dsDNA HS Assay Kit (Thermo Fisher Scientific, Waltham, MA, USA) on a Qubit 2.0 Fluorometer (Thermo Fisher Scientific, Waltham, MA, USA) and an Agilent 2100 Bioanalyzer (Agilent Technologies, Inc., Santa Clara, CA, USA), then subjected to clustering and sequencing on an Illumina HiSeq 2500 (Illumina, Inc., San Diego, CA, USA) according to the cBot and HiSeq 2500 protocols (Illumina, Inc., San Diego, CA, USA).
2.3. sRNAome and mRNA Analysis
To evaluate mature sRNA and mRNA transcript levels in leaves across three infection conditions, sequences were processed as follows: each library generated over 16 million raw reads, with 33.5–44.0% aligning to the Populus trichocarpa genome (version 3.0) for sRNAs and TopHat2 (version 2.1.1) [35] with parameters --read-mismatches 2 -p -G (permitting two mismatches) for mRNAs. The process of mapping was used Bowtie2 (version 2.4.5) [36]. These aligned sRNA reads were annotated by BLAST (version 2.11.0) against GenBank and Rfam (version 13; http://rfam.xfam.org/) with a one-mismatch threshold, while non-sRNA sequences—including tRNAs, rRNAs, snRNAs, snoRNAs, and scRNAs—were excluded. Unannotated sRNAs were then matched to miRBase 22.0 (http://www.mirbase.org/) with a tolerance of two mismatches, and novel miRNAs were predicted and identified using miRDeep2 (version 2.0.1). For mRNA analysis, transcript abundance and differential expression were determined via CuffDiff (version 2.2.1) with options -p -b [37].
To identify the siRNA clusters, the strategy used by [38] was applied, with some modifications. The 21–24-nt sRNAs from all the samples that did not match the miRNAs, ribosomal rRNA, or tRNAs and that could map to the P. trichocarpa genome were processed together. siRNAs within 100 bp of each other were merged into blocks referred to as siRNA clusters. The coordinates of the clusters were defined by the first and last siRNAs of the overlapping sequences. siRNA clusters with a transcription abundance of at least 5 tpm were included in the downstream analysis. The clusters were labeled according to the most abundant length of siRNA they contained (21, 22, or 24 nt), and the type of genomic region in which they were located.
2.4. Effector Prediction
Effector prediction was performed using a machine-learning program, EffectorP, with default parameters [39]. In this analysis, all of the 16,399 protein sequences in the M. larici-populina genome were used.
2.5. Target Prediction
The miRNA targets were annotated using the standard settings of psRNATarget [40] with an expectation value of 2.0 and top targets of 200. For targets in poplar, the transcripts of P. trichocarpa were downloaded from Phytozome 12 (https://phytozome.jgi.doe.gov/pz/portal.html, accessed on 18 April 2025). For targets in M. larici-populina, transcripts from this species were downloaded from the Ensemble database (ftp://ftp.ensemblgenomes.org/pub/release-39/fungi/fasta/melampsora_laricipopulina/cdna/).
2.6. Locations of miRNAs in the P. trichocarpa Genome
Within the Populus trichocarpa genome (Phytozome v12), siRNA clusters were mapped to genomic features—including transposons, introns, pseudogenes, intergenic regions, 5′ UTR, 3′ UTR, and 2 kb flanking zones of coding genes—with a minimum overlap of 80%, aiming to determine their overrepresentation via randomization tests. Genomic segments matching the length of these siRNA clusters were sampled randomly 1000 times, and their overlap with known elements was analyzed. A one-tailed p-value, reflecting the probability beyond the observed count, was derived from a Z-score calculated as the difference between the actual count and the mean of 1000 surrogate counts, normalized by the standard deviation of those surrogates, using a one-tailed Z test.
2.7. TE Annotation
Employing RepeatMasker (RM) version 4.0.6 with the Dfam_Consensus-20170127 and RepBase-20170127 databases tailored to Populus species, annotations for transposable elements (TEs) were generated. From these RM outputs, non-TE components—such as satellites, simple repeats, low-complexity sequences, and rRNAs—were filtered out.
2.8. Pseudogene Annotation
The intergenic sequences of the P. trichocarpa genome were used to identify the putative Pseudogenes. The overall pipeline used for this identification was generally based on the PlantPseudo workflow [41], and consisted of four major steps: (1) identify the masked intergenic regions with sequence similarity to known proteins using BLAST; (2) eliminate redundant and overlapping BLAST hits in places where a given chromosomal segment has multiple hits; (3) link homologous segments into contigs; and (4) realign sequences using tfasty to identify features that disrupt contiguous protein sequences.
3. Results
3.1. Poplar sRNAs Expressed During a Rust Fungus Infection
RNAs between 18 and 30 nt were extracted from Chinese white poplar under three conditions—mock-treated control (T01), biotrophic growth (T02), and urediniospore formation and release (T03)—and sequenced using an Illumina HiSeq 2500, generating 8.1 million clean reads (Table S1). These sRNA libraries enabled an examination of responses to Melampsora larici-populina infection across its key stages [42], with sequences processed to distinguish unique sRNAs for infected and control samples. Predominantly spanning 21 to 24 nt, typical of functional siRNAs and miRNAs, 56.6–65.4% of reads fell within this size range (Figure S1), and 33.4–44.0% aligned to the Populus trichocarpa genome v3.0 with up to one mismatch [30].
3.2. Pseudogenes and Transposons Act as Catalysts for the Formation of siRNA Clusters
Pseudogene-derived siRNAs may co-opt transposon-targeting siRNA pathways, amplifying RNA interference (RNAi) signals to broadly silence homologous genomic regions, including transposon-rich loci, thereby linking genome plasticity to adaptive epigenetic regulation. We then investigated the genomic positions of siRNAs to uncover their evolutionary origins. The sRNA reads matching miRNA, tRNA, or ribosomal DNA sequences were removed from the datasets, and only those that mapped to the genome and were of the characteristic siRNA length (21–24 nt) were retained. The vast majority of siRNAs expressed during the rust fungus infection were found to be unique to one stage, while few siRNAs were common to all three samples (Figure 1A). Thus, infections could therefore create stage-specific siRNAs, producing a more complex and targeted siRNA population.
We used an approach similar to that reported by Barber et al. [15] to identify the types of genetic features associated with the stage-specific differences in siRNAs. We grouped overlapping 21–24-nt siRNAs within a ≤100-bp window on the P. trichocarpa genome into clusters based on their location within the genome. Totally, we identified 301,155 siRNA clusters. The relative abundances of the siRNA clusters were calculated (tpm: transcripts per million mapped reads). We then classified and annotated the siRNA clusters according to their common siRNA length (21, 22, or 24 nt) and their location in the genome. The 22 nt clusters accounted for less than 10% of all the clusters, and the 24 nt clusters accounted the largest proportion (>50%). The siRNA clusters with an abundance of at least 5 tpm in at least one stage were retained for the downstream analyses (Data S1).
Subsequently, we assessed where siRNAs reside within the Populus genome, spanning regions such as 5′ UTRs, 3′ UTRs, introns, 2-kb flanking sequences of coding genes, transposons, pseudogenes, and intergenic zones. Of the siRNA clusters, 625 (4%) were located within pseudogenes, 2973 (18%) were in upstream regions, 2586 (15%) were in downstream regions, 330 (2%) were in 5′ UTR regions, 395 (2%) were in 3′ UTR regions, 982 (6%) were in introns, 8159 (48%) were in transposons, and 4703 (19%) were in intergenic regions (Figure 1B).
The siRNA clusters were enriched for transposon element (TE)-associated sequences, as shown by the large percentages of siRNAs derived from the TE sequences (Figure 1B). Many of the TE sequences were annotated as high-copy retrotransposon families (Data S2); for example, 26.9% (2556) were long-terminal-repeat (LTR) sequences. Strikingly, siRNA clusters in pseudogenes and transposons occurred at rates far exceeding random expectation (p < 0.001, one-tailed z test; Table 1), indicating that these elements, particularly retrotransposons, likely fuel the emergence of certain Populus siRNAs.
3.3. Infection Stage-Specific Differences in the siRNAs Primarily Originate from the Transposon Regions
In the samples from the two infection stages, the proportions of 21 nt and 22 nt siRNAs included in any given cluster were significantly correlated (total r = 0.96; T01 r = 0.93; T02 r = 0.93; T03 r = 0.94; p < 2.2 × 10^−16^), and their proportions were also positively correlated with the 24 nt siRNAs (total r = 0.82 and r = 0.85 for 21 nt and 22 nt siRNAs, respectively; T01 r = 0.64 and r = 0.75, respectively; T02 r = 0.64 and r = 0.75, respectively; T03 r = 0.66 and r = 0.76, respectively; p < 2.2 × 10^−16^) (Data S1). The 21 nt clusters have a shorter mean length (205 bp) and were more abundant (80 tpm) than the 24 nt clusters (370 bp, 17 tpm; Wilcoxon rank-sum p-value < 2.2 × 10^−16^). To investigate the clusters exhibiting large stage-specific differences, we ordered the clusters by their absolute value of fold change values (T02/T01 or T03/T01). The degree of difference in the 24 nt siRNA cluster abundances between stages increased for those located in the transposon and intergenic regions (Figure 1C–F), with the strongest trend observed in the transposon regions.
To further investigate the global differences in siRNA abundance in the retrotransposon families between the different infection stages, we mapped the 21 nt, 22 nt, and 24 nt siRNAs that perfectly matched the Populus genomes onto the retrotransposons. Figure 2A shows the retrotransposon families that showed an siRNA abundance of at least 100 tpm in at least one of the stages. The production of 21 nt siRNA lengths and the abundance of these siRNAs located in the retrotransposon families are more similar across the three stages than those of the 22 nt and 24 nt siRNAs.
The stage-specific differences in the abundance of siRNAs located in the transposons were examined using an X^2^ test for each siRNA length. The differences in the siRNA abundance for most transposon families between the stages (T02 vs. T01; and T03 vs. T01) was contingent on the 24 nt length sequences (p < 0.05; Chi-square test; Data S3; Figure 2A). The stage-specific differences in siRNA abundance for these retrotransposon families thus primarily resulted from the 24 nt siRNAs.
3.4. Populus siRNAs Mediate Plant–Pathogen Interactions, as Revealed Using a Shotgun Strategy
To explore the potential role of the Populus siRNAs in plant–pathogen interactions, we profiled the siRNA sequences that mapped to the P. trichocarpa genome which could regulate the genes of the fungal pathogen Melampsora larici-populina. In the biotrophic growth (T02) and urediniospore formation and release (T03) stages, more than 95.0% of the fungal genes (T02: 15,954; T03: 15,472; and total fungal genes: 16,372) were predicted to be targeted by siRNA sequences (Data S4). Nearly one-third of the expression of Melampsora larici-populina genes detected in our transcriptome were altered at least by twofold between sample pairs (T02/T01; T03/T01; and T02/T01) (Data S5). We hypothesize that some siRNAs could suppress pathogen pathogenicity by targeting pathogen virulence genes using a shotgun strategy. Importantly, if siRNA–pathogen interactions do not occur randomly, we would expect sRNA to preferentially target specific pathogen genes. As expected, compared with the control sample (T01), pathogen effectors were targeted by more siRNAs at the biotrophic growth and urediniospore formation and release phases (p < 0.001; permutation test; Figure 2B), suggesting there is a clear selection for the siRNA–target interactions.
3.5. The Populus-Specific miRNAs Are More Involved in the Regulation of the Disease-Resistance (DR) Genes
To investigate miRNAs in our sequencing data, recognized for their critical role in plant immunity [26,43], we mapped sRNA reads against Rfam (version 13) to annotate the sRNAome. Following the exclusion of scRNAs, tRNAs, rRNAs, snRNAs, and snoRNAs, the remaining 423 sRNA precursors were classified into 211 miRNA families, averaging 2.0 genes per family across the three infection stages. We found that 67.7% of the miRNAs could be annotated in miRBase (version 22); however, most of the putative novel miRNAs did not remain in the final dataset. These excluded miRNA sequences may not represent bona fide miRNAs or may be expressed at levels too low to reliably show that they have the characteristics of miRNAs (at least one read matching their star sequence; Data S6). The expression values of each miRNA gene were also calculated and normalized (Data S7).
Combining miRNAs identified here with those from closely related species in miRBase (v22), we distinguished Populus-specific miRNAs, finding 181 of 211 families unique to Populus, of which 70 aligned with prior findings (Figure 3A). Employing psRNATarget [40], we predicted 65,777 miRNA–target gene pairs with an expectation score of 5 (Data S8), and a subsequent expression analysis uncovered 1940 pairs exhibiting negative correlations (p < 0.05; Pearson correlation; Data S9). A public degradome dataset [44]; Data S5, available at https://nph.onlinelibrary.wiley.com/action/downloadSupplement?doi=10.1111%2Fnph.14046&file=nph14046-sup-0002-NotesS1-S12.xlsx, accessed on 18 April 2025) validated 553 pairs involving 165 miRNAs (Data S10), encompassing conserved pairs such as miR156-SPL, miR172-AP2, and miR397-LAC [45,46,47]. Excluding predicted disease-resistance (DR) gene targets from both conserved and Populus-specific sets left 161 DR targets for conserved miRNAs and 711 for Populus-specific ones, with 66% (120 of 181) of Populus-specific miRNAs regulating DR genes—far exceeding the 23.3% of conserved miRNAs (p = 0.009; Fisher’s exact test; Table 2, Data S11; Figure 3B).
To gain a better understanding of the functional roles of the Populus-specific miRNA families, we performed a pfam domain analysis of their predicted targets. The top 20 domains of the Populus-specific miRNA targets were highly correlated with the domains of the plant DR genes (Figure S2); for example, NB-ARC and LRR domains, which are known to dissociate upon effector-dependent activation [48], were overrepresented in the samples.
3.6. Populus-Specific miR6579 and miR6590 Target Numerous NB-LRR Genes
Plant NB-LRRs (encoded by NB-LRR genes) are commonly targeted by miRNAs, typically generating phased small interfering RNAs (phasiRNAs), which can reduce the levels of these targets in a cis or in a trans manner [44]. In the two stages of the rust fungus infection, a conserved miRNA superfamily composed of miR482 targeted the NB-LRRs at the encoded and conserved P-loop motifs. This family was found to be highly conserved among divergent plant species, including rice, Arabidopsis, grape (Vitis vinifera), and soybean (Glycine max) [23,49,50]. A second conserved family, miR2111, was identified in the eudicotyledons (Figure 3C), indicating the existence of this miRNA family prior to the divergence of these species from a common ancestor around 103 Mya [51]. The highly conserved 20 nt miR482s were predicted to target 221 NB-LRRs. Two related Populus-specific families, miR6579 and miR6590, which both contain 21 nt miRNAs, were predicted to target 132 and 49 NB-LRRs, respectively (Figure 3D–F). These two Populus-specific miRNA families had the highest transcript abundance during the mock-inoculated control plants, decreasing to a low level during the biotrophic growth (T02) and urediniospore formation and release (T03) stages (Data S7). By contrast, the highly conserved 22 nt miR2111s were predicted to target only nine NB-LRRs, three of which were also predicted targets of miR482, while two were predicted to be targeted by miR6579 (Figure 3D–F). These results indicate that the NB-LRRs in Populus are primarily targeted by conserved miR482s and Populus-specific miR6579s.
A careful examination of their precursor sequences revealed that miR6579 and miR6590 were intron-derived and pseudogene-derived miRNAs, respectively (Figure 3E). The parent of the pseudogene Chr03|4847214-4847421 was an NB-LRR gene (Potri.001G426600), suggesting that pseudogenes could act as catalysts for the formation of miRNAs. In poplar, the 21 nt miR6590 family showed characteristics canonical of the typical miRNA triggers associated with ARGONAUTE1 (AGO1) for the initiation of the PHAS loci, such as “U” in the 5′ position, “A” in position 10, and “C” in the 3′ position, indicating that this miRNA may trigger the biogenesis of secondary siRNAs [52]. It seems that species-specific miRNAs could become fixed in the regulatory network once a number of targets have evolved for which the presence of a miRNA-binding site would be beneficial.
4. Discussion
4.1. Transposons and Posttranscriptional Regulation
Assessing poplar siRNA distribution across genomic elements—including 5′ UTRs, 3′ UTRs, introns, 2-kb coding gene flanks, transposons, pseudogenes, and intergenic regions—we observed that transposons contributed 48% of siRNAs, with clusters notably tied to high-copy retrotransposon families. Long regarded as mere genomic noise, these ubiquitous transposons are now valued for their profound impact on eukaryotic genome structure, diversity, and flexibility [53], evolving faster than coding sequences due to greater mutation susceptibility [54]. Exhibiting distinct stage-specific fluctuations, the 24 nt siRNAs derived from these retrotransposons likely mediate posttranscriptional silencing by inducing repressive histone modifications, thus enhancing TE repression [55,56].
Our findings further illustrate that it is difficult to assess the connection between siRNAs and transposons in the process of plant defense. In general, the mobilization of transposons can be mutagenic [57], so host genomes have evolved elaborate mechanisms to suppress their activities. Most of the 24 nt siRNAs are likely to be involved in the transcriptional regulation of transposons through RNA-directed DNA methylation [1], but many TE-associated siRNAs were also found to influence non-TE transcripts exerting certain regulatory roles [58]; for example, 24 nt siRNAs were found to target pathogen genes [57]. Transposons are rapidly activated by biotic and abiotic stresses, however [59,60,61], potentially complicating the regulatory loop between the siRNAs and TEs.
siRNAs are an ideal tool for use in plant defense because of their target specificity and highly efficient responses [62,63] proposed that the retrotransposon-associated siRNAs may target genes with homologous sequences. Moreover, direct evidence in Arabidopsis suggests that 21 nt siRNAs derived from an Athila retrotransposon could posttranscriptionally regulate the Upstream binding protein 1b (UBP1b) gene to mediate the stress response [64]. In rice, transposon-derived 24 nt siRNAs could target the genes involved in hormone homeostasis, suggesting that transposon-associated regulatory modules could play a variety of roles in plant development and defense [65]. It is, therefore, possible that the 21–24 nt siRNAs may have another purpose besides the maintenance of genome stability, and may enhance plant defense by targeting the pathogen genes. Considering the high genetic diversity and mobilization of transposons, the regulatory system mediated by the TE-derived siRNAs could play a major role in the performance of plant defense.
4.2. Cross-Kingdom RNAi and sRNAs Mediate Plant–Pathogen Interactions
An endless arms race drives host–pathogen coevolution [66]. In plants, pathogens can invoke multiple layers of immune responses, and many defense genes were found to play key roles in this process [67,68]. Most potential pathogens are blocked by nonhost resistance and pathogen-associated molecular pattern (PAMP)-triggered immunity (PTI) [62]; however, many pathogens deliver a variety of effectors into plant cells to suppress host immunity. To counteract this attack, plants have evolved resistance proteins and immune mechanisms to sense these effectors and trigger a robust resistance response [69]. Plant sRNAs are essential regulatory noncoding RNAs that can posttranscriptionally silence their target genes [62,70]. Recent progress in profiling sRNAs, especially advances in next-generation sequencing technology, has revealed their extensive and complicated involvement in interactions between plants and viruses, bacteria, and fungi [70].
Fungal pathogens deliver sRNAs into plant cells to induce cross-kingdom RNAi of plant immunity genes, a mechanism initially identified by Weiberg et al. [43] and further elucidated through extracellular vesicle-mediated transport [71,72]. A similar strategy was also observed in animal–parasite systems [73]. Furthermore, sRNAs generated from the host plant cells can also be transferred into pathogen cells [66], as fungal pathogens are known to be capable of taking up external sRNAs and long dsRNAs [66,74]. Pathogen genomes encode variable effector repertoires for suppressing host immunity [75]; specifically, these effectors are often encoded in genomic regions of high plasticity [76], such as TEs [77]. Given that effectors are often clustered in these regions, TEs must function as an evolutionary force shaping these virulence factors by contributing to their diversification [77]. Sequence diversification is particularly important for pathogen effectors, which are essential components facilitating their coevolution with, and outmaneuvering of, the host.
Considering the high diversification of virulence factors generated by pathogens, it is reasonable to assume that transposon-derived sRNAs are also the best sources of new regulators for plant defense resistance, facilitating cross-kingdom RNAi. TEs, the most abundant component of most eukaryote genomes, are hotspots for generating sRNAs, which is particularly true of the retrotransposons. Here, we found that pathogen effectors were targeted by more poplar siRNAs in the biotrophic growth and urediniospore formation and release stages of infection by a rust fungus, in comparison with control. It is, therefore, reasonable to conclude that the activation of the siRNA repertoire does not occur randomly, and that their targets are instead involved in more central roles in the plant–pathogen interactions. Supporting this hypothesis, a previous study reported that the transfer of host sRNAs could silence virulence-related genes and suppress fungal pathogenicity in Arabidopsis [71].
From a plant breeding perspective, the siRNA-mediated cross-kingdom RNAi mechanism offers a promising strategy for engineering durable disease resistance. For instance, selecting or editing crop varieties to amplify siRNA production targeting conserved pathogen effectors could provide broad-spectrum resistance while minimizing evolutionary constraints imposed by single-gene R strategies [78]. Furthermore, the dynamic interplay between transposon-derived siRNAs and rapidly evolving pathogen virulence factors suggests that leveraging TE-rich genomic regions as “sRNA reservoirs” may enhance adaptive immunity in perennial crops like poplar. This approach aligns with evolutionary breeding principles, where harnessing natural genomic plasticity could yield cultivars capable of counteracting pathogen coevolution [79]. Critically, integrating sRNA profiling into marker-assisted selection programs may accelerate the identification of elite germplasm with enhanced RNAi-based defense traits, particularly against rust fungi exhibiting high effector diversification rates.
4.3. Disease Resistance Associated with Populus-Specific miRNAs
Acting as key detectors in effector-triggered immunity, plant NB-LRR genes—categorized into TIR-NB-LRRs with a Toll/Interleukin-1 receptor-like domain and CC-NB-LRRs with a coiled-coil domain [80]—fortify innate defenses against a wide range of pathogens and pests [81], with miRNAs triggering phasiRNA production to reduce transcript abundance locally and across genes [82]. These genes, marked by frequent duplication, loss, structural changes, subfamily diversification, and species-specific copy number variations [81,83,84], contrast with stable-copy genes, driving rapid, lineage-specific immune adaptations. While conserved miRNAs often target conserved domains like P-loop motifs across families [52,85], newer miRNAs exhibit less specificity, potentially binding mRNAs less selectively [86]. In Populus, the salicoid whole-genome duplication (WGD) event, common to Salicaceae, has enriched genes in specialized functional groups [87], with targets of recently emerged miRNAs spanning DR NB-LRR genes, ABC transporters, and select transcription factors, aligning with retained duplicates in this genus [30]. Our analysis shows Populus-specific miRNAs, despite fewer total targets, surpass conserved miRNAs in regulating NB-LRR genes, reflecting a defense-linked diversity that likely governs newly evolved DR genes, boosting host adaptability. Over time, these emerging miRNAs may refine their roles within regulatory networks, extending control to additional specialized targets [44,86], shedding light on siRNA’s origins and regulation in Populus trichocarpa against Melampsora larici-populina.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Matzke M.A. Birchler J.A. RN Ai-mediated pathways in the nucleus Nat. Rev. Genet.20056243510.1038/nrg 150015630419 · doi ↗ · pubmed ↗
- 2Ghildiyal M. Zamore P.D.J.N.R.G. Small silencing RN As: An expanding universe Nat. Rev. Genet.2009109410810.1038/nrg 250419148191 PMC 2724769 · doi ↗ · pubmed ↗
- 3Liu P.P. Montgomery T.A. Fahlgren N. Kasschau K.D. Nonogaki H. Carrington J.C. Repression of AUXIN RESPONSE FACTOR 10 by micro RNA 160 is critical for seed germination and post-germination stages Plant J.20075213314610.1111/j.1365-313X.2007.03218.x 17672844 · doi ↗ · pubmed ↗
- 4Olovnikov I. Aravin A.A. Toth K.F. Small RNA in the nucleus: The RNA-chromatin ping-pong Curr. Opin. Genet. Dev.20122216417110.1016/j.gde.2012.01.00222349141 PMC 3345048 · doi ↗ · pubmed ↗
- 5Carthew R.W. Sontheimer E.J. Origins and Mechanisms of mi RN As and si RN As Cell 200913664265510.1016/j.cell.2009.01.03519239886 PMC 2675692 · doi ↗ · pubmed ↗
- 6Nykänen A. Haley B. Zamore P.D. ATP requirements and small interfering RNA structure in the RNA interference pathway Cell 200110730932110.1016/S 0092-8674(01)00547-511701122 · doi ↗ · pubmed ↗
- 7Harding J.L. Horswell S. Heliot C. Armisen J. Zimmerman L.B. Luscombe N.M. Miska E.A. Hill C.S. Small RNA profiling of Xenopus embryos reveals novel mi RN As and a new class of small RN As derived from intronic transposable elements Genome Res.2014249610610.1101/gr.144469.11224065776 PMC 3875865 · doi ↗ · pubmed ↗
- 8Khraiwesh B. Arif M.A. Seumel G.I. Ossowski S. Weigel D. Reski R. Frank W. Transcriptional control of gene expression by micro RN As Cell 201014011112210.1016/j.cell.2009.12.02320085706 · doi ↗ · pubmed ↗
