Evolutionary Refinement of Mitochondrial and Plastid Targeting Sequences Coincides with the Late Diversification of Land Plants
Parth K Raval, Carolina García García, Maria-Darline Somoano Sanchez, Sven B Gould

TL;DR
This study explores how plant cells evolved better ways to transport proteins into mitochondria and plastids, helping land plants thrive.
Contribution
The study reveals evolutionary changes in targeting sequences and import mechanisms that prevent mis-sorting in land plants.
Findings
N-terminal targeting sequences in eudicots evolved to avoid positive charges and favor phosphorylatable amino acids.
Experimental evidence shows evolved targeting peptides prevent mis-sorting between mitochondria and plastids.
A phosphatase, PAP2, is linked to targeting sequence processing and originated in embryophytes.
Abstract
Plastids and mitochondria are key to plant survival and adaptation. The evolutionary progress of land plants (embryophytes) witnessed gene and genome duplications, and the expansion of organelle-localized proteins. To deal with the increase of nuclear-encoded proteins, targeting to and import by the mitochondrion and plastid are known to have adapted in multiple ways. It included the addition of entirely new import channels and lineage-specific import receptors. Through comparative genomics and experimental biology, we uncover further changes in the organelle import machineries. Their evolution likely served to enhance the rate of protein import and improve its physiological regulation, e.g. via interactions between the import channel and respiratory complex. On the cargo side, nuclear-encoded N-terminal targeting sequences of mitochondrial targeting peptide (TP) and plastidal (pTPs)…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6- —Deutsche Forschungsgemeinschaft10.13039/501100001659
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPhotosynthetic Processes and Mechanisms · Algal biology and biofuel production · Genomics and Phylogenetic Studies
Introduction
Mitochondria and plastids, two key compartments of photosynthetic eukaryotes, originate from Gram-negative bacteria (Keeling 2010 ; Archibald 2015; Martin et al. 2015), and part of their transformation from a free-living organism to an organelle includes the transfer of genetic information to the host nucleus (Timmis et al. 2004). A plethora of core functions they perform (Hatefi 1985; Eberhard et al. 2008; Wang and Youle 2009; Mesmin 2016; Hölzl and Dörmann 2019) hence hinge upon cytosolic protein biosynthesis and the subsequent import of proteins across a pair of membranes (Scotti et al. 2000; Hewitt et al. 2011; Day et al. 2014; Diederichs et al. 2020; Richardson and Schnell 2020). Protein import complexes, consisting of dozens of subunits, serve as a necessary gateway to all organelle functions and, by that, they affect fitness and macroevolutionary trends (Leebens-Mack et al. 2019; Knopp et al. 2020). Protein translocation across organelle membranes comprises an interplay of targeting sequences, chaperones, transport channels, and also kinases and phosphatases (Richardson and Schnell 2020; Schulte et al. 2023; den Brave et al. 2024). Owing to the challenge of simultaneous protein sorting between two organelles of endosymbiotic origin—which is based on very similar principles (Garg and Gould 2016; McKinnon and Theg 2019; Knopp et al. 2020)—this process is necessarily more complex in algae and plants and has likely been under a pertinent selection pressure since plastid origin.
The emergence of major archaeplastidal lineages such as the Chloroplastida or Embryophyta witnessed the expansion of organelle proteomes, which likely improved the ability to respond to abiotic stresses (Heinnickel and Grossman 2013; de Vries and Archibald 2018; Knopp et al. 2020; Raval et al. 2024). Translocation across the correct organellar membranes is key for these proteins to be able to perform their function. The main import channel of the outer chloroplast membrane in Chloroplastida, TOC75, likely originated through the duplication of the original import channel OEP80, and allowed for a division of labor between the two (Knopp et al. 2020). Concomitant with the origin of TOC75 in the green lineage was the loss of an N-terminal phenylalanine—and, to a lesser degree, other bulky amino acids (AA) such as W, Y, or L—based targeting motif (Wunder et al. 2007; Köhler et al. 2015; Knopp et al. 2020) which acted as the first point of contact to discriminate between mitochondrial and plastid cargo in rhodophyte and glaucophyte algae, apart from the features downstream of this motif that further conferred organelle specificity (Steiner et al. 2005; Gould et al. 2006; Gruber et al. 2007). We argue that the loss of this first point-of-contact at least partly compromised the initial identification of cargo at the outer organelle membranes (e.g. for a subset of proteins that strongly depended on F-based differentiation between plastids and mitochondria) and paved the way for additional mechanisms for import specificity to evolve. The expansion of organelle proteomes in land plants (Raval et al. 2024) likely provided a further positive feedback loop for the selection of a more flexible, yet also accurate cargo recognition.
Several strategies ensure correct protein sorting in the land plant models studied, among which N-terminal targeting sequences are one. The literature refers to such targeting sequences with various denominations, including presequence, transit sequence, targeting signal, or targeting peptide (TP). Mitochondrial TPs (mTPs) of matrix proteins are on average 50 AA long, while those of plastid proteins (pTPs) are usually longer, with some reaching up to 100 AA (Kim and Hwang 2013). mTPs and pTPs can vary substantially across proteins and species, and although some signature secondary structures are attributed to mTPs and pTPs (Cramer et al. 1990; Wang and Weiner 1993), detailed differences in secondary structures are not that well defined, and neither are all their physio-chemical properties (Lee et al. 2019; Caspari et al. 2023). One critical physicochemical property that aids cells in distinguishing cargo concerns charge differences, which are apparent even when considering only the first 20 AA of TPs (Garg and Gould 2016). In Arabidopsis, the first 20 AA of mTPs (approximately one-third of the entire TP) are characterized by a higher positive charge, which is typical for nuclear-encoded mitochondrial matrix proteins in general, whereas those of the pTPs are enriched with serine residues (Bhushan et al. 2006; Lee et al. 2006a; Pujol et al. 2007; Ge et al. 2014). Phosphorylation of pTP residues, followed by dephosphorylation at the plastid outer membrane, regulates the import rate of some plastid cargo (May and Soll 2000) and simultaneously suppresses import by mitochondria of otherwise plastid-targeted proteins (Garg and Gould 2016). While in-silico modeling suggests that features such as the overall charge are sufficient to separate an mTP from a pTP (Ge et al. 2014; Garg and Gould 2016), more experimental support is needed (Bhushan et al. 2006; Holbrook et al. 2016; Lee et al. 2019). Considering the key role of the first 20 AA of TPs as the first point of contact, the causes and consequences of the substantial evolutionary shift from a highly conserved bulky AA-based motif to a serine-enriched motif are important to understand.
Current data regarding the interplay of targeting sequences and import components of mitochondria and plastids are largely restricted to the model system Arabidopsis and a few other angiosperms. A comprehensive picture of how conserved the interplay is across Archaeplastida or how changes might have influenced major transitions in algae and plant evolution is missing (de Vries et al. 2016). We do not yet fully understand how plastid proteins are exactly distinguished from mitochondrial ones, how dual-targeted cargo is selected, or what exact regulatory role the evident cytosolic phosphorylation and de-phosphorylation at the outer membranes of these organelles might play (Waegemann and Rgen Soll 1996; Martin et al. 2006; Garg and Gould 2016; Zhang et al. 2016; Law et al. 2018). Here, combining comparative genomics, phylogenetics and experimental biology in the chlorophyte Chlamydomonas and an early diverging land plant, the bryophyte Marchantia, we track the origins and evolution of the protein import components across the archaeplastidal lineage and characterize changes that improved accuracy, and likely the rate, of organelle protein import.
Results
Divergence of Mitochondrial and Plastid Targeting Sequences upon Terrestrialization
Membrane receptors and channels can recognize a majority of their cargo based on N-terminal TPs, the initial region (e.g. the first 20 AA) of which acts as the first point-of-contact at the organelle outer membrane, and it is also the region where major evolutionary changes occurred. Note that we only focus on the first 20 AA of the TPs for these reasons, but refer to them as “the features of TPs” for simplicity. To trace the evolutionary history of changes in physicochemical properties of the first 20 AA of TPs from algae to angiosperms, we analyzed the TPs of proteins from five major chloroplastidal lineages inferred (based on orthology with experimental proteomes) to be either mitochondria-, plastid-, or dual-targeted (mTP, pTP, or dTP, respectively; Table S1b, Fig. S1). For these inferred TPs, we plot average numbers of serine and threonine (phosphorylatable AA) and their electric charge for each of the five clades. In algae, the distributions of phosphorylatable AA and charge for mTP and pTP overlapped (no statistically significant differences), making mTP and pTP rather indistinguishable for these features at the beginning of the TPs (Fig. 1a and b). Land plants (monocots and eudicots in particular) differ and show similar values across species with a clear separation between mTPs and pTPs on two counts: the beginning of the pTPs is enriched in phosphorylatable AA (Ser and Thr) from streptophyte algae to eudicots (Fig. 1a) and the beginning of the mTPs is enriched with respect to a positive charge across the sequences analyzed (Fig. 1b). Angiosperms, both mono- and eudicots, showed the clearest separation. The values of these features in the TPs of eudicotic proteins imported by plastids and mitochondria (the dual targeted proteins) range between those with only a single destination, underscoring a compromise with respect to dually targeted proteins exemplified in eudicots (Fig. 1a and b).
The evolution of AA charge across the TPs of photosynthetic eukaryotes. The total number of a) phosphorylatable AA (serine + threonine) and b) electric charge for each protein, in a given species, was calculated from the first 20 AA of N-terminal TPs of mitochondrial, plastid, and dual-targeted proteins. Mean of all proteins of a given organelle in a given species was treated as a representative and plotted for each clade within a violin plot. Each violin, therefore, represents organelle proteomes of an entire clade, with the top three representing land plants and the bottom two representing green algae. Within each violin plot, the vertical line represents the median, the thick horizontal line the interquartile range (from 25th to 75th percentile), and the trailing thin horizontal line represents the full data range. Multiple ANOVA (with Bonferroni correction for multiple comparisons) was performed across and within clades and organelles. Select statistically significant pairs are shown on the right side (asterisks), full statistics are summarized in the figure source data. The exact N, number of organelle TPs for a given species from which the mean was calculated to be included in the clade-violin plot, depended on a species; therefore, a range is shown for a given organelle and clade on the right side of each violin; n, number of species in a given clade violin are indicated on the left.
To investigate where organelle TPs began to diverge during evolution (e.g. at the base of chloroplastida vs. later along with the divergence of land plant clades), we reconstructed the ancestral states of the two traits of interest (mean charges and phosphorylatable AA) at the beginning of the TPs across five plant lineages (Fig. 2, Figs S2 to S3) using these trait values in extant species (Fig. 1). A comparison between eudicots and chlorophyte algae ancestors underscores that at the eudicot ancestral node, the pTPs were already enriched in phosphorylatable AA and the mTPs accumulated positively charged AA, traits that were much less prominent in the chlorophyte ancestor (Fig. 2). Among eudicots, the mustard and brassicaceae families showed that the highest values and changes in the two traits occurred concurrently: species with a higher positive charge in mTPs also showed a higher number of phosphorylatable AA in the pTPs, and vice versa. The concomitant divergence of both is likely to aid in further import specificity. On the contrary, in chlorophyte algae, changes in these two traits did not occur simultaneously, and across species, the variation is high (Fig. 2b). Generally, positive charges in mTPs and phosphorylatable AA in pTPs started to become evident at the time of streptophyte and land plant origin (Fig. 2c and d).
*Ancestral state reconstruction of TP divergence. Ancestral states of phosphorylatable AA in the first 20 AA of pTPs and charges of mTPs for five plant lineages were reconstructed using phylogenies for each lineage (Fig. S2) and the phylogenies were color-coded based on these values across ancestors. A complete phylogeny and ancestor state values are shown for a) eudicots and b) chlorophyte algae. Based on a similar reconstruction, inferred means of phosphorylatable AA residues across c) all pTPs and means of positive charges across and d) all mTPs of the last common ancestors of each group are plotted (c to d, P < 0.05, Welch's t-test). The complete ASR of streptophyte algae, bryophytes and monocots, together with the raw data for all five clades, is summarized in Fig. S3 and its source data.
If TP divergence is indeed aiding the correct targeting and maintenance of organelle biology, it is expected to be enriched based on protein function and a contribution to defining the organelle's biology. To probe this, we analyzed a more reliable localization and functional annotation dataset of Arabidopsis thaliana. We found that plastid proteins involved in photosystem biology and redox reactions are more enriched in phosphorylatable AA (Fig. S4). They have a lower positive charge than the rest of plastid proteins and of mTPs, i.e. the pTPs of these functional categories are more diverged from the mTPs than generally observed. Some key mitochondrial proteins, e.g. TCA cycle proteins, also show a higher mTP charge than the average of all mTPs (Fig. S4).
The Divergence of Targeting Sequences Enhances the Accuracy of Protein Sorting
To test the hypothesis that the divergence of TP charge alone changes targeting destination, we utilized the mitochondrial malic enzyme 1 (MME1) as an exemplary protein since the first 20 AA of this protein showed a substantial positive charge in algae Chlamydomonas reinhardtii and Marchantia polymorpha (a system to study land plant evolution [Bowman et al. 2017; Bowman et al. 2022]), allowing us to lower the charge and generate variants in both algae and land plant. Cloning of the MME1 homolog of C. reinhardtii (CrMME1; Cre06.g268750) proved difficult, but the first 20 AA of the mitochondrial protein, with a charge of +5, are sufficient for correct mitochondrial targeting and the positive charge in the TP is crucial, as constructs in which the charge is changed to −5 or 0 localize predominantly in the cytosol (Fig. 3, Fig. S5). The MME1 homolog of M. polymorpha (MpMME1; Mp4g02270) has an overall charge of +4. The two arginine and two lysine residues that make up the positive charge allowed us to address the relative contributions of R vs. K. As predicted, the reporter fusion with the full MpMME1 sequence shows a mitochondrial localization, while the −4 mutant construct accumulates in the cytosol (Fig. 3). It is debated, whether the positive charge or AA identity per se is crucial for mitochondrial import (Lee et al. 2019; Caspari et al. 2023). Substituting arginine for lysine (and vice versa) still leads to correct mitochondrial targeting (Fig. 3), and so for MpMME1, a positive charge matters and not whether it is mediated by arginine or lysine. The analysis of the first 20 AA of 214 mTPs from 72 eudicots, where the positive charge is pronounced (above +3), shows a general preference for arginine (Fig. S6). Those proteins that almost exclusively use arginine in the mTPs are bioenergetics related, while those that strongly prefer lysine are mostly information processing related (Fig. S6). A preference for either lysine or arginine might hence depend on the context of protein function, which appears odd considering targeting sequences are cleaved upon import into the matrix (Ghifari et al. 2019).
Altering only the charge of the first 20 AA changes the targeting behavior of TPs. Subcellular localization of MME1 homologs in a) the bryophyte plant Marchantia and b) the chlorophyte alga Chlamydomonas. Each row shows the localization of an independent reporter fusion, and the top row (for each species) represents the wild type (WT). Below the WT are the variants where key AA in the first 20 amino acid region of the TPs were substituted. The numbers of these key AA (in single-letter codes) are shown on the left. The full sequence of the first 20 AA and resulting charge (Ph., phosphorylation) are shown in the table on the bottom right. MME1 (and its variants) was localized in Marchantia gemma, thallus, and protoplasts via citrine and in Chlamydomonas single cells via eGFP fusion constructs. Plastids were imaged through their autofluorescence and mitochondria through MitoTracker Red. Chlamydomonas images were deconvoluted from the Z-stacks and Marchantia images are from single plane imaging. Scale bars 5 µm. See Fig. S5 for co-localization analyses and Table S7 for a range of brightness-contrast settings for the images.
Regardless of AA identity, the divergence between mitochondrial and plastid TPs (and an enrichment of serines in pTPs) appears to be a necessity. When the positively charged AA of the MpMME1 mTP are substituted with serine residues, the fusion construct now predominantly targets the plastids. Only in the protoplast preparations did we observe some targeting of the construct to mitochondria (Fig. 3a, Fig. S5). It shows that replacing positively charged AA with serine residues is sufficient to re-direct—or, in this case, mistarget—one of the key mitochondrial marker proteins from mitochondria to plastids. Note that a similar experiment in Chlamydomonas did not result in any targeting of CrMME1 to the plastid (Fig. 3). This is maybe because of a requirement, of unknown nature, for a generally longer pTP in the chlorophyte alga (Caspari 2022) and a possible lack of phosphorylation in light of an evolutionary late enrichment of phosphorylatable AA in pTPs (Figs 2 and 3). These observations, together with a previous study showing mistargeting of rbcS to mitochondria upon serine-to-arginine substitutions (generating a chloroplast-aversion signal) (Lee et al. 2019), underscore the importance of the initial 20 AA in determining organelle specificity.
Origin of a Membrane-anchored TP Dephosphorylase at the Water-to-Land Transition
Two purple acid phosphatases, PAP2 and PAP9, are C-tail anchored into the outer plastid membrane in Arabidopsis, where they work on pTPs (Sun et al. 2012; Zhang et al. 2016; Zhang et al. 2021). AtPAP2 (AT1G13900) and AtPAP9 (AT2G03450) share a 70% global sequence identity and are both characterized by the presence of an N-terminal purple acid phosphatase domain (N-PAP) and a C-tail anchor (C-TMD; Fig. 4). In addition, AtPAP2 contains two lysine residues and a stretch of AA preceding the C-TMD, which, unlike PAP9, might enable AtPAP2 to simultaneously anchor into the mitochondrial outer membrane (Zhang et al. 2021) to also process mTPs (Law et al. 2015).
Membrane-docking of PAP2/9 evolved at the time of plant terrestrialization. a) Key differences between Arabidopsis PAP2 and PAP9 include an additional stretch of AA before the C-terminal membrane domain and a lysine couplet present in PAP2. The Marchantia PAP2/9 homolog lacks these features and is likely a PAP9. b) Domain architecture of the PAP2/9 family across various Chloroplastida. A characteristic N-terminal PAP domain (N-PAP) already evolved in chlorophyte algae, but a C-tail membrane anchor only later in streptophytes. Copy number increase in certain species is indicated by the additional icons under the column paralog domain architecture.
Purple acid phosphatases are highly conserved across all domains of life (Bhadouria and Giri 2022). We clustered all homologs gene families from 42 species (Table S4c) based on sequence similarity (Emms and Kelly 2019) and screened the clusters (Table S4b) for known import components (Table S4c-d). Our protein clustering identified AtPAP2 and AtPAP9 to be part of a smaller and likely Chloroplastida-specific family of PAPs that we termed the PAP2/9 family. Like many other organelle related proteins, the PAP2/9 family also underwent a substantial copy number increase in embryophytes (Fig. 4). Further sequence analysis suggests that sometime during plant terrestrialization, and commencing in streptophyte algae such as Klebsormidium, some of these PAP2/9 members acquired a C-TMD (Fig. 4). In some embryophytes, the members with a C-TMD duplicated and diverged further, likely giving rise to a plastid-anchored PAP9 and a dual-anchored PAP2. To experimentally probe this, we focused on PAP2/9 homologs of Chlamydomonas (Cre11.g468500) and Marchantia (Mapoly0122s0035). The algal homolog encodes a putative transmembrane domain at the very N-terminus (ca. 1 to 30 AA), whereas the Marchantia homolog encodes a land plant-like C-TMD (Fig. S7), albeit lacking an AA stretch before the C-TMD that is present in AtPAP2. Reporter fusion of the algal N-terminal region alone led to a mitochondrial (likely matrix) localization, whereas that of Marchantia localized to the plastid (Fig. S7).
Remodeling of Import Platforms Alongside Plant Diversification
The protein clustering (Table S4) allowed us to trace the evolutionary history of import components across the Chloroplastida, which shows that many components of mitochondrial and plastid protein import were duplicated around the time of plant terrestrialization (Table S4e, Fig. S8), i.e. from the streptophyte algae Chara and Spirogloea onwards. This is in line with a more general pattern regarding the expansion of mitochondrial and plastidal functions during the transition from water to land (Raval et al. 2024).
At the inner mitochondrial membrane, we traced the duplication and divergence of the main import channel TIM23 (Fig. 5a and b). It expanded the TIM23 1 to 3 family and added isoforms such as TIM23-II, which in Arabidopsis allows the fine-tuning of protein import in response to changing levels of respiratory complex I occurring, e.g. under oxidative stress (Wang et al. 2012 ). Associated proteins such as TIM21 were also duplicated (Fig. S8), with the copies likely interacting with proteins of the electron transport chain (Murcha et al. 2014b). We also capture duplications of peripheral components of TIM23, TIM22, and the TIM23-17 complex at large, such as PRAT (Preprotein and AA transporters) in the green lineage (Fig. 5a and b, Fig. S8). These newly recruited components, which connect protein import with respiration and oxidative stress, likely benefited a life on land where oxidative stress is more pronounced. At the outer mitochondrial membrane, the receptors TOM20 and TOM22, and the main import channel TOM40, also significantly diverged from their nonphotosynthetic homologs (Duncan et al. 2013). One reason could be the need for better cargo selection now that the plastid endosymbiont was present with its own emerging import apparatus, namely TOC/TIC (Garg and Gould 2016). That is in accordance with the Arabidopsis mitochondrial TPs, unlike those of animals, displaying two hydrophobic domains that bind to two separate binding sites of the AtTOM20 (Abe et al. 2000; Perry et al. 2006; Rimmer et al. 2011; Murcha et al. 2014a; Murcha et al. 2014c; Heidorn-Czarna et al. 2022 ). In combination, the changes at the two mitochondrial membranes likely improved import rate, accuracy, and regulation.
Recruitment, expansion, and divergence of import receptors. a) GCNs of key import channels at the inner membranes of mitochondria (crimson) and plastids (green) across species of interest (n = 42 Archaeplastida species) and b) a distribution across each clade of interest. (c to e) Ancestral state reconstruction for the key components of the TIC complex (n = 137 Archaeplastida species, six outgroups). Ancestors of Archaeplastida, rhodophytes, chlorophytes, and embryophytes are indicated by light green, red, green, and mustard arrows. The phylogeny of the species used and full ASR of the three import components is shown in Figs S9 to S12.
At the inner plastid membrane, duplications of TIC20, TIC62, and TIC22 (Fig. 5, Fig. S8) gave rise to quantitative and qualitative differences in plastid import. For instance, a duplication of TIC22, captured as two TIC22 sub-families (Fig. 5a), provided functionally redundant isoforms that increase the rate of import in land plants (Rudolf et al. 2013). In contrast, isoforms resulting from duplications of TIC20 contributed to a qualitatively different import complex characterized by TIC20, TIC56, and TIC214 (Kikuchi et al. 2013). While the green lineage-specific TIC20-IV (Fig. 5a and b) and TIC56 (Fig. 5c) and an ancient TIC100 (Fig. 5d) formed the core of this complex, a massive divergence of another TIC20 isoform likely resulted in TIC214, which only shares homology to TIC20 in the region encoding the six transmembrane spanning domains (de Vries et al. 2015). After its emergence in Chloroplastida, as also substantiated by its presence in Chlamydomonas (Jin et al. 2022; Liu et al. 2023), this complex was further modified in land plants via the addition of TIC12 (Fig. 5d), a recently identified component of this complex (Zhao et al. 2022 ).
Discussion
The eukaryotic cell is required to orchestrate the correct targeting of thousands of cytosolically translated proteins. Eukaryotes, especially photosynthetic ones, have therefore evolved a variety of mechanisms that contribute to the correct targeting and import selection: organellar mRNA localization (Weis et al. 2013; Tian and Okita 2014), alternative transcription and translation initiation (Mackenzie 2005), ubiquitination (Thomson et al. 2020), piggy-back transport (Thoms 2015), phosphorylation (May and Soll 2000; Garg and Gould 2016; Lee et al. 2023), and a plethora of small GTPases when taking vesicle trafficking as a mean to specifically transport proteins from one destination to another into account (Stenmark 2009; Guo et al. 2014). Some work constitutively, while others secure protein homeostasis of a compartment and can swiftly respond to environmental changes (Bauer et al. 2015). Each import strategy adapts according to the requirements of a specific evolutionary niche and, conversely, changes in protein targeting can offer novel possibilities.
The expansion of plastid proteomes underpinned the evolutionary success of the chloroplastidal lineage and its subsequent adaptation to the land and its novel stressors (de Vries et al. 2016; Raval et al. 2024). Concomitant changes of the import machinery, such as a duplication of the main outer membrane import channel TOC75, allowed for a better rate of import of an increasing amount of cargo, which, in return, is likely associated, for instance, with a better response to high light stress in the green lineage (Knopp et al. 2020). Similarly, the expansions of plastidal and mitochondrial inner membrane channel components (Fig. 6, Fig. S8) likely increased import rate, which can help to adapt to altering levels of oxidative stress by better regulating import.
Schematic of mitochondrial and plastid protein import evolution. For simplicity, a linear evolution of the main archaeplastidal lineages is pictured at the top. Chloroplastidal diversification on land was characterized by organellar adaptation toward various stressors (Raval et al. 2024) and a recruitment of novel proteins that require import. At the origin of the Archaeplastida, a phenylalanine-based motif in the pTP aided the plastid targeting as the first point of discrimination between mitochondria and plastids. That feature was lost at the origin of the green (chloroplastidal) lineage, arguably also lowering the import specificity and causing some protein mistargeting between plastids and mitochondria (Knopp et al. 2020). The selection against such a mistranslation at the origin of the green lineage and subsequently during its evolution (in light of the expansion of the organelle proteomes in the land plants [Raval et al. 2024]), contributed to the evolutionary remodeling of the import machinery. Duplication and divergence of key import channels (e.g. TOC75, TIC20, and TIM23) gave rise to components such as an additional, TIC214-based import complex. It aided in the import of an expanded organelle proteome, and likely at an increased rate, facilitating adaptation to light and oxidative stress (Knopp et al. 2020). On the cargo side, the N-terminal targeting sequences for plastids (pTPs) and mitochondria (mTPs) diverged, whereby pTPs accumulated phosphorylatable AA (depicted in green) and averted positively charged AA (depicted in Salmon) and hereby added a component from which to differentiate import cargo in a phosphorylation- and charge-dependent manner. The evolutionary divergence of TPs was undergirded by Δψ and a coevolution of receptors by mechanisms such as (i) initial, and likely ancient, charge-based screening by the outer membrane receptor TOM20/44, and (ii) a Δψ-dependent activation of a cation-selective channel TIM23, screening for positively charged mTPs on the matrix side. Chaperons might improve accuracy (May and Soll 2000; Qbadou et al. 2006; Jarvis 2008; Lamberti et al. 2011) and the membrane recruitment of a dephosphorylase such as PAP2 likely contributed to import the regulation of (dual-targeted) cargo.
Co-evolution of import receptors and N-terminal targeting sequences also contributed to improved cargo selection. For instance, TOM20 and TOM22, which mediate the entry of the mTP (Rimmer et al. 2011), appear to be substantially diverged in Archaeplastida, and the mechanism of cargo selection differs between plants and animals: in animals, the cytosolic domain of TOM22 interacts with the mTP and imports it further, while the plant TOM22 serves as a gate-keeper by occupying a hydrophobic cradle on TOM20 (Macasev et al. 2000; Carrie et al. 2010; Rimmer et al. 2011). The plant TOM22 gets outcompeted by mTPs and displaced, but likely not by a pTP (Macasev et al. 2000; Carrie et al. 2010; Rimmer et al. 2011), which adds the kind of TP divergence we observe for angiosperms but not algae (Figs 1 and 2). TOM22 interacts with the mTPs via a negatively charged region and likely selects positively charged mTPs in the glaucophyte alga Cyanidioschyzon (Hirata et al. 2024), once more underscoring the relevance of charge-based selection for organelle cargo at the outer mitochondrial membrane in early branching phototrophic eukaryotes. Ancestral states of these groups suggest that the divergence of charge predates the diversification within eudicots and angiosperms, and it might have played a role in the evolutionary success of the groups.
At the inner mitochondrial membrane, TIM23 forms a cation-selective channel that is activated by the mTP and the membrane potential (Δψ) of the membrane (Truscott et al. 2001). Δψ can actively facilitate the entry of positively charged mTPs, while averting a negatively charged one, such as a phosphorylated pTP (Garg and Gould 2016). At least two major receptors at the inner and outer membranes, along with Δψ, hence contribute to a selection pressure that suppresses positive charges in pTPs. It is this divergence that our substitutions erase, resulting in the mis-targeting (and dual targeting in the case of protoplasts) of a mitochondrial marker protein (Fig. 3).
On the plastid side, changes at the inner envelope membrane and in the pTP also improved the accuracy of protein import. It is feasible that the chloroplastidal recruitment of the TIC214-containing import complex (Fig. 5) further aided in improving the level of import accuracy that was at least partly compromised with the loss of a phenylalanine-based TP selection system at the origin of the Chloroplastida (Knopp et al. 2020). Molecular dynamics simulation on the TIC–TOC super complex structure predicted an interaction of the pTP with TIC214 (Liu et al. 2023), which was confirmed by cross-linking experiments that revealed an interaction of the rbcS TP with positively charged residues of TIC214 in its preprotein translocation path (Jin et al. 2022). Such an interaction provides a possibility to differentiate better between the TPs (e.g. repel the generally positively charged mTP and select a pTP), contributing to the evolutionary divergence of the TPs witnessed along with the late diversification of plants.
Evolving phosphorylatable sites in a pTS only serve a purpose when accompanied by appropriate kinases and phosphatases. The kinase STY17 in Arabidopsis phosphorylates pTPs, which is then imported at an increased rate, aided by HSP70–14/3/3 guidance complex (Waegemann and Rgen Soll 1996; May and Soll 2000; Martin et al. 2006; Lamberti et al. 2011; Schwenkert et al. 2011; Von Loeffelholz et al. 2011; Law et al. 2018). The evolutionary history of such chaperons, kinases, and phosphatases remains blurry, owing to the conserved nature of functional domains and their general abundance. By combining protein clustering, domain analyses, and experimental approaches, however, we were able to verify the embryophytic recruitment of at least one outer plastid membrane-localized dephosphorylase, a member of the PAP2/9 family (Fig. 4), which plays a role in protein import in Arabidopsis (Law et al. 2015; Zhang et al. 2016). Identifying the members of this large protein family in individual species remains challenging and notwithstanding the caveats common to orthologue annotations, we identify homologs of AtPAP2 in Chlamydomonas and Marchantia. The mitochondrial localization of CrPAP2 might hint at the ancestral localization of the PAP2/9 protein family represented by chlorophyte algae. Marchantia encodes only a single C-TMD containing PAP (Mapoly0122s0035), which lacks the characteristic AA stretch present in AtPAP2 but encodes a C-TMD (Fig. S7). We refer to this protein as MpPAP9, whose plastid localization (Fig. S7) might capture an intermediate state of PAP2/PAP9 protein family evolution. Along with a concomitant enrichment of phosphorylatable AA in the pTPs, particularly pronounced in angiosperms (Figs 1 and 2), a recruitment of a plastid outer membrane dephosphorylase (Fig. 4) hints at a layer of differential targeting control to mitochondria and plastids still poorly understood (Fig. 6).
A duplication of this membrane-anchored phosphatase, likely in angiosperms (Fig. 4), gave rise to a paralog such as AtPAP2, which is dually localized to plastid and mitochondrial outer membranes (Sun et al. 2012; Zhang et al. 2016; Zhang et al. 2021). Since phosphorylated AA in the pTPs are thought to act as a mitochondria aversion signal (Lee et al. 2006; Garg and Gould 2016), the presence of a dephosphorylase on the mitochondrial outer membrane appears to functionally reverse the evolutionary changes, potentially allowing for organelle mis- or dual-targeting. Indeed, a known substrate of AtPAP2 is a dTP, mitochondrial import of which hinges upon de/phosphorylation (Law et al. 2015). Some 110 proteins in Arabidopsis are known to be dually targeted (Carrie et al. 2009; Carrie and Small 2013), and the dTPs physicochemical features represent an intermediate value between those of pTPs and mTPs (Garg and Gould 2016).
The relative contributions of TP secondary structure and its physicochemical features to correct targeting are a matter of ongoing investigations, especially due to a lack of distinct features other than the abundance of serines in pTPs and alpha helices in mTPs (von HEIJNE et al. 1989; Bruce 2000; Jarvis 2008; Sidorczuk et al. 2023; Thagun et al. 2024). Structural predictions for the wild-type MME1 sequences from the Chlamydomonas and Marchantia (predicted using Alfafold3 (Abramson et al. 2024; Fig. S13) show them to have an alpha helix in the first 20 AA. The Marchantia MME1 with six serines and an intact alpha helix, however, indeed localizes to the plastid and provides an example, where physicochemical features surpass the secondary structure to convert a mitochondrial protein into a dual-targeted one. To what extent this could be a way toward dual targeting remains to be addressed, along with the role of de/phosphorylation (e.g. by PAP2), and the mechanism for the membrane recruitment of dephosphorylases. That the first 20 AA of CrMME1 are sufficient for correct mitochondrial targeting is in line with an evolutionary early dependence of positive charges for mitochondrial matrix targeting, which manifested itself during eukaryogenesis and in early diverging eukaryotes (Garg et al. 2015; Gould et al. 2016; Hirata et al. 2024). It was a necessity before plastid origin and pTPs evolved accordingly (Knopp et al. 2020) (Fig. 6).
A subset of plastid proteins, including the proteins involved in metabolism and translation, have a pTP charge closer to the mean mTP charge, i.e. the pTP of these proteins are less diverged from mTPs. Moreover, proteins involved in translation in general have a wider distribution of charge and number of phosphorylatable AA. Proteins whose mistargeting is detrimental are likely under stronger selection to diverge from mTPs than those whose mistargeting is not. The latter, over time, provides ground to evolve functionally relevant dual targeting. We propose that the gradual loss of phenylalanine-based targeting at the origin of Chloroplastida offered additional ground for the origin of dTPs due to some mis-sorting, and with which import receptors and components, such as PAP2/9, coevolved for an additional layer of import regulation. Experimental information regarding protein dual targeting outside of angiosperms, however, is rare, and the prediction of dTPs remains unreliable (Gould et al. 2024). Studying the origin and evolution of (regulated) protein dual targeting is hence a promising avenue and calls for more dedicated research outside of angiosperms.
In closing, our computational analyses across a wide range of photosynthetic eukaryotes underscore major shifts in the TPs from algae to angiosperms. The sample size of reliable genomes and organelle proteomes is skewed toward embryophytes and affects our inference as to where along the evolution of plants the TPs diverged exactly. Nevertheless, subsampling at the lowest range of organelle protein numbers (n = 100 to 300) and species numbers (N = 6 to 9) shows that eudicot pTPs are indeed significantly different from eudicot mTPs and from algal TPs (Fig. S14). Additional genomes, organelle proteomes, and experimental studies on protein import in algae (including rhodophyte and glaucophyte), will aid in tracing the evolution of TPs at a finer resolution. The role of TP modifications other than phosphorylation (Giglione and Meinnel 2021; Hoernstein et al. 2024) also remains to be explored. Notwithstanding the current limitations, it is clear that the divergence of physicochemical properties of TPs played a crucial role in preventing protein mistargeting (Figs 2 to 4) and it co-evolved with their processing counterparts at the outer membrane (Fig. 5). In light of the loss of a phenylalanine-based motif from the pTP (Knopp et al. 2020) and a general lack of conserved determinants between the two TPs (von HEIJNE et al. 1989; Jarvis 2008), this divergence improved the initial recognition of cargo at the outer membranes of organelles of endosymbiotic origin (Fig. 6) and enhanced the import accuracy of the expanding organelle proteomes that coincided with the diversification of plants (Raval et al. 2024). We indeed find the divergence between pTP and mTP (across species) to correlate with the total proteome size from algae to angiosperms (Fig. S15). The ecological factors responsible for both, organelle proteome expansion and TP divergence, are likely to include the access to more oxygen and an increase in photosynthesis performance associated with plant terrestrialization (Schreiber et al. 2022).
Conclusion
Contributions of mitochondria and plastids to plant's adaptation and divergence are plenty (de Vries et al. 2016; de Vries and Gould 2018; Dhabalia Ashok et al. 2024; Raval et al. 2024). In this study, we focused on the evolution of the organelle protein import itself, a gatekeeper for the organellar contributions to species fitness. Origin and diversification of the Chloroplastida, particularly that of embryophytes, was facilitated by major changes in import, also evident in targeting sequence divergence between mitochondrial and plastid cargo. Experimentally contrasting organelle protein import in the single cell alga C. reinhardtii with that in the early diverging bryophyte M. polymorpha, we validate the importance of targeting sequence divergence with respect to charge for enhancing targeting accuracy. Our revised model highlights novel mechanisms that determine import fidelity and regulation, such as de/phosphorylation at the outer membranes and ΔΨ at the inner mitochondrial membrane. These mechanisms are not confined to plants but also work on mitochondrial-nuclear localization shifts in humans, where the phosphorylation of a transcription factor was recently shown to be responsible for maternal inheritance of mitochondrial DNA (Lee et al. 2023). Membranes, ΔΨ and phosphorylation are traits as old as life itself. How their interplay has evolved into a selection and regulation mechanism for protein cargo transfer across membranes of endosymbiotic origin presents a promising avenue of future research.
Materials and Methods
Genomes Used in the Study
To study the evolution of TPs and import components, an in-house database of 137 Archaeplastida and 6 sister species was consolidated from available databases (species names and sources summarized in Table S1a). TPs analyses were conducted on 102 genomes from major clades where organelle proteomes could be reliably inferred based on available experimental proteomes (Table S1b). The presence–absence patterns of import components were plotted for a subset of genomes (Table S1c), representative of each clade, to visualize the broad patterns clearly. The ancestral state reconstructions (ASR) of import components were conducted on all genomes.
Analyses of TPs from Algae to Angiosperms
To study the features of TPs across organelle-targeted proteins, we used 102 Chloroplastida species from five major clades: Eudicots (66), Monocots (14), Bryophytes (7), Streptophyte (6) and Chlorophytes (9) algae (Table S1b). Within-clade all vs all reciprocal best blasts hits (rbbh) were retrieved for all proteins from all the species of these clades. To infer organelle proteomes of all species in a given clade, we used experimental plastid and mitochondrial protein (Gene IDs of the experimental proteins summarized in Table S2a-c) available across Chloroplastida (Sun et al. 2009; Terashima et al. 2011; Mueller et al. 2014). For a protein in a given species, if an rbbh was found against an experimentally validated organelle protein in the same clade, we inferred that protein to be organelle localized for that species (Gene IDs of inferred proteins summarized in Table S3, method schematic in Fig. S1). As there are no experimental proteomes available for streptophyte algae, we inferred their organelle proteomes based on chlorophyte algae. From these inferred plastid, mitochondrial- and dual-targeted proteomes, we analyzed the first 20 AA for electric charge and content of phosphorylatable AA of each protein using (the script). The resulting 20 AA sequences across species, charge/phospho values of each protein across all species are available on Zendeo; means for all species are consolidated in Source Data Fig. 1 and plotted in Fig. 1.
To investigate ancestral states of these two traits, a phylogenetic tree of each clade was constructed using the concatenation of organelle proteins present in all species in that clade (sequences and Newick files available on Zenedo), in IQ-TREE (Minh et al. 2020) v2.0.3 after aligning them with MAFFT (Katoh and Standley 2013) v7.505. The resulting unrooted trees were rooted using minimal ancestral deviation (Tria et al. 2017). Using this phylogenetic tree for the clade and based on values of charge and phosphorylatable AA across all species in the clade, ASR was conducted in rstudio (Phytool package 0.7.80; Revell 2012) (individual outcomes summarized on Zenedo; means consolidated in Source Data Fig. 2 and plotted in Fig. 2 as well as in Fig. S3 and Source Data Fig. S3).
Cloning and Transfection
Genes of interest from Marchantia, MpMME1 (Mp4g02270 alias Mapoly0080s0072) and MpPAP9 (Mp2g16290 alias Mapoly0122s0035) were amplified from the cDNA and cloned into Gateway (Invitrogen) entry vector pDONR221 (Invitrogen) by BP reactions set as per manufacturer's protocol. Substitutions of interest were made into the wild-type MME1 using PCR-mutagenesis. All constructs were transferred at the N-terminal of Citrine on pMpGWB106 (Ishizaki et al. 2015), or at the C-terminal of citrine on pMpGB105 (for MpPAP9) via Gateway cloning by LR reactions set as per manufacturer's protocol. The N-terminal of interest from Chlamydomonas CrMME1 (Cre06.g268750) and CrPAP2 (Cre11.g468500) were in vitro synthesized and amplified from the genomic DNA, respectively. Subsequently, they were cloned at the N-terminal of eGFP into the vector pJR38 (Neupert et al. 2009) by digestion with ndeI–bglII (bglII site was introduced to pJR38 prior to cloning the sequences of interest) and sticky-end ligations.
Localization vectors were integrated into Marchantia genome via *Agrobacterium-*mediated transfections previously optimized (Ishizaki et al. 2008). Briefly, the vectors were first introduced into electrocompetent Agrobacterium tumefaciens (GV3101 without pSOUP) by electroporation (Bio-Rad GenePulser Xcell, 1.44 kV). Epical notches were cut away from two-week-old thalli and after 2 d of recovery, these explants were incubated with Agrobacterium transfectants (starting OD_600_ of 0.02) for 3 d. After washing three to four times with 25 to 30 mL water, the explants were incubated with 1 mg/mL cefotaxime (to remove any leftover Agrobacterium) and again washed two times. The explants were then plated onto Hygromycin (10 μg/mL) and Cefotaxime (100 μg/mL) and selected until they produced gemmae. Gemmae from multiple explants were taken forward to the next generation (G0, the first generation starting from a gemma) and grown until they produced gemmae, which were screened for the reporter expression under the microscope. Expression-positive gemmae were further propagated and used for the localization experiments.
The localization constructs for the alga were introduced to Chlamydomonas via the glass bead transfection method (Kindle 1990). Briefly, 1 µg of linearized plasmid, 5% PEG6000, 330 µL Chlamydomonas culture (concentrated to 10^8^ cells per ml) and 300 mg glass beads were vortexed for 15 s. The cells were plated, selected, and further propagated on 10 µg/mL paromomycin. The algal and plant lines used in the study are summarized in Table S5.
Plant and Algal Growth Conditions
M. ecotype Takaragaike-1 (Tak-1) plants were grown from gemmae on half-strength Gamborg B5 vitamin agar-plates (GVA) under continuous light (70 μmol m^−2^ s^−1^). Individual colonies of C. reinhardtii UVM4 (Neupert et al. 2009) were grown in high salt medium supplemented with acetate as a carbon source under continuous light (40 μmol m^−2^ s^−1^).
Protoplast Isolation
Gemmae from expression-positive plant lines were grown for 1 to 2 weeks on GVA. A 5 to 10 mL of 8% mannitol, 6.3 g/L Gamborg B5 vitamins (MGV) containing 15 mg/ml cellulase and 5 mg/mL macroenzyme R10 was poured onto the plates and incubated overnight with gentle shaking (10 to 20 rpm). The plates were agitated to dislodge the protoplasts, and the liquid was filtered with 50 µm strainer and washed twice with MGV by centrifuging at 300 × g, 3 min and gently resuspending by swirling the tube.
Sample Preparation for Microscopy
Marchantia gemmae were placed in 30 µL of water on a slide and covered with a coverslip. Marchantia thalli, grown (from gemmae stage) for 2 d, were placed in 30 µL of water on a slide and covered with a coverslip. Marchantia protoplasts and Chlamydomonas single cells were strained with 300 nM MitoTracker™ Red CMXRos as per manufacturer's instructions (50 µL protoplast or cell suspension in MGV or growth media + 300 nM dye, incubated for 20 min at RT, washed twice by centrifuging the cells at 2,000 × g and protoplasts at 100 × g for 5 min) to visualize mitochondria and immobilized in 1% soft agarose before microscopic analysis.
Microscopy
Nikon Eclipse Ti imaging platform (illuminated with Nikon Intensilight C-HGFIE for fluorescence) was connected to the camera via Nikon Digital Sight DS-U3 and to the PC via NIS-Elements Basic Research version 4.30. The light intensity was adjusted via neutral density filters at 16-8 for plastids and 1 for mitochondria (1 = minimum attenuation, maximum light allowed through). Plastids were visualized via chlorophyl autofluorescence at 480/590 nm, GFP/citrine reporters at 470/520 nm, and mitochondria at 540/605. Exposure times were set between 20 and 80 ms for plastids, up to 300 ms for GFP, and 200 to 500 ms for mitochondria (depending upon the extent of staining).
The Z-stacks (0.33 µm per stack) of the whole Chlamydomonas cells were imaged and deconvoluted via the Lightning adaptive deconvolution module (Leica Microsystems) on Leica TCS SP8 platform (Leica, Wetzlar, Germany) with an HC PL APO CS2 93× glycerol objective (NA 1.3, Leica), connected to the PC via (LAS-X, Leica). Plastids were visualized via chlorophyl autofluorescence at 510/670 to 710 nm, eGFP reporter at 488/495 to 513, citrine reporter at 510/520 to 540 nm, and mitochondria at 560/580 to 620 (White Light Laser, at 25% strength). Autofluorescence from plastid was filtered out during GFP and mitochondria reading by time gating (emitted signal was collected between 0.8 and 6 ns) (Kodama 2016).
Microscopy Image Processing
Prior to making the microscopy panels, thresholds of each image were adjusted to filter out the halo/bleeding out signals (retain the middle 50% of the total data by treating each pixel on the plane equally). All colocalization analyses were conducted in ImageJ (Fiji) with the BIOP JACoP plugin (auto thresholds).
Origin and Evolution of Import Components Across Archaeplastida
In total, 42 Archaeplastida species (Table S4a), from each available representative clade, were clustered using OrthoFinder version 2.5.4 (Emms and Kelly 2019), after all vs all blasts were conducted (E-value cutoff 10e−6) using diamond blast version 2.011 (Buchfink et al. 2014). The protein clusters (Table S4b) were annotated based on the presence of experimentally validated import components (Table S4c to d), and gene copy numbers (GCN) for each component were inferred across species (Table S4e), and a subset was plotted in Fig. 5a (see Fig. S8 for GCN plots of all components). PAP2/9 homologs from all species were retrieved from the same clusters that were analyzed through Interproscan (Paysan-Lafosse et al. 2022) and DeepTMHMM (Hallgren et al. 2022) to annotate functional domains and membrane anchoring domains, respectively.
To investigate the origins of a select component more rigorously, we utilized the full database of 137 Archaeaplastida species with 6 outgroup species (Table S1a). Considering the sequence divergence of these components, iterative hidden Markov searches were conducted using HMMER (Eddy 1998) 3.3.2 and seed sequences from Arabidopsis. The homologs thus retrieved were used for ASR in rstudio (Phytool package 0.7.80 [Revell 2012]).
Supplementary Material
msaf240_Supplementary_Data
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Abe Y et al Structural basis of presequence recognition by the mitochondrial protein import receptor Tom 20. Cell. 2000:100:551–560. 10.1016/S 0092-8674(00)80691-1.10721992 · doi ↗ · pubmed ↗
- 2Abramson J et al Accurate structure prediction of biomolecular interactions with Alpha Fold 3. Nature. 2024:630:493–500. 10.1038/s 41586-024-07487-w.38718835 PMC 11168924 · doi ↗ · pubmed ↗
- 3Archibald JM . Endosymbiosis and eukaryotic cell evolution. Curr Biol. 2015:25:R 911–R 921. 10.1016/j.cub.2015.07.055.26439354 · doi ↗ · pubmed ↗
- 4Bauer NC, Doetsch PW, Corbett AH. Mechanisms regulating protein localization. Traffic. 2015:16:1039–1061. 10.1111/tra.12310.26172624 · doi ↗ · pubmed ↗
- 5Bhadouria J, Giri J. Purple acid phosphatases: roles in phosphate utilization and new emerging functions. Plant Cell Rep. 2022:41:33–51. 10.1007/s 00299-021-02773-7.34402946 · doi ↗ · pubmed ↗
- 6Bhushan S, Kuhn C, Berglund AK, Roth C, Glaser E. The role of the N-terminal domain of chloroplast targeting peptides in organellar protein import and miss-sorting. FEBS Lett. 2006:580:3966–3972. 10.1016/j.febslet.2006.06.018.16806197 · doi ↗ · pubmed ↗
- 7Bowman JL et al Insights into land plant evolution garnered from the Marchantia polymorpha genome. Cell. 2017:171:287–304.e 15. 10.1016/j.cell.2017.09.030.28985561 · doi ↗ · pubmed ↗
- 8Bowman JL et al The renaissance and enlightenment of Marchantia as a model system. Plant Cell. 2022:34:3512–3542. 10.1093/plcell/koac 219.35976122 PMC 9516144 · doi ↗ · pubmed ↗
