Population Genetics of the Emergence and Evolution of Allogenic Recognition During Fertilization
Masahiro Naruse, Takako Saito, Midori Matsumoto

TL;DR
This study uses population genetics to explore how self and nonself recognition during fertilization evolved in species like ascidians and plants.
Contribution
A novel population genetics approach is introduced to model the emergence and evolution of allorecognition mechanisms.
Findings
Allorecognition alleles can increase under specific evolutionary conditions.
Simulations showed coexistence of allorecognition and nonallorecognition genotypes is possible.
Modeling in Ciona confirmed consistency with experimental data on allorecognition.
Abstract
Allorecognition, or distinguishing between the self and nonself within the same species, is observed in both animals and plants, particularly in the context of immune reactions and self-incompatibility in sexual reproduction. Polymorphic recognition molecules are known to be responsible for such allorecognition during fertilization. Previous studies have reported that in ascidians and flowering plants, inbreeding avoidance relies on a pair of polymorphic recognition molecules with a receptor-ligand relationship that are encoded at a single locus, the S locus (Self-incompatibility locus), but the process by which such pairs of recognition molecules emerge and evolve to become polymorphic is not known. Here, a population genetics study was carried out as a novel approach for investigating allorecognition. To study the process by which self-recognition emerges, we simulated a situation in…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9- —JSPS KAKENHI
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReproductive Biology and Fertility
1. Introduction
Fertilization, or gamete fusion, is a pivotal event in sexual reproduction in animals and plants. In general, a male gamete, or sperm, is released and attracted to a female gamete, or egg. Whereas most flowering plants are hermaphrodites, a majority of the animals are dioecious, with some exceptions, including ascidians. Among angiosperms, approximately half of all species, including rice and Arabidopsis, are self-compatible [1,2,3]. Similarly, both self-incompatible and self-compatible species of ascidians have been characterized: Ciona intestinalis and Halocynthia roretzi are well-known self-incompatible species, whereas Phallussia mammillata is self-compatible [4,5].
Self-compatibility appears to be advantageous for population growth, since it allows female gametes to be fertilized with male gametes from the same animal even in the absence of a mating partner: a good example of a self-compatible animal species is Caenorhabditis elegans [6]. On the other hand, individuals produced by self-fertilization have lower genetic diversity than those produced by nonself-fertilization, and as a result, self-fertilization causes inbreeding depression [7]. For example, in Ciona, self-fertilization results in the production of only a small number of normal embryos and larvae [8]. Self-fertilization in hermaphroditic colonial tunicates of the genus Botryllus also results in the production of an extremely small number of progeny due to embryonic death [9]. Therefore, although the prevention of self-fertilization has a certain disadvantage in terms of reducing opportunities for reproduction, it appears to have a significant advantage in eliminating harmful genes and promoting the spread of beneficial genes.
Although some species avoid self-fertilization by modulating reproductive behavior, such as by releasing male and female gametes at different times [10,11], many species prevent self-fertilization through allogenic recognition (allorecognition) mechanisms. In Ciona intestinalis type A (another name Ciona robusta), sperm can recognize eggs from the same individual upon attachment to the vitelline coat (VC), causing detachment from the VC or eventual cessation of motility, ultimately preventing self-gamete fusion. The sperm-VC recognition protein s-Themis, a PKD-like transmembrane protein, and the VC protein v-Themis, a fibrinogen-like protein, play key roles in this process [4]. Three pairs of s-Themis and v-Themis genes, s/v-Themis-A, s/v-Themis-B, and s/v-Themis-B2, are found at two loci: s/v-Themis-A on chromosome 2q (locus A) and s/v-Themis-B and s/v-Themis-B2 on chromosome 7q (locus B). In every case, v-Themis genes are located in the first intron of s-Themis genes but are transcribed in the opposite direction [4,5,12,13]. If three allelic pairs (A, B and B2) of s-Themis and v-Themis are matched, i.e., the same haplotype, fertilization is prevented, enabling self-incompatibility [12].
Similarly, in Papaveraceae, if self-pollen reaches the stigma, pollen germination and pollen tube elongation do not occur, and as a result, self-fertilization never takes place. The pollen recognition protein PrpS and the stigmatic recognition protein PrsS, genes of which are encoded in a single diverse genetic locus, the S-locus, in close proximity (within 0.5 kbp) to one another, are responsible for self-incompatibility in Papaveraceae [1,14].
In both cases, male and female recognition molecules, encoded by genes at one or a few loci, play pivotal roles in ligand-receptor interactions. When VC or stigmatic ligand molecules interact with receptors on self-sperm or self-pollen, respectively, intracellular Ca^2+^-mediated signal transduction occurs, resulting in the prevention of self-fertilization [15,16]. Both male and female allorecognition proteins are thought to be highly polymorphic and mediate specific interactions [17]. The polymorphic alleles responsible for such allorecognition are presumed to have originated from a single ancestral gene, encoding a pair of ancestral male and female recognition molecules believed to have diverged to produce polymorphisms during coevolution [18].
Here, we investigated the emergence and evolution of allorecognition molecules using a population genetics approach. For the emergence process, we evaluated whether a primitive self-recognition gene that emerged by mutation was maintained from generation to generation. The process, by which a pair of self-recognition proteins acquire variation by mutation was then simulated using a model in which a male or female mutation preceded the creation of a new allorecognition gene pair. Finally, we examined how the proportions of allorecognition alleles and nonallorecognition alleles changed when the number of allorecognition alleles or loci increased.
2. Materials and Methods
For this study we considered three preconditions for our modeling, and the model represents phenomena that occur in animals. Therefore, the sperm and the egg in the animal model correspond to the pollen and the pistil in plants, respectively.
The first precondition is that the pair of allorecognition molecules is specialized for fertilization and is not shared with other types of allorecognition, such as nonself recognition in the immune system. At least in the ascidian, Halocynthia roretzi, different molecular mechanisms have been suggested for fertilization and the immune response, since the characteristics of self-incompatibility and allogenic refusion of somatic cells (contact reaction) differ [19,20]. Additionally, the Fu/HC gene, which plays a key role in histocompatibility, was not expressed in germline cells in the ascidian Botryllus schlosseri [21].
The second precondition is that male recognition molecules are expressed from haploid cells, whereas female recognition molecules are derived from diploid cells, whether from eggs arrested at the first metaphase of meiosis in ascidians or from the somatic cells surrounding eggs, as reported in C. intestinalis [5,22]. In plants, this type of self-recognition in mating is called gametophytic, distinguished from sporophytic, in which male recognition molecules are synthesized in the anther tapetum (somatic cells) and attached to the surface of the pollen, although both gametophytic and sporophytic self-recognition have been reported [1,23].
The third precondition for modeling was that neither mutations nor gene migration occured during the simulation period and that generations did not overlap, as is typical in the modeling of population genetics [24]. This is to focus our examination on each step of emergence and the evolution of allorecognition, which consists of the appearance of a certain new allele by mutation and subsequent convergence to equilibrium through the spread of the selection of alleles in the population, rather than simulating the mutation process itself in real-time. In the simulation, genotypes compete through natural selection rather than genetic drift.
2.1. Modeling for the Emergence Process of Allorecognition Genes
The model for the process of self-recognition gene emergence is shown in Figure 1. X represents a nonallorecognition allele, Y represents an allorecognition allele that emerged as a result of the mutation, and and indicate the proportions of individuals with the XX and XY genotypes, respectively, in the n-th generation, where , because the YY genotype does not exist (as described below).
Since the numbers of sperm and eggs should be proportional to the number of individuals, X-genotype sperm are produced from XX individuals, while X-genotype and Y-genotype sperm are produced from XY individuals (Figure 1A). Similarly, the ratio of XX-genotype eggs to XY-genotype eggs can be regarded as vs. . Although X-genotype sperm can fertilize any egg, sperm carrying the Y allele can fertilize only XX-genotype eggs (Figure 1B).
Assuming that all the eggs undergo fertilization with an excess amount of sperm, the number of offspring in the next generation produced by each combination of gametes can be expressed as the product of the proportions of eggs and sperm, as shown in Figure 1C. Here, YY-type offspring cannot occur, since Y-type sperm cannot fuse with XY-type eggs. Before the number of offspring for each genotype is summed, the survival probability must be adjusted if the parental genotypes are the same because inbreeding depression must be considered for offspring of pairs of the same genotype, i.e., self-fertilized offspring. For cases of inbreeding depression, the value of should be between 0 and 1, where indicates that inbreeding is completely lethal, indicates that inbred and outcrossed offspring exhibit equal survival, and indicates that inbred offspring have greater survival than outcrossed offspring do, which is unlikely. Finally, the sums of each genotype are defined as and , respectively; then, and , which are normalized values obtained by dividing each value by their sum, defined as , are the proportions of XX and XY genotype individuals of the n+1st generation (Figure 1D).
For the simulation, and were set to arbitrary initial values, and the generation calculation in Figure 1D was simply repeated as many times as necessary for the analysis. The reproducibility and robustness of the convergence points of each population were confirmed by the simulation using randomized initial conditions; the same was applied to the following models.
2.2. Modeling of the Evolution of Allorecognition Genes
2.2.1. Divergence Through Mutation
To study the evolutionary process of allorecognition genes, namely, the generation of variation in allorecognition alleles, the process of allorecognition allele divergence must be considered first. Thus, situations in which a mutation occurred in a gene encoding the male or female recognition molecule of a primitive allorecognition gene in the previous model (Figure 1) were modeled as shown in Figure 2, Appendix A.1, and Supplementary Material S1.
In addition to the definition in Figure 1, we defined an allele Y^F^, in which a mutation occurred in a gene encoding the female recognition molecule on allele Y and whose function was lost, and an allele Y^M^, in which a mutation occurred in a gene encoding the male recognition molecule (Figure 2A). A mutated female recognition molecule derived from the Y^F^ allele does not lead to fertilization failure for sperm with a male recognition molecule encoded by either the Y allele or the Y^F^ allele, whereas the sperm of Y^F^ cannot fertilize eggs with the Y allele. Therefore, this mutation results in one-way sterility. With respect to the Y^M^ allele, sperm with a mutated male recognition molecule derived from the Y^M^ allele are permitted to fertilize eggs with the Y allele or Y^M^ allele, whereas eggs with the Y^M^ allele cannot be fertilized by sperm with the Y allele, similar to eggs with the Y allele. Models for alleles Y^F^ or Y^M^ were subsequently constructed (Figure 2B,D, respectively), and the mating characteristics of the genotypes in these models are shown in Figure 2C,E.
On the basis of these features, the relationships of the proportions of the n-th generation and the n+1-st generation were derived as Equations (A1) and (A2) through works similar to those in Figure 1C,D (see Figure S1A,B). Changes in the population occupancy rates of the Y^F^ allele or the Y^M^ allele were analyzed by repeating these generation calculations.
For further analysis of the divergence process, a situation in which a new allele encoding a male recognition molecule adapted to the mutated female molecule appears, must be considered. This new allele was defined as allele Z, and the populations consisting of X, Y, Z and Y^F^ were modeled as shown in Figure 2F. In this case, the XX, XY, XZ, YZ, XY^F^, YY^F^, ZY^F^, and Y^F^Y^F^ genotypes are present.
Only the X-type sperm can be accepted by any egg, the Y-type sperm can be fertilized with only the XX- and XZ-type eggs, and the Z-type and Y^F^-type sperm can fertilize XX, XY, XY^F^, YY^F^, and Y^F^Y^F^ genotype eggs (Figure 2G). On the basis of these features, the relationship between the proportions of the n-th generation and the n+1-st generation was derived by Equation (A3) through a process similar to those in Figure 1C,D (see Figure S1C). Changes in the population occupancy rates of alleles X, Y, Y^F^, and Z were simulated by repeating the generation calculations.
2.2.2. Model with Divergent Alleles
To simulate a situation in which the number of allorecognition alleles increases, it is necessary to consider three groups of genotypes, namely, a homozygous genotype of the nonallorecognition allele, a heterozygous genotype comprising the no allorecognition allele and one of the allorecognition alleles, and a heterozygous genotype comprising two different allorecognition alleles ( , , and in Figure 3A, respectively).
As shown in Supplementary Material S2, a preliminary study in which two types of allorecognition alleles (Y_1_ and Y_2_ in Figure S2A) coexisted with the nonallorecognition allele X (defined as ) revealed that heterogenotypic populations of a nonallorecognition allele and one of the allorecognition alleles ( and in Figure S2C,D), placed under the same conditions, converged to the same proportion as the number of generations increased, even if the simulations started from different initial proportions ( and ). Additionally, a preliminary study with three types of allorecognition alleles (defined as ) revealed that the heterogenotypic populations consisted of two different allorecognition alleles ( , , and in Figure S2G,H) converged to the same proportion, since these genotype populations were selected under the same conditions.
On the basis of the results of these preliminary studies, the simulation model was extended to and generalized. The characteristics of the model with “M” types of allorecognition alleles are shown in Figure 3. Here, heterogenotypic populations under the same conditions were substituted as and (Figure 3A). Afterward, the relationship between the proportions of the n-th generation and the n+1-st generation was derived by Equation (A4) through procedures similar to those in Figure 1C,D (see Supplementary Material S3). Changes in the occupancy rates of alleles X, and Ys were simulated by repeating the generation calculations.
2.3. Modeling the 2-Locus Situation
To investigate the effects of the presence of multiple allorecognition loci, a 2-locus model was constructed according to the characteristics of C. intestinalis. To simulate the fertilization of C. intestinalis, sperm cannot fertilize eggs that share the same alleles at all the loci (Figure 4A).
The definitions of the genotypes and their characteristics are described in Figure 4B and Figure S4A. Loci A and B have “M” and “L” types of allorecognition alleles, respectively. As in the model with divergent alleles above, genotypes under the same conditions were combined; e.g., represents M types of populations, with homononallorecognition in locus A and heteroallorecognition in locus B, under the same conditions.
The relationships between the proportions of the n-th generation and the n+1-st generation were subsequently derived by Equation (A5) through a process similar to that in Figure 1C,D. Changes in the population occupancy rates were simulated by repeating the generation calculation.
3. Results
3.1. Emergence Process of Allorecognition Genes
Using a model for the emergence process described in the Materials and Methods (Figure 1), we observed the proportions of genotypes across generations.
The changes in the proportion of XY genotype individuals ( ) across generations when the initial value is fixed at 0.01 and the survival probability varies between 0.01 and 1.1 are shown in Figure 5A. As shown in Figure 5A, converges to a certain value when , whereas it converges to 0 for and decreases monotonically for , indicating that genotypes XX and XY coexist as long as .
As shown in Figure 5B, when α is fixed at 0.2, simulations with any initial value yielded convergence of the XY proportion at the same value of approximately 0.45 after multiple generations. In other cases where , the proportion of XY genotype individuals which converged over generations was dependent on only regardless of the values, and reproducibility was confirmed by the simulation using randomized initial conditions, as shown in Supplementary Material S5.
In conclusion, the allorecognition allele can be fixed in the population at a certain proportion that depends on . In such a situation, the allorecognition allele (Y) and nonallorecognition allele (X) are in a state of balanced selection. Although heterozygous dominance is known to lead to balanced selection as in the case of sickle-cell polymorphisms in malaria-endemic regions [25,26], the relationship between X and Y here involves a different mechanism.
3.2. Process of Allorecognition Gene Evolution
3.2.1. Process of Divergence Through Mutations in Male/Female Genes
When polymorphism of allorecognition alleles occurs, it is unlikely that a gene pair encoding male and female recognition molecules will simultaneously undergo mutation to generate a new gene pair that encodes a pair of functional recognition molecules. Therefore, it is natural to hypothesize that mutation occurs first within only one sex (gene encoding a male or female recognition molecule), followed by adaptive mutation of the gene encoding the corresponding molecule. Under this hypothesis, we simulated situations in which only the male or female allorecognition molecule underwent loss-of-function mutation via the models described in the Materials and Methods (Figure 2), and the resulting changes in the allele occupancy rates were determined.
First, when the Y^F^ allele emerges from the mutation in the population consisting of alleles X and Y, the Y^F^ allele is fixed in the population at a certain rate, and the population transitions to a state comprising X, Y, and Y^F^ alleles when . At this time, the converged proportion values of each genotype depended not on the initial value but rather on . Representative results of the simulations are shown in Figure 6A, and the summarized results of the simulation at convergence are shown in Figure 6B. These results indicate that the Y^F^ allele coexists with the X and Y alleles.
In contrast, when the case in which the Y^M^ allele appears in the population consisting of alleles X and Y is simulated, the Y allele tends to disappear from the population. As shown in a representative result (Figure 6C), genotypes with the Y allele, namely, XY and YY^M^, converged to 0 in the presence of the Y^M^ allele at . The summarized results of the simulations (Figure 6D) show the converged proportion values of each genotype for various values, indicating that the Y allele does not survive in all the cases. Therefore, the elimination of the original Y allele occurs through mutation of the gene encoding the male recognition molecule on Y allele.
Considering the evolution of polymorphisms in the allorecognition gene, we focus first on the case in which the mutation occurs in the gene encoding the female recognition molecule, since this case can be a transition state to the creation of a new allorecognition allele, whereas mutation in the male gene leads to the disappearance of the original allorecognition allele.
To create a new allorecognition allele, the male recognition molecule must adapt to the mutated female recognition molecule encoded by the Y^F^ allele. This requires a mutation in the gene encoding the male recognition molecule of allele Y^F^, and an allele that accomplishes such a mutation is defined as allele Z, as shown in Figure 2A. Therefore, simulations for the state comprising alleles X, Y, Z and Y^F^ were conducted using the model described in the Materials and Methods (Figure 2).
An example of the results is shown in Figure 6E; the proportions of each genotype tended to converge to a fixed value, and none of them were eliminated when . The convergence proportions depend on regardless of the initial value, as in previous simulations. The relationships between and the proportions of alleles after different numbers of generations are shown in Figure 6F, indicating that all the genotypes coexist in all the cases where . A comparison of the data in Figure 6B,F revealed that the proportion of Y^F^ allele was significantly reduced by the emergence of the Z allele even at the same .
The simulations in this section suggest that the creation of the new allorecognition allele consists of two steps. First, one of the existing alleles is mutated, and a “transition allele” encoding a new female recognition molecule and the original male recognition molecule is generated and can be maintained in the population. In the second step, a “new allorecognition allele” encoding a male recognition molecule that can recognize the new female recognition molecule, encoded by the transition allele, is generated by the mutation and coexists in the population, accompanied by a significant reduction in the proportion of transition alleles, which ultimately can be eliminated by genetic drift.
3.2.2. Behavior of the Model with Divergent Alleles
In the above studies, allorecognition alleles and nonallorecognition allele coexisted across generations, which was related to balancing selection. Next, we focused on how this relationship changes when the number of allorecognition alleles increases.
Using the models described in the Section 2 (Figure 3), the proportion of nonallorecognition allele after different numbers of generations was simulated in the presence of multiple allorecognition alleles. When the number of types of allorecognition alleles is defined as “M”, there are M types of heterozygous genotypes of the nonallorecognition allele and one of the allorecognition alleles (“ ”) and M_C_2 types of heterozygous genotypes of the two different allorecognition alleles (“ ”), as described in Figure 3A.
The results of the simulation up to M = 50 are summarized in Figure 7. The proportions of the XX genotype at the point of convergence against M for several values are shown in Figure 7A, indicating that even when M increases to 15, the XX genotype still accounts for a certain percentage of the whole population. Even if , meaning that cross-fertilization between the different genotypes is 100 times more advantageous for survival than fertilization between the same genotypes, more than 10% of the populations exhibited the XX genotype at M = 15. Additionally, a large proportion of individuals possessed the nonallorecognition allele X alone (Figure 7B). Further simulation up to M = 50 (Figure 7C,D) revealed that the proportion of the XX genotype and the X allele occupancy rate decreased monotonically with increasing M, although the decrease slowed. The data in Figure 7C,D indicate that even when M increases, a considerable number of nonallorecognition allele remains in heterozygosity with allorecognition alleles (“b” in Figure 7C).
Therefore, as the number of allorecognition alleles increases, the occupancy rate of the nonallorecognition allele decreases, but the allele survives in the population, mainly in heterozygosity with allorecognition alleles, and direct selection does not occur. These findings suggest that there is a high likelihood that nonallorecognition allele persists in populations of animals and plants that avoid self-mating by allorecognition.
3.3. Simulation Using the 2-Locus Model
In C. intestinalis, three gene pairs encoding allorecognition molecules, s/v-Themis-A in locus A and s/v-Themis-B and s/v-Themis-B2 in locus B, are similar to each other and have been suggested to share a common ancestral locus. In this section, we analyzed the effects of the presence of multiple allorecognition loci, especially the occupancy rates of nonallorecognition alleles at different loci.
A 2-locus model simulating C. intestinalis was constructed as described in the Materials and Methods (Figure 4), and changes in the occupancy rates of nonallorecognizing alleles were observed. The results of the simulations are shown in Figure 8, in which “L” is the number of alleles at locus A and “M” is the number of alleles at locus B. When the numbers of alleles at both loci were the same ( ) and the initial proportions of the nonallorecognition alleles at both loci were equal ( ), all the genotypes, as defined in Figure 4B, coexisted after multiple generations (Figure 8A,B). However, when either or was unbalanced, the proportion of the nonallorecognition allele (X or x) at the locus with the greater number of allorecognition allele types (L and M) or the smaller initial proportion of the nonallorecognition allele was greatly reduced, trending toward disappearance. Figure 8C shows an example where (M L), and Figure 8D shows a case where ( ). The proportion of nonallorecognition alleles, which decreased in this way, continued to decrease monotonically even after five thousand generations, suggesting a very high probability for the elimination of such nonallorecognition alleles through genetic drift. Such alleles are no longer in a state of balanced selection but rather in a state of direct selection. These reductions did not depend on the value of (Figure 8E). In addition, in simulations of situations in which the nonallorecognition allele of one locus had disappeared, the nonallorecognition allele of the remaining other locus tended to persist (Supplementary Material S4 (Figure S4D–G)).
Therefore, when each of the two loci is polymorphic, the nonallorecognition allele tends to be eliminated at one locus, and survive at the other locus. In C. intestinalis, a nonallorecognition allele has been observed only at locus B (B2-3 allele in Ref. [12]), indicating that our model is consistent with observed phenomena.
4. Discussion
4.1. Validation of Simulations with Analytical Results
The reproducibility and robustness of the convergence points of each population were confirmed by the simulation using randomized initialization as shown in Table S1 and Supplementary Material S6. All the results showed consistent convergence to identical equilibrium states under various random initial conditions, indicating the reliability and robustness of the modeling.
The analytical solution of the model for the emergence process shown in Figure 1 was examined. When the proportion of the XX genotype converges, and should be equal. Therefore, the equilibrium points were calculated by substituting the equations in Figure 1C into the simultaneous equations and . Then, real numbers between 0 and 1 satisfying Equation (1) are the equilibrium points for the model.
When 1 > α > 0, Equation (1) has two equilibrium points. Although one ( = 1) is an unstable equilibrium point, the other is a locally asymptotically stable equilibrium point such that the proportion of the genotype XX will converge to this stable equilibrium point regardless of the initial conditions. On the other hand, when , = 1 is the only equilibrium point of Equation (1), a state in which only the XX genotype exists. As shown in Table 1, the convergence points from the simulations match the stable equilibrium points obtained as the analytical solution.
Additionally, the analytical solution for the model with diversified alleles in Figure 3 was considered. As in the model for the emergence process, the following simultaneous Equation (2) were obtained by substituting the equations in Equation (A4) into simultaneous equations , , and . Then, “ ” and “ ”, which are between 0 and 1 and satisfy the equations in Equation (2), are the equilibrium points for the model.
As shown in Table 2, the convergence points from the simulations coincided with the stable equilibrium points obtained as the analytical solution. These alignments indicate that the simulation model and the approach of this study are appropriate for understanding the emergence and evolutionary processes in the allorecognition system.
An increase in the number of types of alleles leads to an increase in the probability of elimination of nonallorecognition alleles, especially if there are two or more loci. With respect to the one-locus situation, theoretically, when the number of allorecognition allele types (M) increases to infinity, the following simultaneous equations (Equation (3)) are obtained by considering only the highest-order terms, namely, the cubic and quartic terms of Equation (2).
The solution of Equation (3) is = 0. Since = 1 or = 0 satisfies Equation (2), ( , , ) = (1, 0, 0) and ( , , ) = (0, 0, 1/M_C_2) are equilibrium points. Whereas (1, 0, 0) is unstable, (0, 0, 1/M_C_2) is a stable equilibrium point. Therefore, as “M” continues to increase to infinity, the proportions of genotypes “ ” and “ ” converge to zero; that is, the nonallorecognition allele X will theoretically be eliminated from the population.
4.2. Comparison with Observed Phenomena
The simulations in this study suggest that the nonallorecognition allele may not be eliminated from the population; in nature, such an allele (B2-3 allele in C. intestinalis) exists and cannot perform the allorecognition function [12]. The sequence of this B2-3 allele is not particularly close to that of any of the other alleles of Themis-B2 (Supplementary Material S7), suggesting that the B2-3 allele has been independent for many generations.
The simulation indicates that the fitness of the nonallorecognition allele generally depends on . There are no data comparing the fitness of offspring of the same allorecognition genotype with different genotypes, and the size of may change depending on the size of the population, since the probability that mating between the same genotypes represents self-mating decreases with increasing population size. However, as mentioned above, in ascidians, self-fertilized embryos very often fail to develop [6,9], and abnormalities in seed formation caused by self-pollination have been reported in plants [27]. These observations indicate that self-fertilization carries considerable risk, and must be very small.
4.3. Asymmetric Characteristics of Female and Male Recognition Molecules
Mutation of the gene encoding a female recognition molecule allows the coexistence of a transition allele (Y^F^) and an original allele (Y) over a wide range of values of . Therefore, the divergence of allorecognition alleles may originate from mutation of the gene encoding the female recognition molecule. In contrast, mutation of the gene encoding the male recognition molecule must be somewhat suppressed, since mutations that inactivate the male recognition molecule reduce the diversity of allorecognition loci, namely, the number of alleles, through selection for the original allele.
In sexual conflict, since the female typically exerts selection on the maleside, the male side undergoes changes to its elements or behaviors to expand the opportunities for mating or to exclude others, whereas the female side changes its elements or behaviors strictly in opposition to the changes on the male side [28,29]. Conversely, as revealed in this study, during the process of allorecognition molecule divergence, mutations in the gene encoding the female recognition molecule occur first, although they increase the risk of self-fertilization, and are not desirable for females. This phenomenon in self-sterility would be an interesting example from the standpoint of sexual conflict, since genes encoding female molecules are mutated first, followed by mutation in the male molecules.
4.4. Tolerance to Mutations and Molecular Evolution
In C. intestinalis, locus B consists of the tandem genes s/v-Themis-B and -B2 [12]. The ancestral locus is thought to have duplicated. With tandem loci, loss of the allorecognition function through a mutation in one copy can be complemented by the other, although s/v-Themis-B and -B2 are not always linked (manuscript in preparation). The tolerance of mutations in this way may promote the creation of variation in such loci, since mutation is necessary for generating polymorphisms, as shown in this study. In fact, the number of polymorphisms is greater in locus B than in locus A. This finding is consistent with previous statistical research suggesting that mutation and recombination play key roles in the creation and maintenance of polymorphisms in allorecognition loci [30].
5. Conclusions
In this research, we introduced a new modeling and simulation approach to investigate the emergence and evolution of allorecognition and observed its consistency with real-world phenomena, such as the coexistence of nonallorecognition allele.
Our study suggests that allorecognition genes, newly generated by mutation, and nonallorecognition allele established a balanced selection state, after which allorecognition diversified to construct the current allorecognition system. We also revealed that mutations in genes encoding male and female allorecognition molecules showed different characteristics and predicted the diversification process of the allorecognition molecules in which female molecules differentiate first.
Although the current study was based on the ascidian C. intestinalis, self-incompatibility has also been studied in other animals and plants such as B. campestris, and the detailed mechanisms differ among species; for example, both gametophytic and sporophytic mechanisms have been reported in plants. Therefore, we expect that our models could be tailored to different situations in different species to provide novel tools for furthering our knowledge and understanding of allorecognition.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Iwano M. Takayama S. Self/non-self discrimination in angiosperm self-incompatibility Curr. Opin. Plant Biol.201215788310.1016/j.pbi.2011.09.00321968124 · doi ↗ · pubmed ↗
- 2Fujii S. Kubo K. Takayama S. Non-self- and self-recognition models in plant self-incompatibility Nat. Plants 201621910.1038/nplants.2016.13027595657 · doi ↗ · pubmed ↗
- 3Zhang D. Li Y.Y. Zhao X. Zhang C. Liu D.K. Lan S. Yin W. Liu Z.J. Molecular insights into self-incompatibility systems: From evolution to breeding Plant Comm.2024510071910.1016/j.xplc.2023.100719 PMC 1087388437718509 · doi ↗ · pubmed ↗
- 4Sawada H. Saito T. Mechanisms of Sperm-Egg Interactions: What Ascidian Fertilization Research Has Taught Us Cells 202211209610.3390/cells 1113209635805180 PMC 9265791 · doi ↗ · pubmed ↗
- 5Harada Y. Takagaki Y. Sunagawa M. Saito T. Yamada L. Taniguchi H. Shoguchi E. Sawada H. Mechanism of Self-Sterility in a Hermaphroditic Chordate Science 200832054855010.1126/science.115248818356489 · doi ↗ · pubmed ↗
- 6Sawada H. Saito T. Shimada Y. Nishimura H. Fertilization mechanisms in hermaphroditic ascidians and nematodes: Common mechanisms with mammals and plants Curr. Top. Dev. Biol.2025162551154018051710.1016/bs.ctdb.2025.01.010 · doi ↗ · pubmed ↗
- 7Schoen D.J. Baldwin S.J. Self-incompatibility and the genetic architecture of inbreeding depression New Phytol.20232371040104910.1111/nph.1855036263709 · doi ↗ · pubmed ↗
- 8Morgan T.H. The genetic and the physiological problems of self-sterility in Ciona J. Exp. Zool.19429019922810.1002/jez.1400900205 · doi ↗
