Allostery and conformational changes upon binding as generic features of   proteins: a high-dimension geometrical approach

Anton S. Zadorin

arXiv:1905.02815·q-bio.BM·May 9, 2019

Allostery and conformational changes upon binding as generic features of proteins: a high-dimension geometrical approach

Anton S. Zadorin

PDF

Open Access

TL;DR

This paper demonstrates that proteins' ability to undergo conformational changes and allosteric regulation upon ligand binding is a generic feature arising from evolutionary pressures for ligand discrimination, supported by a high-dimensional geometric analysis.

Contribution

It extends previous models to high-dimensional smooth systems, showing that allostery and conformational changes are generic outcomes of evolutionary selection for ligand discrimination.

Findings

01

Allosteric regulation and conformational changes are generic features of proteins.

02

High-dimensional geometric analysis supports the universality of these features.

03

Evolutionary solutions are near a codimension-1 subspace in genotypical space.

Abstract

A growing number of experimental evidence shows that it is general for a ligand binding protein to have a potential for allosteric regulation and for further evolution. In addition, such proteins generically change their conformation upon binding. O. Rivoire has recently proposed an evolutionary scenario that explains these properties as a generic byproduct of selection for exquisite discrimination between very similar ligands. The initial claim was supported by two classes of basic examples: continuous protein models with small numbers of degrees of freedom, on which the development of a conformational switch was established, and a 2-dimensional spin glass model supporting the rest of the statement. This work aimed to clarify the implication of the exquisite discrimination for smooth models with large number of degrees of freedom, the situation closer to real biological systems. With…

Equations68

U (x, ℓ, a) = \frac{1}{2} k (∣ x ∣ - r)^{2} - (ℓ - a) x,

U (x, ℓ, a) = \frac{1}{2} k (∣ x ∣ - r)^{2} - (ℓ - a) x,

U (x, ℓ, a) = \frac{1}{2} k (x - r)^{2} - (ℓ - a) x,

U (x, ℓ, a) = k (x^{2} + d^{2} - r)^{2} - (ℓ - a) x,

F (ℓ_{r}, a) < F (ℓ_{\emptyset}, a) < F (ℓ_{\varw}, a),

F (ℓ_{r}, a) < F (ℓ_{\emptyset}, a) < F (ℓ_{\varw}, a),

min U_{ℓ_{r}, a} < min U_{ℓ_{\emptyset}, a} < min U_{ℓ_{\varw}, a} .

min U_{ℓ_{r}, a} < min U_{ℓ_{\emptyset}, a} < min U_{ℓ_{\varw}, a} .

M = X \times L \times A .

M = X \times L \times A .

U (x, ℓ, a) = U_{0} (x_{0}, x_{1}, x_{2}, a) + U_{1} (x_{1}, \lambdaup, a) + U_{2} (x_{2}, \rhoup, a) .

U (x, ℓ, a) = U_{0} (x_{0}, x_{1}, x_{2}, a) + U_{1} (x_{1}, \lambdaup, a) + U_{2} (x_{2}, \rhoup, a) .

min U_{ℓ_{00}, a} = min U_{ℓ_{10}, a} and min U_{ℓ_{01}, a} = min U_{ℓ_{11}, a}

min U_{ℓ_{00}, a} = min U_{ℓ_{10}, a} and min U_{ℓ_{01}, a} = min U_{ℓ_{11}, a}

min U_{ℓ_{00}, a} = min U_{ℓ_{10}, a} but min U_{ℓ_{01}, a} \neq = min U_{ℓ_{11}, a} .

min U_{ℓ_{00}, a} = min U_{ℓ_{10}, a} but min U_{ℓ_{01}, a} \neq = min U_{ℓ_{11}, a} .

M^{(s)} = {x \in M^{s} : \forall i, j, 1 ⩽ i < j ⩽ s \Rightarrow x_{i} \neq = x_{j}} .

M^{(s)} = {x \in M^{s} : \forall i, j, 1 ⩽ i < j ⩽ s \Rightarrow x_{i} \neq = x_{j}} .

Ω_{0} = {\sigmaup \in O : \forall \nuup_{0}, \omegaup_{\nuup_{0}} = 0} = {\sigmaup \in O : rank \omegaup_{\nuup \lambdaup} ⩽ 0},

Ω_{0} = {\sigmaup \in O : \forall \nuup_{0}, \omegaup_{\nuup_{0}} = 0} = {\sigmaup \in O : rank \omegaup_{\nuup \lambdaup} ⩽ 0},

Ω_{1} = {\sigmaup \in O : \forall \nuup_{0}, \nuup_{1}, \omegaup_{\nuup_{0}} \land \omegaup_{\nuup_{1}} = 0} = {\sigmaup \in O : rank \omegaup_{\nuup \lambdaup} ⩽ 1},

\dots

Ω_{k} = {\sigmaup \in O : \forall \nuup_{0}, \dots, \nuup_{k}, \omegaup_{\nuup_{0}} \land \dots \land \omegaup_{\nuup_{k}} = 0} = {\sigmaup \in O : rank \omegaup_{\nuup \lambdaup} ⩽ k},

\dots

\forall\nuup\quad\!\underset{\tilde{\gammaup}}{\overset{}{\rotatebox[origin={rc}]{15.0}{\large$\int$}}}\omegaup_{\nuup}=0,\quad\textrm{and thus}\quad\!\underset{\iotaup(\tilde{\gammaup})}{\overset{}{\rotatebox[origin={rc}]{15.0}{\large$\int$}}}dy_{\nuup}=y_{\nuup}\big{(}\iotaup(\sigmaup_{1})\big{)}-y_{\nuup}\big{(}\iotaup(\sigmaup)\big{)}=0.

\forall\nuup\quad\!\underset{\tilde{\gammaup}}{\overset{}{\rotatebox[origin={rc}]{15.0}{\large$\int$}}}\omegaup_{\nuup}=0,\quad\textrm{and thus}\quad\!\underset{\iotaup(\tilde{\gammaup})}{\overset{}{\rotatebox[origin={rc}]{15.0}{\large$\int$}}}dy_{\nuup}=y_{\nuup}\big{(}\iotaup(\sigmaup_{1})\big{)}-y_{\nuup}\big{(}\iotaup(\sigmaup)\big{)}=0.

\varmathbb R ≃ Δ \varmathbb R^{2} \ignorespaces \ignorespaces \ignorespaces \ignorespaces

\varmathbb R ≃ Δ \varmathbb R^{2} \ignorespaces \ignorespaces \ignorespaces \ignorespaces

T^{*} M ≃ T^{*} X \oplus T^{*} (L \times A)

T^{*} M ≃ T^{*} X \oplus T^{*} (L \times A)

V \oplus W \ignorespaces \ignorespaces \ignorespaces \ignorespaces \ignorespaces \ignorespaces \ignorespaces \ignorespaces

V \oplus W \ignorespaces \ignorespaces \ignorespaces \ignorespaces \ignorespaces \ignorespaces \ignorespaces \ignorespaces

P_{X} = 0_{X} \oplus T^{*} (L \times A), \piup_{T^{*} M} = (p_{T^{*} M} \times p_{T^{*} M}) ∣_{J_{2}^{1} (M, \varmathbb R)},

P_{X} = 0_{X} \oplus T^{*} (L \times A), \piup_{T^{*} M} = (p_{T^{*} M} \times p_{T^{*} M}) ∣_{J_{2}^{1} (M, \varmathbb R)},

W_{1} = (\piup_{E})^{- 1} (Im \iotaup_{E}) \cap (\piup_{T^{*} M})^{- 1} (Im \iotaup_{P_{X}}) \cap (\piup_{L} \circ \piup_{1})^{- 1} (Im \iotaup_{L}) \cap (\piup_{A} \circ \piup_{1})^{- 1} (Im \iotaup_{A}) \subset J_{2}^{1} (M, \varmathbb R),

W_{1} = (\piup_{E})^{- 1} (Im \iotaup_{E}) \cap (\piup_{T^{*} M})^{- 1} (Im \iotaup_{P_{X}}) \cap (\piup_{L} \circ \piup_{1})^{- 1} (Im \iotaup_{L}) \cap (\piup_{A} \circ \piup_{1})^{- 1} (Im \iotaup_{A}) \subset J_{2}^{1} (M, \varmathbb R),

W_{2} = W_{1} \cap (\piup_{X} \circ \piup_{1})^{- 1} (Im \iotaup_{X}) \subset J_{2}^{1} (M, \varmathbb R),

Y_{1} = Im j_{2}^{1} U \cap W_{1} \subset J_{2}^{1} (M, \varmathbb R),

Y_{2} = Im j_{2}^{1} U \cap W_{2} \subset J_{2}^{1} (M, \varmathbb R),

V_{1} = \piup_{1} (Y_{1}) \subset M^{(2)},

V_{2} = \piup_{1} (Y_{2}) \subset M^{(2)},

U_{1} = \iotaup_{A}^{- 1} (\piup_{A} (V_{1})) \subset A,

U_{2} = \iotaup_{A}^{- 1} (\piup_{A} (V_{2})) \subset A .

codim (\piup_{E})^{- 1} (Im \iotaup_{E}) = 1, codim (\piup_{T^{*} M})^{- 1} (Im \iotaup_{P_{X}}) = 2 dim X,

codim (\piup_{E})^{- 1} (Im \iotaup_{E}) = 1, codim (\piup_{T^{*} M})^{- 1} (Im \iotaup_{P_{X}}) = 2 dim X,

codim (\piup_{L} \circ \piup_{1})^{- 1} (Im \iotaup_{L}) = 2 dim L, codim (\piup_{A} \circ \piup_{1})^{- 1} (Im \iotaup_{A}) = dim A,

codim (\piup_{X} \circ \piup_{1})^{- 1} (Im \iotaup_{X}) = dim X, dim M^{(2)} = 2 (dim X + dim L + dim A) .

codim V_{1} = codim W_{1} = 1 + 2 dim X + 2 dim L + dim A,

codim V_{1} = codim W_{1} = 1 + 2 dim X + 2 dim L + dim A,

codim V_{2} = codim W_{2} = 1 + 3 dim X + 2 dim L + dim A,

dim V_{1} = dim A - 1, dim V_{2} = dim A - dim X - 1.

P_{X} \ignorespaces \ignorespaces \ignorespaces \ignorespaces

P_{X} \ignorespaces \ignorespaces \ignorespaces \ignorespaces

U (x_{0}, ℓ_{0}, a) = U (x_{1}, ℓ_{1}, a) and U (x_{1}, ℓ_{0}, a) = U (x_{2}, ℓ_{1}, a) .

U (x_{0}, ℓ_{0}, a) = U (x_{1}, ℓ_{1}, a) and U (x_{1}, ℓ_{0}, a) = U (x_{2}, ℓ_{1}, a) .

U (x_{0}, ℓ_{0}, a) = U (x_{1}, ℓ_{0}, a), and thus, U (x_{1}, ℓ_{0}, a) = U (x_{1}, ℓ_{1}, a) .

U (x_{0}, ℓ_{0}, a) = U (x_{1}, ℓ_{0}, a), and thus, U (x_{1}, ℓ_{0}, a) = U (x_{1}, ℓ_{1}, a) .

P_{X} \ignorespaces \ignorespaces \ignorespaces \ignorespaces

P_{X} \ignorespaces \ignorespaces \ignorespaces \ignorespaces

f_{i} : a \mapsto d E_{s_{i} (ℓ_{1}, a)} d (s_{i})_{(ℓ_{1}, a)} (\varv, 0), i \in I,

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsProtein Structure and Dynamics · Enzyme Structure and Function · Hemoglobin structure and function

Full text

Allostery and conformational changes upon binding as generic features of proteins: a high-dimension geometrical approach.

A.S. Zadorin

(*Chimie Biologie Innovation, ESPCI Paris, CNRS, PSL University, 75005 Paris, France.

Center for Interdisciplinary Research in Biology (CIRB), Collège de France, CNRS, INSERM, PSL Research University, Paris, France.*)

Abstract

A growing number of experimental evidence shows that it is general for a ligand binding protein to have a potential for allosteric regulation and for further evolution. In addition, such proteins generically change their conformation upon binding. O. Rivoire has recently proposed an evolutionary scenario that explains these properties as a generic byproduct of selection for exquisite discrimination between very similar ligands. The initial claim was supported by two classes of basic examples: continuous protein models with small numbers of degrees of freedom, on which the development of a conformational switch was established, and a 2-dimensional spin glass model supporting the rest of the statement. This work aimed to clarify the implication of the exquisite discrimination for smooth models with large number of degrees of freedom, the situation closer to real biological systems. With the help of differential geometry, jet-space analysis, and transversality theorems, it is shown that the claim holds true for any generic flexible system that can be described in terms of smooth manifolds. The result suggests that, indeed, evolutionary solutions to the exquisite discrimination problem, if exist, are located near a codimension-1 subspace of the appropriate genotypical space. This constraint, in turn, gives rise to a potential for the allosteric regulation of the discrimination via generic conformational changes upon binding.

1 Introduction

Significant conformational changes in proteins upon their binding to specific ligands are ubiquitous in nature. Their occurrence spans from signaling proteins and transcription factors to enzymes. They are observed even outside the realm of proteins: in aptamers and ribozymes. Allosteric regulation, the modulation of the primary function by binding of another molecule at a distant site, is often associated with such biopolymers either in an actual or in a potential form. There are two common features in all these systems. First, they are flexible. Second, the primary task that all of them solve includes a fine discrimination between different but close ligands vs. solvent. Namely, the desirable ligand must be more preferably than the solvent with the contrary for undesirable ligands. For signaling proteins it is the ability to distinguish between the specific signal and similar molecules. For transcription factors it is the recognition site among similar DNA motifs. For enzymes and ribozymes it is the ability to sufficiently strongly bind the substrate and/or the transition state and to release the (often similar) product [1].

The main focus of researchers since the discovery of conformational changes has been on the nature and mechanisms of these changes. Their evolutionary origin is usually seen either as a requirement for the function of the protein in question or as a subsequent development of regulation of this function. For example, for a signal transduction receptor, a conformational change upon binding is necessary to initiate the signal response pathway (be it binding to a DNA site or a transmembrane activation of a signaling cascade). For enzymes, such explanations include the correct positioning of aminoacids in the reaction center and creation of the correct environment around the substrate, as well as kinetic control of the reaction rate. For the enzymes that are molecular motors, conformational changes are the essence of their function. Finally, the conformational change is seen as an adaptation to allosteric regulation of protein function [2]. It is important to emphasize that in this view allostery is actively selected for and a conformational change serves as a means to fulfill this demand. Such explanations assume some adaptive value of the conformational change in light of selection for a complex property of the protein. The conformational changes themselves are understood as highly orchestrated events.

The main two hypothetical mechanisms of the conformational change itself, however, assume it to be generic. These two mechanisms are the hypothesis of induced fit and the hypothesis of conformational selection. In the induced fit scenario, the ligand provokes a conformational change in the sufficiently flexible binding protein after an initial weak binding. This results in a strongly bound complex [3, 4]. In the conformational selection paradigm, the native state and the conformation of the strongly bound complex exist as possible conformations in the population of the free protein under normal conditions. The native state is assumed to reflect the global minimum of the free energy while the other state is assumed to be metastable with lower probability in the population. The binding of the ligand change their roles and the other state becomes predominant. The conformations themselves, in this scenario, do not significantly change—only their free energy levels change [5, 6].

Recently, a different view on the problem of allostery and conformational changes in proteins was introduced by O. Rivoire in [7]. Rivoire noticed that allostery can be a consequence of an existing conformational change in a discriminating protein. Indeed, if a conformational change is invelved in an exquisite discrimination, the ability to discriminate can be turned off by blocking the movement itself and thus changing the energies of bound states. If the conformation changes far from the initial binding site, as it takes place in sufficiently large conformational change, the regulating binding can happen far from the initial binding pocket. This situation is interpreted as allosteric regulation. The conformational change in the first place, in this scenario, comes as a byproduct of the selection towards the exquisite discrimination. Thus, a potential for allostery emerges from a selection for a much simpler property.

The validity of this scenario was demonstrated in [7] on two types of models: 1) extremely simple elastic network models and 2) a spin glass model of proteins. The elastic models treated the protein as a single mass on one or two springs with only one degree of freedom. Possible ligands were treated as numerical values on an axis of environmental variables that served as an additional force constantly acting on the system. In addition, the evolutionary degree of freedom also was considered to be a single continuous variable that imposes another force of the same sort. The spin glass model, from the other hand, had multiple configurational and evolutionary degrees of freedom, but a configuration of each “aminoacid” was described only by either “up” or “down” state. Both these models are strongly simplified descriptions of proteins.

Both models gave the same qualitative result, formulated in [7] as follows. Under the assumption of a system’s flexibility, the discrimination of particular similar ligands requires the system to be evolutionary finely tuned to respect requirements on free energies of the complexes. This constraint causes a generic conformational change upon binding. A hypothesis, tested only on the spin glass model of proteins, was formulated that connects a large enough conformational change with the potential for allosteric regulation by involvement of distant parts of the protein in the movement. Being involved in keeping a delicate free energy balance, such sites become a potential target for further regulation by another ligand, for example. Furthermore, it was shown (on the continuous elastic model) that the conformational change may have a form of continuous deformation of the initial state to the final one or these states may be different ones and may even coexist as a global and a local minimum. Thus the distinction between the induced fit and the conformational selection becomes moot from this point of view.

However, the simplicity of the models used for illustration of this powerful principle comes with strong limitations that may prevent a direct generalization. The drawback of the spin glass model is in its intrinsic discontinuity, and it is difficult to say if the observed effects are related to general properties of protein-like systems or to this particularity of the model. This is especially true for the claim about a conformational switch, since the behaviour of the system is switch-like at the level of each element from the beginning.

The continuous elastic model suffers from its low dimensionality. It is not clear how the conclusion can be drawn from an example with one physical degree of freedom, one scalar phenotypic trait, and a ligand space described by a single number. Furthermore, the particularly simple relation between the ligand and the system’s potential may turn out to be a very particular case.

In the current work, it is proven that the conclusions of [7] are, indeed, valid, under a certain interpretation, for a much wider class of models: continuous systems with any number of degrees of physical freedom, any dimensionality of the phenotypical trait space, and any number of parameters describing ligands. In addition, an estimate on the abundance of evolutionary solutions to the exquisite discrimination of particular ligands (equivalently, on the required fine tuning of the protein sequence) is derived in terms of the dimensionality of the set in the trait space around which the solutions are concentrated. It is also shown that the proposed scenario for the origin of allosteric regulation is plausible in these settings, too.

The work is organized in the following way. In Section 2, the problem is formulated in terms of physical chemistry. In Section 3, it is translated to a mathematical model. In Section 4, the problem is rigorously formalized in the language of differential geometry and three main theorems are stated, constituting the main result about the exquisite discrimination problem, the conformational changes, and allostery: Theorem 1, 2, and 3. A biological interpretation and some implications of theses results are outlined in Section 5. The final Section 6 is completely devoted to formal mathematical proofs of the main theorems.

2 Physical formulation of the problem

Following [7], we assume that evolution of a protein involves three types of variables: 1) physical (conformational) degrees of freedom $x$ , 2) environmental degrees of freedom $\ell$ that define the surrounding medium, and 3) evolutionary degrees of freedom $a$ associated with the protein sequence.

Typically, the variables in $x$ include positions of single atoms or distances between pairs of them and completely describes the shape (conformation) of the molecule. Depending on the coarse graining of the model, it may describe mutual orientation of larger portions of the molecule, like individual aminoacids. In the latter case, $x$ may also involve angles on top of distances. The variables in $\ell$ contain information about the environment around the molecule. In this particular case they are restricted to the identification of a ligand bound to a particular site of the protein or to the absence of any such ligand (a molecule of the solvent can be taken as the ligand for this case). Finally, $a$ describes the genetic information involved in building the molecule. In the most direct case it is its aminoacid sequence. Alternatively, it can reflect some higher level aggregated phenotypical properties of the molecule or its parts that define its behaviour in the selected level of abstraction.

These three types of variables are linked via the parametrized potential energy $U(x,\ell,a)$ of the protein, where $\ell$ and $a$ are parameters. Thus, a function $U(x,\ell,a)$ describes a family of potential energies $U_{\ell,a}(x)$ with a constant parameter $a$ (it defines the protein) and an environment-dependent parameter $\ell$ (it describes how the energy changes with the binding of the ligand $\ell$ ). At given $\ell$ and $a$ , the distribution of conformations of the protein is given by the Boltzmann distribution with that potential energy such that the probability of conformation $x$ in an ensemble of molecules is $\varmathbb P(x\mid\ell,a)=\exp\Big{(}-\betaup\big{(}U(x,\ell,a)-F(\ell,a)\big{)}\Big{)}$ , where $F(\ell,a)$ is the free energy of the system and $\betaup$ is the inverse temperature measured in energy units. Following the treatment of the continuous case in [7], we will only consider the zero temperature limit $\betaup\to\infty$ . In this case $\varmathbb P(x\mid\ell,a)$ degenerates to a $\deltaup$ -function at the point (or points) of the global minimum of $U_{\ell,a}$ and $F(a,\ell)$ is equal to this minimum.

For example, in [7], variables $x$ , $\ell$ , and $a$ were natural numbers and the considered potentials had the following forms

[TABLE]

where $k$ , $d$ , and $r$ are positive constants.

The exquisite discrimination problem is formulated in the following way. Given a desirable ligand $\ell_{r}$ and an undesirable ligand $\ell_{\varw}$ such that $\ell_{r}\approx\ell_{\varw}$ , and assuming that the environment defined by the solvent alone is represented by $\ell_{\varnothing}$ , find $a$ such that

[TABLE]

or, in the zero temperature limit,

[TABLE]

This condition is schematically shown on the left part of Figure 1.

3 Mathematical formulation of the problem

For the sake of brevity, we will use the term “protein” for “a system that needs to do an exquisite discrimination”, although, of course, the general argument is not restricted only to proteins. We will assume that a protein is characterized by three types of variables: conformational variables $x$ that take value in the configuration space $X$ , environmental variables $\ell$ taking value in $L$ (the space of possible ligands), and evolutionary variables $a$ taking value in $A$ (the space of protein sequences, the phenotypical trait space, etc.).

The main claim of the current work can informally be expressed in the following statement. Under general assumptions, a sufficiently flexible system that solves the exquisite discrimination problem generically experiences a large conformational change upon binding to its substrate. The ability to discriminate requires evolutionary fine tuning and possible solutions are concentrated near a codimension-1 hypersurface of the phenotypical trait space $A$ . The combination of the fine tuning and the conformational change makes the discrimination ability sensitive to binding of other ligands to distant sites.

To make these statement precise we will fix the following assumptions. Spaces $X$ , $L$ , and $A$ are assumed to be smooth ( $C^{\infty}$ ) compact manifolds. The physical behaviour of a protein is defined by an energy function $U\colon M\to\varmathbb R$ , where

[TABLE]

The product is understood in the category of smooth manifolds so $M$ is assumed to be endowed with the structure of a $C^{\infty}$ manifold. $U$ is assumed to be smooth, as well. $U$ is understood as a family of potentials on $X$ with parameters from $L$ and $A$ . In the zero temperature approximation, the configuration of a protein with sequence $a$ that corresponds to a ligand $\ell$ is considered to be $x$ that minimizes $U(x,\ell,a)$ with constant $\ell$ and $a$ globally in $X$ . We also assume that $X$ represents only the shape of the protein and the degrees of freedom of the whole molecule (translational and rotational) are already excluded as well as that the dimensionality of $X$ corresponds to the number of the leftover independent degrees of freedom.

Let us discuss these assumptions. The representation of the configuration space of a physical system by a manifold is very natural and does not require any special explanation. The compactness of $X$ is a technical requirement, which is not very restrictive. Indeed, if the configurations are given by the collection of pairwise distances between elements (aminoacids, nucleotides) with hard links between neighbours in the primary sequence, as it is commonly assumed in physical models of macromolecules, the configuration space naturally has a form of closed (without border) compact multiply connected submanifold of some Euclidean space. If, instead, no restrictions are applied, but the interatomic interactions are described by some pairwise potentials, the actual configuration space is some Euclidean space, which is not compact. However, as the interaction potential either increases with acceleration (as in elastic network models) or monotonously increase to some finite limit (as in real molecules) as some atom approaches an infinite distance from the rest of the structure, we are not interested at the behaviour of $U$ in the neighbourhood of infinity. In this case, it is enough to consider some smaller, compact, subspace of the initial configuration space. Finally, when there is a finite potential energy in the bound state of the protein(as in real molecules), the initial space $\varmathbb R^{n}$ can be compactified to the projective space $P\varmathbb R^{n}$ . The compactness of $L$ , as well as its smooth nature, is just a convenient hypothesis as there are no good models for this space. The sequence space $A$ is not a smooth manifold in nature. It is rather a nondirected graph with high symmetry. However, we assume that it can be well approximated by some smooth manifold with a smooth function $U(x,\ell,a)$ . For example, for binary sequences of length $n$ the sequence space is an $n$ -dimensional hypercube. It can be approximated by an $(n-1)$ -dimensional hypersphere.

We assume that for the protein to perform its function, it must bind the correct ligand, described by the environment $\ell_{r}$ , stronger than the solvent, $\ell_{\varnothing}$ , and the incorrect ligand, $\ell_{\varw}$ , must be bound weaker than the solvent. $\ell_{r}$ and $\ell_{\varw}$ are assumed to be close in $L$ ( $L$ , being described by some physical parameters, is usually metrizable, so one may assume that the distance between the states given by some metrics on this space is much smaller than between either of them and $\ell_{\varnothing}$ ).

As the first step, we will solve a simpler problem. Given two ligands $\ell_{0},\ell_{1}\in L$ , we want to find such phenotypes $a$ and such configurations $x$ that the protein bound to either of the ligands has the same minimal energy $U$ . This point of view ignores the inevitably small differences between the minimal energy levels of the complexes with $\ell_{r}$ and $\ell_{\varw}$ . Moreover, the values of $\ell_{\varw}$ and $\ell_{r}$ are considered to be indistinguishable, too. Therefore, we assume $\ell_{\varw}=\ell_{r}=\ell_{1}$ . As a consequence, the minimal energy that corresponds to the binding to $\ell_{\varnothing}=\ell_{0}$ is equal to that of $\ell_{1}$ , as we assume it to be between $\ell_{\varw}$ and $\ell_{r}$ (see Figure 1).

This simplification can be regarded as a coarse grained view on the problem, where any slightly different points in any of the spaces are seen as equal. In this picture, a significant difference between real configurations $x_{1}$ and $x_{2}$ means simply $x_{1}\neq x_{2}$ . Such abstraction allows a rigorous mathematical treatment as it can be recast to questions about intersections of submanifolds of appropriate manifolds. Such questions, taking into account the notion of general position (generic case), result in exact answers in qualitative terms. The backinterpretation is, however, much less rigorous. We will address the issues related to it in the end of the article.

The initial problem sought a phenotype $a\in A$ that brings minimal energies for the ligand $\ell_{\varnothing}$ to that of the ligands $\ell_{\varw}$ and $\ell_{r}$ (with the correct ordering, but we will consider this issue separately). The corresponding protein would be considered to take a large conformational change upon binding if the corresponding configurations $x_{\varnothing}$ and $x_{\varw}\approx x_{r}$ are very different. In the simplified problem, the initial discrimination problem reduces to finding $a$ such that the global minima of $U$ in $X$ corresponding to $\ell_{0}$ and $\ell_{1}$ have the same energy level. We will call this a reduced discrimination problem. Then the initial question is whether or not the corresponding minimum points $x_{0}$ and $x_{1}$ coincide in $X$ .

We will also consider an infinitesimal discrimination problem, where $\ell_{r}\approx\ell_{\varw}$ and we replace the difference between $\ell_{\varw}$ and $\ell_{r}$ by a vector $\varv$ in $L$ at $\ell_{1}$ that shows the direction from $\ell_{r}$ to $\ell_{\varw}$ . We will assume that a given $a$ , which solves the reduced discrimination problem, also solves the infinitesimal discrimination problem if the displacement along $\varv$ in $L$ at $\ell_{1}$ corresponds to a positive change in the energy value at the global minimum (see Figure 1).

4 Main results

4.1 Exquisite discrimination, conformational changes, and fine tuning

Let us recall the following notion.

Definition.

A subset of a topological space is called a residual set if it can be represented by a countable intersection of open dense subsets. A typical element of the topological space is an element that belongs to some residual set. A situation is generic if it can be represented as a typical element of some space relevant to the problem. A complement to a residual set is called meager set.

We must consider that the naturally defined system, given by $U$ , is typical. Indeed, the meaning of residual sets is that their complements, meager sets, can be considered as negligible and points that belong to them as special. An assumption that naturally occurring systems do not belong to some negligible sets is a kind of an extension of the Copernican principle. It is in this sense $U$ is typical.

The first main result now can be formulated in the following theorem.

Theorem 1.

For a typical family of potentials $U\in C^{\infty}(M)$ , solutions to the reduced discrimination problem for ligands $\ell_{0}$ and $\ell_{1}$ either do not exist, or a typical solution is located on a $(\dim A-1)$ -dimensional submanifold $\hat{\textrm{U}}$ of $A$ and, if $\dim X>0$ , its minimum points $x_{0}$ (for $\ell_{0}$ ) and $x_{1}$ (for $\ell_{1}$ ) are different.

It should be noted that this mathematical result is intuitively expected from the beginning. Indeed, one formally has to find points of minimum for $U_{\ell_{0},a}$ and $U_{\ell_{1},a}$ . Let them be $\hat{x}_{0}(a)$ and $\hat{x}_{1}(a)$ , respectively, where the dependence on $a$ is explicitly indicated. The solution is given by traits $a$ such that $U(\hat{x}_{0}(a),\ell_{0},a)=U(\hat{x}_{1}(a),\ell_{1},a)$ . This constitutes one condition on $a$ , which is intuitively expected to be satisfied on a codimension-1 hypersurface of $A$ . In the same way, additional constraint of no conformational change is written in the form $\hat{x}_{0}(a)=\hat{x}_{1}(a)$ and is equivalent to $\dim X$ additional conditions. One would intuitively expect that the set of solution to discrimination without conformational changes occupies a submanifold of codimension $\dim X+1$ . However, the intuition alone is not suitable to treat multidimensional problems. In particular, the condition on $a$ is not a simple equation but depends on solutions $\hat{x}_{0}$ and $\hat{x}_{1}$ of the energy minimization problem. These solutions themselves depend on $a$ in a complex manner, which may involve discontinuities of rearrangements. The purpose of Theorem 1 and the following Theorem 2 is to justify the intuitive conclusion and to clarify in which sense it is true.

The evolutionary solutions delivered by Theorem 1 only guarantee that, after going back from the coarse graining picture, the minimal energies for $\ell_{\varnothing}$ , $\ell_{r}$ , and $\ell_{\varw}$ will be close. However, for a discriminating protein to work correctly, it is important to have the right order of these energies: $U_{r}<U_{\varnothing}<U_{\varw}$ . Therefore, we will consider the infinitesimal discrimination problem that probes the validity of this constraint by infinitesimally small deformation of solutions for the reduced discrimination problem at $\ell_{1}$ .

More specifically, consider a nonzero vector $\varv$ on $L$ emanating from $\ell_{1}$ ( $\varv\in T_{\ell_{1}}L$ ). This vector can be regarded as showing the direction from $\ell_{r}$ to $\ell_{\varw}$ . These points are considered to be infinitesimally close to $\ell_{1}$ , and $\ell_{1}$ has the same energy minimum as $\ell_{0}$ . Therefore, for the phenotype $a$ to be a solution to the full discrimination problem, the displacement along $\varv$ with the fixed $a$ must increase the minimal energy. The boundary between solutions that respect this requirement and those that do not is made of $a$ such that there is no change in the minimal energy level in the direction spanned by $\varv$ .

The second main result concerns this additional infinitesimal constraint on the order of the minima and is formulated as the following theorem.

Theorem 2.

For a typical family of potentials $U$ , solutions to the infinitesimal discrimination problem for ligands $\ell_{0}$ and $\ell_{1}$ and for a separating vector $\varv$ either do not exist, or a typical solution is located on a $(\dim A-1)$ -dimensional submanifold $\tilde{\textrm{U}}$ of $A$ and, if $\dim X>0$ , its minimum points $x_{0}$ and $x_{1}$ are different.

In other words, the additional requirement of a correct order in energy minima does not qualitatively change the situation. It can make the set $\hat{\textrm{U}}$ smaller, though ( $\tilde{\textrm{U}}\subset\hat{\textrm{U}}$ in general).

4.2 Conformational changes and allosteric regulation

Let us now look at how the development of a conformational change as a byproduct of a solution to the exquisite discrimination problem can help a development of allosteric regulation. By allosteric regulation we will understand the disruption of the initial ability to discriminate two ligands by binding of another ligand to a distant site of the molecule.

Let us assume now that the protein in question can bind two different ligands: $\lambdaup$ and $\rhoup$ . Therefore, the environmental variable takes the form $\ell=(\lambdaup,\rhoup)$ and it belongs to the space $L=\Lambda\times\mathrm{P}$ , $\lambdaup\in\Lambda$ , $\rhoup\in\mathrm{P}$ . Let us denote, as before, the situation when $\lambdaup$ is bound by $\lambdaup_{1}$ and when it is not bound by $\lambdaup_{0}$ . Likewise, we have $\rhoup_{1}$ and $\rhoup_{0}$ for the bound and free state of the ligand $\rhoup$ . Note, that we assume, as before, that $\lambdaup_{1}$ in fact represents two ligands: $\lambdaup_{r}$ and $\lambdaup_{\varw}$ . The protein discriminates these ligands. In contrast, $\rhoup_{1}$ is assumed to be a single ligand, which is bound by the protein without discrimination. Based on the theorems of the previous section we can expect a conformational change of the protein upon binding of $\lambdaup$ , upon binding of $\rhoup$ , upond binding of $\lambdaup$ , when $\rhoup$ is already bound, and vice versa.

Let us now assume in addition that the binding of $\lambdaup$ and $\rhoup$ is localized on the molecule in question and that it happens at different sites. Let us also assume that the sites are not to directly coupled. This can be expressed in the following way. Let $x_{1}$ be the degrees of freedom involved in the interaction with the ligand $\lambdaup$ (coordinates of atoms interacting with $\lambdaup$ , for example), $x_{2}$ be the degrees of freedom involved in binding $\rhoup$ , and $x_{0}$ be the residual degrees of freedom. We assume thus that $X=X_{0}\times X_{1}\times X_{2}$ with $x_{0}\in X_{0}$ , $x_{1}\in X_{1}$ , and $x_{2}\in X_{2}$ . Then the potential decomposes in this case as

[TABLE]

Let $a$ be a solution to the reduced exquisite discrimination problem (the reasoning is analogous for the infinitesimal problem) for $\ell_{00}=(\lambdaup_{0},\rhoup_{0})$ and $\ell_{10}=(\lambdaup_{1},\rhoup_{0})$ , and define $\ell_{01}=(\lambdaup_{0},\rhoup_{1})$ with $\ell_{11}=(\lambdaup_{1},\rhoup_{1})$ . Then the following result holds.

Theorem 3.

Suppose that the protein changes its conformation during the switch from $\ell_{01}$ to $\ell_{11}$ (upon binding of $\lambdaup$ on the background of bound $\rhoup$ ). Then the situation described by

[TABLE]

is not structurally stable in the sense that it can be turned by an arbitrarily small perturbation of $U$ into situation described by

[TABLE]

In the contrary, situation (7) is structurally stable in the sense that for any small enough perturbation of $U$ it cannot be turned into situation (6).

In other words, situation (7) means that when $\rhoup$ is not bound, the protein performs the exquisite discrimination for $\lambdaup$ , while when $\rhoup$ is bound, this ability is broken. Therefore, $\rhoup$ acts as an allosteric regulator for the exquisite discrimination of $\lambdaup$ . The theorem asserts that such behaviour is typical. The condition of the theorem substantiantly uses the genericity of the conformational change upon binding provided by Theorem 1.

5 Discussion and biological interpretation

Theorems 1–3 provide rigorous results in the limit of indistinguishable ligands ( $\ell_{\varw}\to\ell_{r}$ ) and for the zero temperature approximation. In real systems, the difference between the right and the wrong ligands is finite, free energies of different states are allowed to be different provided that the correct order is preserved, and the temperature is positive. Going back from the mathematical idealization adopted above to physically meaningful models with finite differences and nonzero temperature blurs the rigor of the statements. An exclusion from a generic situation must be understood as not something impossible for practical observation but rather as something less probable than the generic case. The more the difference and the temperature the less strong the statement. This can be graphically demonstrated for the case of a nonzero difference between $\ell_{r}$ and $\ell_{\varw}$ (and between the energy levels $U_{r}$ , $U_{\varw}$ , and $U_{\varnothing}$ ) still assuming zero temperature. A solution to the exquisite discrimination problem in this case corresponds to a phenotype $a$ such that (3) holds. If we denote $\ell_{0}=\ell_{\varnothing}$ and $\ell_{1}=\ell_{r}$ , the corresponding set $\hat{\textrm{U}}$ provided by Theorem 1 (we will denote $\hat{\textrm{U}}_{r}$ ) defines the border of phenotypes that respect $U_{r}<U_{\varnothing}$ . In the same way, the analogous set $\hat{\textrm{U}}_{\varw}$ defined for $\ell_{0}=\ell_{\varnothing}$ and $\ell_{1}=\ell_{\varw}$ marks the border of phenotypes that respect $U_{\varnothing}<U_{\varw}$ . When $\ell_{r}$ becomes close to $\ell_{\varw}$ , $\hat{\textrm{U}}_{r}$ becomes close to $\hat{\textrm{U}}_{\varw}$ . From this it is clear that for $\ell_{r}\neq\ell_{\varw}$ , phenotypes that solve the exquisite discrimination problem are situated between $\hat{\textrm{U}}_{r}$ and $\hat{\textrm{U}}_{\varw}$ . This is schematically shown by the shaded region on Figure 2. In fact, this regions is a “thick” version of the codimension-1 submanifold $\tilde{\textrm{U}}$ given by Theorem 2 with an appropriately chosen direction $\varv$ . Indeed, when $\ell_{r}$ approaches $\ell_{\varw}$ by some trajectory, the shaded region collapses to the submanifold that corresponds to $\varv$ such that $\varv$ is tangent to the trajectory. We see that a nonzero difference between the ligands makes the possible evolutionary solutions to their discrimination problem to occupy a spatial domain in the trait space $A$ rather than its infinitely thin codimension-1 submanifold. Yet, with sufficiently similar ligands (which is supposed by the exquisite discrimination problem) they stay near such manifold.

Another notion that is blurred in real systems is that of a large conformational change. In the idealized coarse grained mathematical model, any conformational change was interpreted as being large. The proven theorems do not provide any means to determine how large the conformational change is or what large means in general. Such problems are typical for topological but not metric theorems. Addition of real physics on top of the bare topology in this problem (such as the limits on the stifness of the chemical bonds, assumption of a nonzero temperature, the value of the mutational effects of individual aminoacids, and so on) might help to destinguish between essential and nonessential changes in the discrimination ability of a protein and in its conformation.

Although the first part of the main result (Theorems 1 and 2) can be shortly stated as discrimination requires a conformational change, the statement would not be entirely correct. First, the requirement must not be understood as direct causality. The correct interpretation is that most solutions to the discrimination problem will involve a conformational change. It implies that if a system performs discrimination and changes its conformation, it should not be surprising and no special explanation is required to this fact. In contrary, if a discriminating system does not show a conformational change, it is an indication on a special additional circumstances that may be of interest. Second, an application of the same theoretical approach to a protein that just binds to a ligand but not necessarily discriminates between similar ligands results in a conclusion that a conformational change is expected to accompany any binding in general.

Let us elaborate the latter statement. The part of the reasoning (in a simplified form) in the proof of Theorem 1 that involves a conformational change still holds in the case of a simple binding without discrimination. This means that we should expect a conformational change upon binding in general, not only when a discrimination is performed. This general statement is very close to the classical induced fit scenario. The competing conformational selection hypothesis in its strict form, instead, represents a very special case (very special form of energy landscape), as it requires the conformation of the global minimum of free energy for the unbound state to be also a conformation of a local minimum for the bound state and vice versa. This situation is not typical for smooth potentials. However, if the relevant conformations are themselves allowed to change upon binding, then the situation becomes as typical as the pure induced fit situation. In fact, the distinction between these two cases becomes irrelevant, as was already demonstrated in [7] on a simple model. A similar conclusion was formulated in [1] based on biochemical arguments and experimental observations, and in a new vision of the protein binding proposed in [8].

What discrimination does require is a an evolutionary fine tuning expressed in the dimension of the set of possible solutions. It is this fine tuning that brings about the potential for allosteric regulation (in the same sense as a discrimination causes a conformational change). If we combine the conclusion of Theorem 3 with the above understanding of how such rigorous statements should be interpreted in application to real systems, we may conclude the following. Proteins that discriminate ligands are prone to allosteric regulation by another ligand at a different binding site. This sensitivity of the discrimination to the distant binding is associated with the conformational change during the primary binding. Although this question was not studied in this work, we may also expect a wide sensitivity of such protein to mutations. Indeed, a mutation of an aminoacid is in some sense analogous to a local binding in its effect on the potential energy. Repeating for this case the reasoning about allosteric effects, we conclude that the ability to discriminate is broken by mutations in many sites. One can justify this assertion from a different perspective. Since the exquisite discrimination requires an evolutionary fine tuning, we can expect a mutation to break this tuning in a generic case. This is graphically represented on Figure 2. As a consequence, we expect a wide (in the spreading on the level of the primary sequence) mutational effect, when many mutations, however distant from the binding site, destroy the discrimination.

Note that the ability to bind a ligand without an imposed discrimination problem is generically robust to most mutations. Indeed, the solutions to the binding problem for a single ligand lay in a half space of $A$ to one side of $\hat{\textrm{U}}$ given by Theorem 1 for that ligand and the solvent. If a solution is situated deep in this region, it is expected to survive mutations in the sense that the resulting protein retains the binding ability (perhaps, with a weaker affinity, see Figure 2).

The modelling approach taken in this work is in the family of folding landscape models [9]. Looking at a protein through its (free) energy landscape is very natural from the point of view of physics and deserves more attention. The fact that such model supports the conclusion of [7] is very important. It shows that an emergence of sophisticated properties of proteins and other biological heteropolymers, upon which substantial part of the complexity of life is built, can be attributed to a very simple evolutionary process: selection for a local property, that is the ability to discriminate between similar ligands. It is not difficult to imagine a selection process that optimizes this task. Furthermore, such ability very probably was required even back at the earliest times of abiogenesis or very early life.

6 Proofs of Theorem 1, Theorem 2, and Theorem 3

We imply in the following that all manifolds and functions (maps) are smooth. The main tool of the proof is the jet-bundle and the multijet transversality theorem that is a consequence of the Thom’s transversality theorem. We will first recall some definitions and fix some notations.

Definition.

Let $M$ and $N$ be two smooth manifolds and $S\subset N$ be a submanifold. Let $p$ be a point in $M$ . A smooth function $f\colon M\to N$ is said to be transverse to $S$ at $p$ , if $df_{p}\,T_{p}M+T_{f(p)}S=T_{f(p)}N$ , where $T_{p}M$ means the tangent space to $M$ at $p$ and $df_{p}$ is the differential of $f$ at $p$ . $f$ is said to be transverse to $S$ , if it is transverse to $S$ at each point of $M$ . This situation will be denoted by $f\pitchfork S$ . Let $P\subset N$ be another submanifold. $S$ and $P$ are said to intersect transversely (or simply to be transverse), if $T_{q}S+T_{q}P=T_{q}N$ for each $q\in S\cap P$ . This situation will be denoted by $S\pitchfork P$ .

Definition.

The codimension of a submanifold $N$ of a manifold $M$ is the number $\operatorname{codim}N=\dim M-\dim N$ .

If $S$ and $N$ are submanifolds of the same manifold and $N\pitchfork S$ , then $\operatorname{codim}N\cap S=\operatorname{codim}N+\operatorname{codim}S$ (assuming $N\cap S\neq\varnothing$ ). If this number is negative, then $N\cap S=\varnothing$ .

Definition.

Let $M$ be a smooth manifold. Two smooth functions $f$ and $g$ from $M$ to $\varmathbb R$ are said to have $k$ -th order contact at $p\in M$ , if in some coordinate chart around $p$ their values and all their partial derivatives up to order $k$ are equal at $p$ . The relation of $k$ -th order contact is independent of the coordinate chart and defines equivalent classes. The equivalent class of function $f$ by $k$ -th order contact at $p$ , denoted $[f]^{k}_{p}$ , is called $k$ -jet of $f$ at $p$ . Let $J^{k}(M,\varmathbb R)_{p}$ be the set of all $k$ -jets at $p$ . The bundle of $k$ -jets of functions on $M$ is the set $J^{k}(M,\varmathbb R)=\coprod\limits_{p\in M}J^{k}(M,\varmathbb R)_{p}$ with the projection $\piup_{k}\colon J^{k}(M,\varmathbb R)\to M$ , $[f]^{k}_{p}\mapsto p$ endowed with the differential structure lifted from $M$ by $\piup_{k}$ . Every function $f\colon M\to\varmathbb R$ generates a special section of the $k$ -jet bundle $j^{k}f\colon p\mapsto[f]^{k}_{p}$ .

Note that jet bundles can be generalized to maps between arbitrary manifolds. Essentially, $k$ -jets of functions represent an invariant notion of their Taylor polynomials truncated to order $k$ . In the special case $k=1$ , the only one we will be interested in the following, the 1-jet bundle $J^{1}(M,\varmathbb R)$ is naturally isomorphic to the product $\varmathbb R\times T^{*}M$ (we denote this by $J^{1}(M,\varmathbb R)\simeq\varmathbb R\times T^{*}M$ ), where $T^{*}M$ is the cotangent bundle of $M$ .

Definition.

An $s$ -fold multijet bundle $J^{k}_{s}(M,\varmathbb R)$ is defined as follows. We denote

[TABLE]

It is a submanifold of $M^{s}$ . Let $\piup_{k}$ be the bundle projection $J^{k}(M,\varmathbb R)\to M$ . Then $J^{k}_{s}(M,\varmathbb R)=(\piup_{k}^{\times s})^{-1}(M^{(s)})$ . It is a submanifold of $J^{k}(M,\varmathbb R)^{s}$ and is a fibre bundle over $M^{(s)}$ . Every function $f$ on $M$ generates its special section $j^{k}_{s}f$ by the rule $j^{k}_{s}f(x)=(j^{k}f(x_{1}),\ldots,j^{k}f(x_{s}))$ .

Let us denote the diagonal of the direct product $M^{2}$ as $\Delta M^{2}$ . In the special case $s=2$ , the only one we will be interested in the following, $M^{(2)}$ has a simple representation: $M^{(2)}=M^{2}\setminus\Delta M^{2}$ .

Definition.

Map $f\colon X\to Y$ is regular at $p\in X$ , if the rank of $df_{p}$ is maximal. If $f$ is not regular at $p$ , it is called singular at $p$ and $p$ is called its critical point. Map $f\colon X\to Y$ is an immersion, if $df_{p}$ is injective at every $p\in X$ .

We also need some known theorems.

Theorem 4 ([10], page 52, Theorem 4.4).

Let $X$ and $Y$ be manifolds, $W\subset Y$ be a submanifold, and $f\colon X\to Y$ be a function and let $f\pitchfork W$ . Then $f^{-1}(W)$ is a submanifold of $X$ . If in addition $f(X)\cap W\neq\varnothing$ , then $\operatorname{codim}f^{-1}(W)=\operatorname{codim}W$ .

Theorem 5 (A special case of Mather’s multijet transversality theorem, [10], page 57, Theorem 4.13).

Let $X$ be a manifold and $W$ be a submanifold of $J^{k}_{s}(X,\varmathbb R)$ . The subset of $C^{\infty}(X)$ constructed of functions $f$ that verify $j^{k}_{s}f\pitchfork W$ is a residual set of $C^{\infty}(X)$ in the Whitney $C^{\infty}$ topology (for the definition, see [10, p. 42]). Moreover, if $W$ is compact, this subset is open. This theorem is called Thom’s transversality theorem for $s=1$ and thus $J^{k}_{s}(M,\varmathbb R)=J^{k}(M,\varmathbb R)$ , $j^{k}_{s}f=j^{k}f$ .

We will first prove some lemmas.

Lemma 1.

Let $X$ be a compact manifold and $Y$ be a manifold, let $\piup\colon X\times Y\to Y$ be the projection to the second factor. Then for a typical function $f\colon X\times Y\to\varmathbb R$ , the set of critical points of $f|_{\piup^{-1}(y)}$ is finite for any $y\in Y$ .

Proof.

It is known that the subspace of functions that have only isolated points is of so called infinite codimension (see [11] and [12] for definition and explanation and [11] and [13] for the proof), a notion that is stronger than being typical (the latter implies the former). This property implies that for any $k$ , the set of $k$ -parameter families of functions with only isolated points is residual in the set of all $k$ -parameter families. Function $f$ on $X\times Y$ is a ( $\dim Y$ )-parameter family of functions on $X\simeq\piup^{-1}(y)$ . Thus, for any $y$ , $f|_{\piup^{-1}(y)}$ has only isolated critical points. The finiteness of the number of critical points on every layer follows from the compactness of $X$ .

∎

Lemma 2.

Let $X$ and $Y$ be manifolds and $S\subset X\times Y$ be a compact submanifold, let $\piup\colon X\times Y\to Y$ be the projection to the second factor, let $\dim S=\dim Y-1$ . Suppose that for each $y\in Y$ , $\piup^{-1}(y)\cap S$ is finite. Then $\piup|_{S}$ is regular at a typical point of $S$ .

Proof.

The conclusion of the lemma is trivially true for $\dim Y=1$ ( $\dim S=0$ ). Therefore, in the following, we will consider $\dim Y>1$ .

Recall that for a manifold $M$ ( $\dim M=n$ ), a smooth association of a point $p\in M$ with a $k$ -dimensional subspace in $T_{p}M$ is called a $k$ -dimensional distribution on $M$ [14, §3] (not to be confused with probability distributions). In other words, a distribution associates a tangent hyperplane with each point of a manifold. A distribution can be viewed as a subbundle of the tangent bundle. Another way to define a distribution is by defining a collection of (at least $k$ ) vector fields $V_{i}$ on $M$ that span the corresponding subspace of the distribution at each point. Finally, the same distribution can be defined by (at least $n-k$ ) differential 1-forms $\omegaup_{j}$ that annulate $V_{i}$ : $\omegaup_{j}(V_{i})=0$ for each $i$ and $j$ . Any distribution can be defined in such way at least locally (in a neighbourhood of each point of $M$ ).

Now, the regularity of $\piup|_{S}$ at point $p\in S$ means that $T_{p}S\cap T_{p}\piup^{-1}(\piup(p))=0$ . The association $p\mapsto T_{p}\piup^{-1}(\piup(p))$ defines a distribution $D$ on $X\times Y$ , which is vertical towards the projection $\piup$ (it is mapped to the trivial distribution on $Y$ that associates $0\in T_{y}Y$ with each point of $Y$ ) and has its layers as integral manifolds (the layers are tangent to the distribution at each point). In a local chart around $p\in X\times Y$ with coordinates $(x_{\muup},y_{\nuup})$ , the layers of the bundle $\piup$ are defined by the conditions $y_{\nuup}=\mathrm{const}$ , and thus the distribution is defined by 1-forms $\alphaup_{\nuup}=dy_{\nuup}$ , where $d$ is the exterior derivative.

The distribution $D$ on $X\times Y$ induces a distribution $D_{S}$ on $S$ in the following way. Let $\iotaup\colon S\to X\times Y$ be the inclusion of $S$ . Then the collection $\{\omegaup_{\nuup}\}$ , $\omegaup_{\nuup}=\iotaup^{*}\alphaup_{\nuup}$ , defines a distribution on $S$ , where $\iotaup^{*}$ is the pullback of differential forms induced by $\iotaup$ . The dimension of $D_{S}$ , however, can change from point to point depending on the degeneracy of $\{\omegaup_{\nuup}\}$ .

Choose a coordinate neighbourhood $\mathscr{O}$ around $s\in S$ in $S$ with local coordinates $s_{\lambdaup}$ . Then locally we have $\omegaup_{\nuup}=\sum\limits_{\lambdaup}\omegaup_{\nuup\lambdaup}ds_{\lambdaup}$ , where $\omegaup_{\nuup\lambdaup}\in C^{\infty}(S)$ . In these terms, the regularity of $\piup|_{S}$ at $s$ means that the rank of matrix $\omegaup_{\nuup\lambdaup}$ is maximal at $s$ ( $\operatorname{rank}\omegaup_{\nuup\lambdaup}(s)=\dim S$ , as $\dim S<\dim Y$ ).

Consider the sets

[TABLE]

where $\wedge$ is the exterior product of differential forms. Note that for each $k\leqslant l$ , $\Omega_{k}\subset\Omega_{l}$ . Note also that trivially $\Omega_{k}=\mathscr{O}$ for all $k\geqslant\dim S$ . Consider also sets $\Theta_{k}$ , where $\Theta_{0}=\Omega_{0}$ and $\Theta_{k}=\Omega_{k}\setminus\Omega_{k-1}$ for $k>0$ . These sets define the points where rank of $\omegaup_{\nuup\lambdaup}$ is equal to $k$ .

Let us prove that $\Omega_{k}$ are closed and nowhere dense in $\mathscr{O}$ for $k<\dim S$ , and thus $\Theta_{\dim S}$ is open and dense in $\mathscr{O}$ . The closeness of all $\Omega_{k}$ follows from the fact that the defining equations in (9) are equivalent to a finite set $\{F_{m}=0\}$ of functional equations on $\omegaup_{\nuup\lambdaup}$ , where $F_{m}$ are homogeneous polynomials of $\omegaup_{\nuup\lambdaup}$ of $k$ -th order with coefficients from $\{-1,1\}$ . Indeed, the equations in (9) reflect nothing else but setting to zero all $k$ -th minors of $\omegaup_{\nuup\lambdaup}$ . As $F_{m}$ are smooth, the set $\Omega_{k}=\{\sigmaup\in\mathscr{O}:F_{m}=0\}$ is closed.

Now suppose that $\mathring{\Omega}_{0}\neq\varnothing$ , where $\mathring{A}$ is the interior of $A$ . Then $D_{S}$ has constant dimension $\dim S$ in $\mathring{\Omega}_{0}$ , and the set of equations $\omegaup_{\nuup}(V)=0$ has $\dim S$ linearly independent solutions. Choose one such $V$ , a point $\sigmaup\in\mathring{\Omega}_{0}$ , where $V_{\sigmaup}\neq 0$ , a neighbourhood $\mathscr{O}_{\sigmaup}$ of $\sigmaup$ , where $V\neq 0$ and rectifiable (which always exists by smoothness of $V$ ), the integral curve $\gammaup$ of this vector field in $\mathscr{O}_{\sigmaup}$ that passes through $\sigmaup$ , any $\sigmaup_{1}\in\gammaup$ different from $\sigmaup$ , and denote $\tilde{\gammaup}\subset\gammaup$ the interval of $\gammaup$ that connects $\sigmaup$ and $\sigmaup_{1}$ . By necessity, for an arbitrary such $\sigmaup_{1}$ we have

[TABLE]

The equality $y_{\nuup}\big{(}\iotaup(\sigmaup_{1})\big{)}=y_{\nuup}\big{(}\iotaup(\sigmaup)\big{)}$ for all $\nuup$ and $\sigmaup_{1}$ means that $\iotaup(\gammaup)\subset\piup^{-1}(\piup(\sigmaup))$ and thus $S\cap\piup^{-1}(\piup(\sigmaup))$ is uncountable. This contradicts the premise, therefore $\mathring{\Omega}_{0}=\varnothing$ , which means that $\Omega_{0}=\Theta_{0}$ is nowhere dense.

Repeat this reasoning in the inductive manner for $\Theta_{k}$ , $0<k<\dim S$ . The only difference at each step is the number of independent vector fields that solve $\omegaup_{\nuup}(V)=0$ , which is equal to $\dim S-k$ . In the end of each step $\mathring{\Theta}_{k}=\varnothing$ (the proved expression) and $\mathring{\Theta}_{k-1}=\varnothing$ (the expression from the previous step) together imply $\mathring{\Omega}_{k}=\varnothing$ . The induction chain breaks at $\Theta_{\dim S}$ , since in this case the aforementioned equations have no nontrivial solutions, and thus $D_{S}$ is 0-dimensional in $\Theta_{\dim S}$ .

Now select a chart around every point of $S$ and then subselect a finite covering from this collection (which is possible by the compactness of $S$ ). Repeat the reasoning for all of them to get the conclusion of the lemma.

∎

Note that the requirements of compactness of manifolds and finiteness of $\piup^{-1}(y)\cap S$ are not essential. It is only essential for $\piup^{-1}(y)\cap S$ to be at most countable. But this level of generality brings about unneeded complications that are not relevant for the following.

Proof of Theorem 1.

Let us call a presolution to the reduced discrimination problem a phenotype $a$ such that the protein has equal in the energy level critical points of energy for $\ell_{0}$ and $\ell_{1}$ , and not necessarily minima. It is clear that the proper solutions make a subset of the presolutions.

Consider the following diagram, associated with energy functions on $M$ :

[TABLE]

Here $\piup_{1}$ is the projection of $J^{1}_{2}(M,\varmathbb R)$ as a bundle over $M^{(2)}$ . $\piup_{X}$ is $(p_{X}\times p_{X})|_{M^{(2)}}$ , where $p_{X}$ is the natural projection of $M$ on $X$ , analogously for $\piup_{L}$ and $\piup_{A}$ . $\piup_{E}$ is the projection to pairs of energy values, associated with a multijet (it can be seen as $(p_{\varmathbb R}\times p_{\varmathbb R})|_{J^{1}_{2}(M,\varmathbb R)}$ , where $p_{\varmathbb R}$ is the projection to the first factor of $\varmathbb R\times T^{*}M\simeq J^{1}(M,\varmathbb R)$ ). $\iotaup_{X}$ , $\iotaup_{L}$ , $\iotaup_{A}$ , and $\iotaup_{E}$ are the obvious natural inclusions (embeddings).

Finally, $P_{X}$ and $\piup_{T^{*}M}$ are defined as follows. Let us consider again $J^{1}(M,\varmathbb R)$ as $\varmathbb R\times T^{*}M$ and let $p_{T^{*}M}$ be the natural projection on the second factor. In turn,

[TABLE]

where $V\oplus W$ is the Whitney sum of two vector bundles $\piup_{V}:V\to B$ , $\piup_{W}:W\to B$ over the same base $B$ , i. e. the pullback from the following commutative diagram ( $\iotaup_{\Delta}$ is the diagonal inclusion map)

[TABLE]

Let $0_{X}$ be the image of the 0-th section of $T^{*}X$ in $T^{*}X$ . Then we define

[TABLE]

and $\iotaup_{P_{X}}$ as the natural inclusion $P_{X}^{2}\to(T^{*}M)^{2}$

Let us define

[TABLE]

The meaning of these sets is the following. Space $J^{1}_{2}(M,\varmathbb R)$ consists of pairs of jets over two at least somehow distinct points of $M$ . The preimage of $\Delta\varmathbb R^{2}$ defines pairs of jets that have the same energy values. The preimage of $P_{X}^{2}$ defines pairs of jets both of which have zero partial derivatives in $X$ (so, the corresponding points in $X$ are critical points for any representatives of these jets). The preimage of $(\ell_{0},\ell_{1})$ defines pairs of jets one of which is over $\ell_{0}$ and the other one is over $\ell_{1}$ . The preimages of $\Delta A$ and $\Delta X$ define pairs of jets that have the same values of $a$ and $x$ , correspondingly.

Therefore, $V_{1}$ corresponds to pairs of tuples $(x_{0},\ell_{0},a)$ and $(x_{1},\ell_{1},a)$ ( $\ell_{i}$ are fixed to the values of the problem) such that functions $U(\cdot,\ell_{i},a)$ have corresponding $x_{i}$ as critical points and $U(x_{0},\ell_{0},a)=U(x_{1},\ell_{1},a)$ . $V_{2}$ corresponds to the same tuples but with the additional constraint $x_{0}=x_{1}$ . Accordingly, $\textrm{U}_{1}$ corresponds to evolutionary presolutions of the reduced discrimination problem, while $\textrm{U}_{2}$ corresponds to such presolutions where the critical points coincide.

By Theorem 5, for a typical $U$ (from a residual set of all $U\in C^{\infty}(M)$ ), we have both $j^{1}_{2}U\pitchfork W_{1}$ and $j^{1}_{2}U\pitchfork W_{2}$ . Indeed, the theorem guaranties that each of the condition is verified on a residual set. Therefore, they both are verified on a residual set, as an intersection of two residual sets is residual.

Sets $Y_{i}$ are compact submanifolds of $J^{1}_{2}(M,\varmathbb R)$ . Indeed, consider $W_{i}$ and $Y_{i}$ as subsets of $J^{1}(M,\varmathbb R)^{2}\supset J^{1}_{2}(M,\varmathbb R)$ . If on the diagram above we replace $J^{1}_{2}(M,\varmathbb R)$ by $J^{1}(M,\varmathbb R)^{2}$ , $j^{1}_{2}U$ by $j^{1}U\times j^{1}U$ , $\piup_{1}$ by its nonrestricted version, all other projections of the form $\piup_{\bullet}$ by nonrestricted $p_{\bullet}\times p_{\bullet}$ , and then repeat the construction of $Y_{i}$ in the same way, they will coincide with $Y_{i}$ constructed in the old way. Indeed, the only difference could be some additional points on the preimage of the diagonal $\piup_{1}^{-1}(\Delta M^{2})$ , but since $(p_{L}\times p_{L})^{-1}(\operatorname{Im}\iotaup_{L})\cap\Delta M^{2}=\varnothing$ , we have $Y_{i}\cap\piup_{1}^{-1}(\Delta M^{2})=\varnothing$ , hence the equality. As $j^{1}U\times j^{1}U(M^{2})$ is compact due to the compactness of $M^{2}$ , $Y_{i}$ are compact, too. This property conserves upon restriction to $J^{1}_{2}(M,\varmathbb R)$ .

From the transversality properties and from $\piup_{1}\circ j^{1}_{2}U=\operatorname{Id}_{M}$ , by Theorem 4, $V_{i}$ are submanifolds of $M^{(2)}$ . These submanifolds are compact, too.

Note that

[TABLE]

Therefore, by Theorem 4 and assuming $Y_{i}\neq\varnothing$ ,

[TABLE]

We already see that if $\dim X>0$ (the system is minimally flexible), then $\dim V_{1}>\dim V_{2}$ . As $V_{2}\subset V_{1}$ , we can conclude that points of $V_{1}$ that correspond to coincident critical values are not typical. More specifically, they form a submanifold of codimension $\dim X$ . For instance, if $\dim A<\dim X+1$ , such points do not exist at all. Otherwise, if $\dim X>0$ , $V_{2}$ is a negligible subset in $V_{1}$ in the sense that it is nowhere dense and closed.

Note that by construction, $V_{i}\subset\piup_{A}^{-1}(\operatorname{Im}\iotaup_{A})\subset M^{(2)}$ and $\textrm{U}_{i}$ can be understood as projections of $V_{i}$ on $A$ from $M^{(2)}$ , for example as $\textrm{U}_{i}=\piup(V_{i})$ , where $\piup=p_{1}\circ\piup_{A}\colon M^{(2)}\to A$ and $p_{1}\colon A\times A\to A$ is the projection on the first factor. Unfortunately $\textrm{U}_{1}$ and $\textrm{U}_{2}$ are not in general submanifolds of $A$ due to generic singularities of the corresponding projection and generic self-crossings of the images of $\textrm{U}_{i}$ . We do not expect them to be even immersed manifolds. In fact, in a typical case, they form so called stratified sets of $A$ . We will show, however, that typical points of $\textrm{U}_{1}$ do form $(\dim A-1)$ -dimensional submanifolds with multiple connectness components.

Consider the fibre bundle $M\to L\times A$ , where the projection is the natural projection of $M$ to the corresponding factor. Then $U$ can be viewed as a family of potentials $U_{\ell,a}$ on $X$ parametrized by $\ell$ and $a$ . By Lemma 1, for typical $U$ , all $U_{\ell,a}$ have only finite number of critical points for each pair $(\ell,a)$ . Using the diagram

[TABLE]

and the same reasoning as in the beginning of the proof, we conclude that for a typical $U$ , the sets $V_{\ell_{i}}$ of all critical points in $X$ of $U_{\ell_{i}}$ at different $a$ (we will call $V_{\ell_{i}}$ the critical set of $U_{\ell_{i}}$ ) constitute manifolds in $X\times\{\ell_{i}\}\times A\subset M$ and can be regarded as subsets of $X\times A\simeq X\times\{\ell_{i}\}\times A$ .

Consider a fibre bundle with the natural projection $\xiup_{A}\colon X\times A\to A$ with two functions $U_{\ell_{0}}$ , $U_{\ell_{1}}$ on $X\times A$ that correspond to $U$ at different values of $\ell$ . They too have finite number of critical point over every $a\in A$ , since these functions are restrictions of $U$ . If we consider the direct product of these fibre bundles with projection $\xiup_{A}\times\xiup_{A}\colon(X\times A)^{2}\to A^{2}$ , function $U_{\ell_{0}}\times U_{\ell_{1}}$ has $V_{\ell_{0}}\times V_{\ell_{1}}$ as its critical set. It is finite over any $(a_{0},a_{1})\in A^{2}$ , too. Therefore, $V_{1}$ regarded as a submanifold of $X^{2}\times A\simeq X^{2}\times\Delta A^{2}$ , is a submanifold of $V_{\ell_{0}}\times V_{\ell_{1}}$ , and thus finite over any point $a\in A$ . In other words, only finite number of points from $V_{1}$ are projected to any $a\in\textrm{U}_{1}$ . By Lemma 2, a typical point of $V_{1}$ (from an open dense subset) is projected regularly. Therefore, $\piup_{A}|_{V_{1}}$ is a local immersion in a neighbourhood of a typical point with finite preimage. Due to compactness of $V_{1}$ , the set of points of change of the number of preimages and the intersection locus of the immersion in regular points are closed nowhere dense sets of $\textrm{U}_{1}$ . Therefore, typical points of $\textrm{U}_{1}$ form open submanifold of $A$ of dimension $\dim A-1$ . $V_{2}$ projects to this submanifold as a $(\dim A-\dim X-1)$ -dimensional manifold and thus is closed and nowhere dense. After exclusion of all these meager points (singularities of projection, self-intersections, change of number of preimages, and $\textrm{U}_{2}$ ), we are left with an open $(\dim A-1)$ -dimensional submanifold $\tilde{\textrm{U}}$ of the phenotype space $A$ , which is dense in $\textrm{U}_{1}$ .

Finally, let us return to the proper solution of the reduced discrimination problem, that is, when we consider only the parts of $V_{1}$ that correspond to the global minima and only the corresponding parts of $\textrm{U}_{1}$ . This will reduce $\textrm{U}_{1}$ to a smaller subset $\hat{\textrm{U}}$ , but all the conclusions will hold for it, too. Typical points of $\hat{\textrm{U}}$ form an open $(\dim A-1)$ -dimensional submanifold of $A$ and the corresponding minimum points in $X$ over $\ell_{0}$ and $\ell_{1}$ do not coincide.

The only possible complication can come from situations when at least at one of $\ell_{i}$ , $U_{\ell_{i},a}$ has multiple global minima. Each part of $V_{1}$ that is projected to the corresponding $a$ guarantees only that each pair of a minimum point at $\ell_{0}$ and a minimum point at $\ell_{1}$ do not coincide, but it does not preclude a situation, when to the same point $a$ , for example, two parts of $V_{1}$ are projected that correspond to pairs of minimum points at $\ell_{0}$ and $\ell_{1}$ of the form $(x_{0},x_{1})$ and $(x_{1},x_{2})$ . In this case we would have two coinciding global minimum points for $\ell_{0}$ and $\ell_{1}$ . However, as two members of the same pair correspond to the same value of the global minima, we have in this case

[TABLE]

But all the minima must have the same value of energy, as they are global minima. Therefore, we must have

[TABLE]

It follows that such $a$ belongs to $\textrm{U}_{2}$ and is not in the described submanifold of typical solutions to the reduced discrimination problem. This concludes the proof.

∎

Proof of Theorem 2.

Consider the following diagram

[TABLE]

where $\piup_{1,0}$ is the natural projection $[f]^{1}_{p}\mapsto[f]^{0}_{p}$ (it forgets about tangency and keeps information only about intersections of function graphs) and $P_{X}$ is as in the proof of Theorem 1. Consider the manifold (for generic $U$ ) $Y\subset J^{1}(M,\varmathbb R)$ defined by the intersection of the preimage of $P_{X}$ and the image of $j^{1}U$ . Consider also, as before, the set $V$ of critical points of $U$ , which is a manifold in $M$ , $V=(j^{1}U)^{-1}(Y)$ . Consider now the projection $V_{0}$ of $Y$ to $J^{0}(M,\varmathbb R)$ . It is a manifold of $J^{0}(M,\varmathbb R)$ . Indeed, it is just $j^{0}U(V)$ and $\operatorname{Im}j^{0}U$ is an embedding of $M$ (it is just the graph of $U$ ).

Note that $J^{0}(M,\varmathbb R)\simeq\varmathbb R\times M$ . As before, we can assume that for every $\ell$ and $a$ , $U_{\ell,a}$ has only finitely many critical points. Therefore, $V_{0}$ projected to $\varmathbb R\times L\times A$ by the natural projection from $\varmathbb R\times L\times A$ locally over a typical point $(\ell,a)$ looks like a finite set of sections (graphs) of the bundle $\varmathbb R\times L\times A\to L\times A$ . Nontypical points of $L\times A$ constitute a codimension-1 subset, which consists of either intersections of these manifolds or of the singularities of their projections. The set $W$ that corresponds to global minima of $U$ is a subset of this union of $(\dim L+\dim A)$ -dimensional manifolds that locally in a typical point looks like a single such manifold that is regularly projected to $L\times A$ .

For a typical $U$ and $\ell$ , $U(\cdot,\ell,\cdot)\colon X\times A\to\varmathbb R$ is a typical family of functions on $X$ parametrized by $A$ . Thus, for a typical $a$ , $U_{\ell_{1},a}$ is Morse function. It has only separate nondegenerate critical points that are all mapped to different values. A codimension-1 bifurcations of a typical family (bifurcations that happen on a codimension-1 subset of $A$ ) include only fold bifurcations and equality of the function value for some two critical points. All other bifurcations have codimension greater than 2 and thus those of them that happen to be in $\hat{\textrm{U}}$ form a meager set of $\hat{\textrm{U}}$ (due to compactness of $A$ , the complement to this set is open and dense in $A$ ). Therefore, we can consider that typical points of $\hat{\textrm{U}}$ that possess the properties stated in Theorem 1 do not include these higher codimension bifurcations. Furthermore, possible codimension-1 bifurcations of the global minimum (and maximum) exclude fold bifurcations. Indeed, during a fold bifurcation, a critical point disappears in a collision with a nondegenerate critical point with different Morse index but for the global minimum it is impossible unless a third point becomes the new global minimum at the same time (this makes the bifurcation of codimension at least 2). Therefore, the only codimension-1 bifurcations of the global minimum are switches of the minimal points (coincidence of minimal values).

Let $\tilde{A}$ be the open and dense subset of $A$ formed by the complement to the set of bifurcations of codimension greater than 2 for global minima of the family $U_{\ell_{1},a}$ . Let us consider the intersection $W_{\ell_{1}}$ of the submanifold $\varmathbb R\times\{\ell_{1}\}\times\tilde{A}\simeq\varmathbb R\times\tilde{A}$ of $\varmathbb R\times L\times A$ and $W$ . From the previous paragraph we conclude that in some neighbourhood of each point of $W_{\ell_{1}}$ , $W$ looks either like a graph of some smooth function $\upvarphi_{1}\colon L\times A\to\varmathbb R$ or like a graph of a continuous function $(\ell,a)\mapsto\min(\upvarphi_{1}(\ell,a),\upvarphi_{2}(\ell,a))$ , where $\upvarphi_{1}$ and $\upvarphi_{2}$ are two smooth functions that are equal at the point in question. Let, for the first case, $s_{1}=j^{0}\upvarphi_{1}$ be the corresponding section of the bundle $\piup\colon\varmathbb R\times L\times A\to L\times A$ , where $\piup$ is the natural projection. Let $s_{1}$ and $s_{2}$ be the sections corresponding to $\upvarphi_{1}$ and $\upvarphi_{2}$ of the second case. Let $I=\{1\}$ and $I=\{1,2\}$ for the first and the second cases, correspondingly. Let the local coordinates in $\varmathbb R\times L\times A$ be $(E,\ell,a)$ .

Let us locally define,for each $(\ell_{1},a^{\prime})\in W_{\ell_{1}}$ , $a^{\prime}\in\tilde{A}$ , and a corresponding neighbourhood $\mathscr{O}_{(\ell_{1},a^{\prime})}$ that admits the aforementioned representation, functions on $\mathscr{U}_{a^{\prime}}=\mathscr{O}_{(\ell_{1},a^{\prime})}\cap\tilde{A}$

[TABLE]

where by $(\varv,0)$ we understand the vector in $T_{\ell_{1}}L\oplus T_{a}A\simeq T_{(\ell_{1},a)}(L\times A)$ that corresponds to $\varv$ . These functions are smooth. Let us also define, for the same point, neighbourhood, and representation, the following piecewise smooth function $f$ on $\mathscr{U}_{a^{\prime}}$ . If the representation corresponds to $I=\{1\}$ , then we define

[TABLE]

If the representation corresponds to $I=\{1,2\}$ , then we define

[TABLE]

It is not difficult to see that, by construction, the values of thus defined functions for any $a$ and any pair its intersecting neighbourhoods $U^{\prime}_{a}$ and $U^{\prime\prime}_{a}$ (induced from any two $\mathscr{O}^{\prime}_{\ell_{1},a}$ and $\mathscr{O}^{\prime\prime}_{\ell_{1},a}$ ) agree in $U^{\prime}_{a}\cap U^{\prime\prime}_{a}$ . Thus, function $f$ is globally defined on $\tilde{A}$ . Moreover, we can use any its possible representation to study its local behaviour.

By construction, the sign of this function defines the sign of difference between $U_{\varw}$ and $U_{r}$ under an infinitesimal separation of $\ell_{1}$ to $\ell_{r}=\ell_{1}$ and $\ell_{\varw}$ , when the latter moves along $\varv$ . Let us extend $f$ on the whole $A$ by setting $f(a)=-1$ for $a\in A\setminus\tilde{A}$ . Then the set $\Omega=\{a\in A:f(a)>0\}$ is an open subset of $A$ . Indeed, it means that the subset $\mathrm{K}=\{a\in A:f(a)\leqslant 0\}$ is closed. To show this, let us choose some convergent sequence $\{a_{n}\}$ in $A$ such that $a_{n}\in\mathrm{K}$ and $a_{n}\to a$ . First note that $A\setminus\tilde{A}$ is trivially in $\mathrm{K}$ . Let $a\in\tilde{A}$ . Starting from some number, all points of $a_{n}$ lay in some of $\mathscr{U}_{a}$ with one of the considered representations of $f$ . If $I=\{1\}$ for this representation, by the smoothness of functions $\upvarphi_{1}$ and $f_{1}$ and thus of $f$ in $\mathscr{U}_{a}$ , we have $f(a_{n})\to f(a)$ and $f(a)\leqslant 0$ , thus $a\in\mathrm{K}$ . If, on the other hand, $I=\{1,2\}$ for the selected representation, the situation $f_{1,2}(a)>0$ is impossible, and we must have either $f_{1,2}(a)\leqslant 0$ , and thus $a$ is automatically in $\mathrm{K}$ , or one of $f_{i}(a)$ must be greater than 0 and the other one be smaller or equal to 0. Let us for concreteness assume $f_{1}(a)\leqslant 0<f_{2}(a)$ . But then, starting from sufficiently large number, we must have $f(a_{n})=f_{1}(a)$ and thus $f(a)=f_{1}(a)\leqslant 0$ , so again $a\in\mathrm{K}$ . Therefore, $\mathrm{K}$ contains all its limit points and thus is closed, from which follows that $\Omega$ is open.

As an open subset of $A$ , $\Omega$ is its ( $\dim A$ )-dimensional submanifold. Let $\check{\textrm{U}}$ be the open $(\dim A-1)$ -dimensional sumbanifold of $A$ made of typical points granted by Theorem 1 ( $\check{\textrm{U}}$ is an open dense subset of $\hat{\textrm{U}}$ ). Then, trivially, $\check{\textrm{U}}\pitchfork\Omega$ and thus $\tilde{\textrm{U}}=\check{\textrm{U}}\cap\Omega$ is either empty or a $(\dim A-1)$ -dimensional submanifold of $A$ . By construction, $\tilde{\textrm{U}}$ consists of typical solutions to the infinitesimal discrimination problem with different minimal points.

∎

Proof of Theorem 3.

Let $\hat{x}^{ij}$ denote the minimal points of, respectively, $U_{\ell_{ij},\,a}$ (for a typical value of $a$ the minimal points are unique and nondegenerate [10, Propositions 6.3 and 6.13]). They can be represented as $\hat{x}^{ij}=(\hat{x}^{ij}_{0},\hat{x}^{ij}_{1},\hat{x}^{ij}_{2})$ , $\hat{x}^{ij}_{k}\in X_{k}$ . Let us also suppose that $\hat{x}^{01}_{2}\neq\hat{x}^{11}_{2}$ . If not, this can be recovered by an arbitrarily small perturbation of $U$ as follows. First recall that if $W$ is a manifold, $\mathscr{O}$ is any its open subset, $\mathscr{C}_{1}$ and $\mathscr{C}_{2}$ are any its closed subsets such that $\mathscr{C}_{1}\cap\mathscr{C}_{2}=\varnothing$ , than there are smooth functions $\upvarphi$ and ψ on $W$ such that

[TABLE]

An explicit construction of such functions can be found in [15, Part 2, p. 13].

By the premise, we have $\hat{x}^{01}\neq\hat{x}^{11}$ . Consider now a neighbourhood $\mathscr{O}$ of point $(\hat{x}^{11},a)$ in $X\times A$ such that $(\hat{x}^{01},a)\notin\mathscr{O}$ . Choose an associated with $\mathscr{O}$ by (25) function $\upvarphi$ and a nonzero in $\mathscr{O}$ rectifiable vector field $\varv$ tangent to $T_{p}X_{2}$ at each point $p\in X\times A$ (such vector field always exists in a small enough $\mathscr{O}$ ). Consider now the vector field $\varw=\upvarphi\varv$ and the phase flow $g^{t}$ generated by $\varw$ (so $q(t)=g^{t}(p)$ is the solution to the differential equation $dq/dt=\varw_{q}$ with $q(0)=p$ ). The function $U_{0}^{(\varepsilonup)}=U_{0}\circ g^{\varepsilonup}$ provides the needed deformation of $U_{0}$ (see Figure 3). The corresponding deformation $U^{(\varepsilonup)}=U_{0}^{(\varepsilonup)}+U_{1}+U_{2}$ of $U$ tends to $U$ (in the Whitney $C^{\infty}$ topology) as $\varepsilonup\to 0$ . Note also that $U^{(\varepsilonup)}$ still verifies (6), but for any $\varepsilonup\neq 0$ , $\hat{x}^{01}_{2}\neq\hat{x}^{11}_{2}$ . From the other hand, if these points are different, this cannot be changed by an arbitrarily small perturbation of $U$ , as the position of nondegenerate critical points of a function smoothly depends on this perturbation to a certain extent. Therefore, the inequality of $\hat{x}^{01}_{2}$ and $\hat{x}^{11}_{2}$ is typical for a typical $a$ that solves the reduced discrimination problem for $\ell_{01}$ and $\ell_{11}$ .

Now let (6) hold for $U$ and $\hat{x}^{01}_{2}\neq\hat{x}^{11}_{2}$ . Let us denote for brevity $N=X_{2}\times\mathrm{P}\times A$ . Let us also denote $p^{ij}=(\hat{x}^{ij}_{2},\rhoup_{2},a)$ , $p^{ij}\in N$ . Let $\mathscr{C}_{1}$ and $\mathscr{C}_{2}$ be two closed subset of $N$ such that $p^{11}\in\mathring{\mathscr{C}}_{1}$ (the interior of $\mathscr{C}_{1}$ ), $X_{2}\times\{\rhoup_{1}\}\times A\subset\mathring{\mathscr{C}}_{2}$ , $p^{01}\in\mathring{\mathscr{C}}_{2}$ , and $\mathscr{C}_{1}\cap\mathscr{C}_{2}=\varnothing$ (such subsets always exist as $\hat{x}^{01}_{2}$ and $\hat{x}^{11}_{2}$ are different). Choose a function ψ defined by $\textrm{y}\equiv 1$ in $\mathscr{C}_{1}$ , $\textrm{y}\equiv 0$ in $\mathscr{C}_{2}$ , as in (26). Consider a deformation $U_{2}^{(\varepsilonup)}$ of $U_{2}$ in the following form $U_{2}^{(\varepsilonup)}=U_{2}+\varepsilonup\textrm{y}$ and the corresponding deformation of $U$ given by $U^{(\varepsilonup)}=U_{0}+U_{1}+U_{2}^{(\varepsilonup)}$ . For any number $\varepsilonup$ it has the same structure as in (5) and $U^{(\varepsilonup)}\to U$ (in the Whitney $C^{\infty}$ topology) as $\varepsilonup\to 0$ . However, for an arbitrary small enough $\varepsilonup\neq 0$ , it violates (6) but verifies (7). The robustness of situation (7), in turn, again follows from the properties of nondegenerate critical points of functions.

∎

Acknowledgments

The author thanks Olivier Rivoire for the formulation of the problem and for fruitful discussions, and Clément Nizak for carefully reading the manuscript and for his advices on making it more accessible to readers without strong mathematical background. This work was supported by ProTheoMics grant from Paris-Sciences-Lettres University and by ANR-17-CE30-0021-02 RBMPro grant from Agence Nationale de la Recherche.

Bibliography15

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Hammes, G.G., Chang, Y.C. and Oas, T.G. “Conformational selection or induced fit: a flux description of reaction mechanism”. Proceedings of the National Academy of Sciences, 2009, 106(33), pp.13737–13741.
2[2] Hammes, G.G. “Multiple conformational changes in enzyme catalysis”. Biochemistry, 2002, 41(26), pp.8221–8228.
3[3] Koshland, D.E. “Application of a theory of enzyme specificity to protein synthesis”. Proceedings of the National Academy of Sciences, 1958 44(2), pp.98–104.
4[4] Koshland Jr, D.E. “The key-lock theory and the induced fit theory”. Angewandte Chemie International Edition in English, 1995, 33(23‐24), pp.2375–2378.
5[5] Monod, J., Wyman, J. and Changeux, J.P. “On the nature of allosteric transitions: a plausible model”. J Mol Biol, 1965, 12(1), pp.88-118.
6[6] Changeux, J.P. and Edelstein, S. “Conformational selection or induced fit? 50 years of debate resolved”. F 1000 biology reports, 2011, 3.
7[7] Rivoire, O. “Minimal evolutionary scenario for the origin of allostery and coevolution patters in proteins”. ar Xiv:1812.01524
8[8] Csermely, P., Palotai, R. and Nussinov, R. “Induced fit, conformational selection and independent dynamic segments: an extended view of binding events”. Trends in biochemical sciences, 2011, 35(10), pp.539–546.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Allostery and conformational changes upon binding as generic features of proteins: a high-dimension geometrical approach.

Abstract

1 Introduction

2 Physical formulation of the problem

3 Mathematical formulation of the problem

4 Main results

4.1 Exquisite discrimination, conformational changes, and fine tuning

Definition**.**

Theorem 1**.**

Theorem 2**.**

4.2 Conformational changes and allosteric regulation

Theorem 3**.**

5 Discussion and biological interpretation

6 Proofs of Theorem 1, Theorem 2, and Theorem 3

Definition**.**

Definition**.**

Definition**.**

Definition**.**

Definition**.**

Theorem 4** ([10], page 52, Theorem 4.4).**

Theorem 5** (A special case of Mather’s multijet transversality theorem, [10], page 57, Theorem 4.13).**

Lemma 1**.**

Proof.

Lemma 2**.**

Proof.

Proof of Theorem 1.

Proof of Theorem 2.

Proof of Theorem 3.

Acknowledgments

Definition.

Theorem 1.

Theorem 2.

Theorem 3.

Definition.

Definition.

Definition.

Definition.

Definition.

Theorem 4 ([10], page 52, Theorem 4.4).

Theorem 5 (A special case of Mather’s multijet transversality theorem, [10], page 57, Theorem 4.13).

Lemma 1.

Lemma 2.