A P Systems Variant for Reasoning about Sequential Controllability of Boolean Networks
Artiom Alhazov, Vincent Ferrari-Dominguez, Rudolf Freund, Nicolas, Glade, Sergiu Ivanov

TL;DR
This paper introduces Boolean P systems, a new formalism tailored for reasoning about the sequential controllability of Boolean networks, establishing its computational complexity as PSPACE-complete.
Contribution
The paper proposes a novel Boolean P systems formalism for analyzing Boolean networks and formalizes the sequential controllability problem within this framework.
Findings
Proves that the sequential controllability problem is PSPACE-complete.
Demonstrates how P systems can be adapted for specific reasoning tasks.
Provides a formal foundation for future research in biological network control.
Abstract
A Boolean network is a discrete dynamical system operating on vectors of Boolean variables. The action of a Boolean network can be conveniently expressed as a system of Boolean update functions, computing the new values for each component of the Boolean vector as a function of the other components. Boolean networks are widely used in modelling biological systems that can be seen as consisting of entities which can be activated or deactivated, expressed or inhibited, on or off. P systems on the other hand are classically introduced as a model of hierarchical multiset rewriting. However, over the years the community has proposed a wide range of P system variants including diverse ingredients suited for various needs. In this work, we propose a new variant -- Boolean P systems -- specifically designed for reasoning about sequential controllability of Boolean networks, and use it to first…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDNA and Biological Computing · Network Packet Processing and Optimization · Wastewater Treatment and Nitrogen Removal
\newclass\LBAACCEPTANCE
LBA–ACCEPTANCE \newclass\SEQCONTROLSEQ–CONTROL
11institutetext: Vladimir Andrunachievici Institute of Mathematics and Computer Science,
The State University of Moldova
Academiei 5, Chişinău, MD-2028, Moldova
22institutetext: École Normale Supérieure – PSL University, CNRS UMR 8553,
45, rue d’Ulm, F-75230 Parix Cedex 3, France
33institutetext: Faculty of Informatics, TU Wien
Favoritenstraße 9–11, 1040 Wien, Austria
44institutetext: Université Grenoble Alpes, CNRS UMR5525, CHU Grenoble Alpes, Grenoble INP
TIMC-IMAG, F-38000 Grenoble, France
55institutetext: Université Paris-Saclay, Univ. Évry, IBISC,
23 boulevard de France, 91034 Évry, France
A P Systems Variant for Reasoning about Sequential
Controllability of Boolean Networks††thanks: This is a revised and extended version of [1].
Artiom Alhazov 11
Vincent Ferrari-Dominguez 22
Rudolf Freund 33
Nicolas Glade 44
Sergiu Ivanov Corresponding author.55
Abstract
A Boolean network is a discrete dynamical system operating on vectors of Boolean variables. The action of a Boolean network can be conveniently expressed as a system of Boolean update functions, computing the new values for each component of the Boolean vector as a function of the other components. Boolean networks are widely used in modelling biological systems that can be seen as consisting of entities which can be activated or deactivated, expressed or inhibited, on or off. P systems on the other hand are classically introduced as a model of hierarchical multiset rewriting. However, over the years the community has proposed a wide range of P system variants including diverse ingredients suited for various needs. In this work, we propose a new variant—Boolean P systems—specifically designed for reasoning about sequential controllability of Boolean networks, and use it to first establish a crisp formalization of the problem, and then to prove that the problem of sequential controllability is \PSPACE-complete. We further claim that Boolean P systems are a demonstration of how P systems can be used to construct ad hoc formalisms, custom-tailored for reasoning about specific problems, and providing new advantageous points of view.
1 Introduction
Membrane computing and P systems are a paradigm of massively parallel computing introduced more than two decades ago by Gh. Păun [27], and inspired by the structure and the functioning of the biological cell. Following the example of the cell, a membrane (P) system is a hierarchical membrane structure with compartments containing multisets of objects, representing in an abstract sense the biochemical species. Multiset rewriting rules are attached to every membrane to represent the reactions. Over the last two decades, a considerable number of variants of P systems have been introduced, inspired by various aspects of cellular life, or capturing specific computing properties. For comprehensive overviews we refer the reader to [14, 28].
Even though P systems are directly inspired by the biological cell, their use for actual cellular modelling has encountered relatively little success. On the other hand, Boolean networks have been quite successful recently, despite their relative dissimilarity to biological structures—a Boolean network is a set of Boolean variables equipped with Boolean update functions, describing how to compute the new value of the variables from their current values. We refer the reader to [1] for a more detailed impression.
One application of interest of Boolean networks is controllability—the problem of deciding whether externally modifying some parameters of a system can make it reach a particular state, and finding the necessary modifications [6, 12, 25, 30, 31]. A variant of this problem which has attracted particular attention is sequential controllability: instead of looking for a particular combination of control inputs, find a sequence of control inputs to guide the system to a given state [17, 18, 19, 20, 22, 24]. Sequential controllability is promising because it may allow reducing the total number of control actions, or may even drive the Boolean network along trajectories which would otherwise be inaccessible. On the other hand, sequential controllability is \PSPACE-hard [24], making it a difficult problem to tackle.
The goal of this paper is to show how to combine the modelling power of Boolean networks with the richness of P systems to reason about and prove some properties of sequential controllability of Boolean networks. We construct a P system variant to satisfy the following two properties simultaneously:
represent sequential controllability of Boolean control networks via simple syntax transformations, 2. 2.
have \PSPACE-complete reachability.
This formalization of sequential controllability allows us to complete the complexity result from [24] by proving that this problem is \PSPACE-complete. We would like to use this construction to promote P system variants as a general tool for building ad hoc formalisms specifically tailored for tackling particular problems.
This paper is structured as follows. Section 2 briefly recalls all the necessary preliminaries: linear bounded automata, P systems, Boolean networks, sequential controllability. Section 3 introduces the specific P system variant for tackling sequential controllability: Boolean P systems. Section 4 shows how Boolean P systems can directly simulate Boolean networks. Section 5 introduces composition of Boolean P systems in the spirit of automata theory, and Section 6 shows how composite Boolean P systems can capture a Boolean network together with the master dynamical system emitting the control inputs. In Section 7, we show that the reachability problem for Boolean P systems is \PSPACE-complete, and we use this result in Section 8 to show that sequential controllability of Boolean networks is \PSPACE-complete as well. Finally, in Section 9 we extensively discuss the obtained technical results concerning sequential controllability, the features of Boolean P systems, and the general methodology of designing ad hoc formalisms custom-tailored to specific problems.
2 Preliminaries
In this section, we briefly recall the necessary preliminaries, in particular deterministic bounded automata, P systems, Boolean networks, Boolean Control Networks (BCN), and sequential controllability of BCN.
Given two sets and , we denote by the set of all functions . We denote by the set of all subsets of (the power set of ) and by the cardinal of the set . An indicator function of a subset is the function with the property that . In this paper, we will often use the same symbol to refer to a subset and to its indicator function.
2.1 Deterministic Linear Bounded Automata (LBA)
A deterministic linear bounded automaton (deterministic LBA or simply LBA) is a construct
[TABLE]
where:
- •
is a finite set of states,
- •
is the finite tape alphabet,
- •
is the input alphabet,
- •
is the output alphabet,
- •
is the transition function,
- •
is the initial state,
- •
is the final state,
- •
is the left boundary marker,
- •
is the blank symbol,
- •
is the right boundary marker,
We restrict the transition function such that the automaton can never write over the boundary markers or exceed them, more precisely:
[TABLE]
A configuration of the automaton will be written as , where , . The state is written to the left of the underlined tape symbol on which the head of the automaton currently stands.
Suppose the LBA is in state and reads the symbol on the tape. If , one of the following transitions occurs, depending on the value of :
[TABLE]
Due to the restriction of the transition function, the accessible part of the tape is limited to the input plus the two delimiters and . Another model of LBA consists in restricting the size of the accessible part of the tape to a linear function of the input, which is the origin of the name linear bounded automaton. The two models have the same computational power [13].
An LBA accepts the input if starting with the configuration it reaches a configuration of the form . Given an LBA and an input , the problem consists in deciding whether accepts . This problem is \PSPACE-complete [13].
2.2 P Systems
In this subsection, we give a general overview of P systems. For more details, we refer the reader to [14, 28]. A P system is a construct
[TABLE]
where is the alphabet of objects, is the alphabet of terminal objects, is the membrane structure injectively labelled by the numbers from and usually given by a sequence of correctly nested brackets, are the multisets giving the initial contents of each membrane (), is the finite set of rules associated with membrane (), and and are the labels of the input and the output membranes, respectively (, ).
Quite often, the rules associated with membranes are multiset rewriting rules (or special cases of such rules). Multiset rewriting rules have the form , with and , where is the set of multisets over , and , for all . If , the rule is called non-cooperative; otherwise it is called cooperative. In communication P systems, rules are additionally allowed to send symbols to the neighbouring membranes. In this case, for rules in , , where contains the symbols (corresponding to sending the symbol to the parent membrane), (indicating that the symbol should be kept in membrane ), and (indicating that the symbol should be sent into the child membrane of membrane ). When writing out the multisets over , the indication is often omitted.
In P systems, rules are often applied in a maximally parallel way: in one derivation step, only a non-extendable multiset of rules can be applied. The rules are not allowed to consume the same instance of a symbol twice, which creates competition for objects and may lead to non-deterministic choice between the maximal collections of rules applicable in one step.
A computation of a P system is traditionally considered to be a sequence of configurations it can successively visit, stopping at the halting configuration. A halting configuration is a configuration in which no rule can be applied any more, in any membrane. The result of a computation of a P system as defined above is the contents of the output membrane projected over the terminal alphabet .
Example 1
Figure 1 shows the graphical representation of the P system formally given by
[TABLE]
In the maximally parallel mode, the inner membrane 2 of will apply as many instances of the rules as possible, thereby doubling the number of , and ejecting a copy of into the surrounding (skin) membrane at each step. The symbol in the skin membrane is not used. Therefore, after steps of evolution, membrane 2 will contain the multiset and membrane 1 the multiset . Since all rules are always applicable in , this P system never halts. ∎
2.3 Boolean Networks
A Boolean variable is a variable which may only have values in the Boolean domain . Let be a finite set of Boolean variables. A state of these variables is any function , assigning a Boolean value to every single variable. An update function is a Boolean function computing a Boolean value from a state: . A Boolean network over is a function , in which the update function for a variable is computed as a projection of : .
A Boolean network computes trajectories on states by updating its variables according to a (Boolean) mode , defining the variables which should be updated together in a step. Typical examples of modes are the synchronous mode and the asynchronous mode . A trajectory of a Boolean network under a given mode is any finite sequence of states such that can derive from under the mode .
Remark 1
The definition of modes and evolution are quite different in P systems and Boolean networks. The asynchronous mode in Boolean networks only allows updating one variable at a time, while the asynchronous mode in P systems generally allows any combinations of updates. Furthermore, no halting conditions are generally considered in Boolean networks, and the asymptotic behavior is often looked at as the important part of the dynamics. ∎
Example 2
Consider the set of variables with the corresponding update functions and . Figure 2 shows the possible state transitions of this network under the synchronous and the asynchronous modes. The states are represent as pairs of binary digits, e.g. stands for the state in which and .
We notice that, under the synchronous mode, this network exhibits three kinds of behaviors. If initialized to , it will stay in this state forever— is a stable state. If initialized to , the network will directly converge to . Finally, if it is initialized to any one of the states or , it will oscillate between them. The synchronous mode yields deterministic behavior.
The state transitions are quite different under the asynchronous mode, under which only one variable may be updated at a time. While state remains stable, states and can now oscillate to , but not directly between them. Moreover, these states can also converge to , but cannot anymore. ∎
2.4 Boolean Control Networks (BCN)
Boolean networks are often used to represent biological networks in the presence of external perturbations: environmental hazards, drug treatments, etc. (e.g., [5, 6, 24]). To represent network reprogramming, an extension of Boolean networks can be considered: Boolean control networks (BCN) [6]. Informally, a BCN is a parameterized Boolean network template; assigning a Boolean value to every single one of its parameters yields a Boolean network.
Formally, a Boolean control network is a function , where the elements of , , are called the control inputs. To every valuation of control inputs, associates a Boolean network. A control of is any Boolean assignment to the control inputs: .
While this definition of BCNs is very general, in practice one restricts the impact the control inputs may have on the BCN to some biologically relevant classes. One particularly useful class are freeze perturbations, in which a variable in is temporarily frozen to 0 or to 1, independently of its normal update function. These actions mean to model gene knock-outs and knock-ins.
When Boolean update functions are written as propositional formulae, freeze control inputs can be written directly in the formulae. Consider for example a Boolean network over with the update functions and . To allow for freezing of , we introduce the control variables into the Boolean formula of in the following way: . Setting to 0 and to 1 freezes to 0, independently of the values of and . Symmetrically, setting to 0 and to 1 freezes to 1. Setting both and to 0 is generally disallowed.
In this paper, we will use two notations to indicate which control inputs correspond to which controlled variable. In the simplest examples in which the variables have no indices, e.g. or , we will directly specify the name of the variable in the subscript of the corresponding control inputs, like so: , , , or . In more general cases, we will refer to the variables by indexed names , and we will only specify the respective index as the subscript of the corresponding control inputs: and .
2.5 Sequential Controllability of BCN
In many situations, perturbations of biological networks do not happen once, but rather accumulate or evolve over time [9, 16, 24]. In the language of Boolean control networks, this accumulation can be represented by sequences of controls . More precisely, take a BCN with the variables and the control inputs , as well as a sequence of controls , . This gives rise to a sequence of Boolean networks . Fix a mode and consider a sequence of trajectories of these Boolean networks. Such a sequence is an evolution of under the sequence of controls if the last state of every is the first state of . In this case we can speak of the trajectory of the BCN under the control sequence as the concatenation of the individual trajectories , in which the last state of every single is glued together with the first state of .
Given the 3-tuple , where is a BCN, is a set of starting states, and is a set of target states, the sequence inference problem is the problem of inferring a control sequence driving from each state in to any state in . This problem was called the CoFaSe problem in [24] and was extensively studied. In particular, is was shown that CoFaSe is \PSPACE-hard.
Example 3
Consider again the Boolean network from Example 2, with and the update functions and . As mentioned before, a convenient way to express freezing controls is by explicitly including the control inputs into the update functions in the following way:
[TABLE]
Notice how setting to 0 essentially sets , and setting to 0 essentially sets , independently of the actual value of or .
Consider now the following 3 controls:
[TABLE]
Informally does not freeze any variables, freezes to 0, and freezes to 1. Consider now the BCN with the variables and the controlled update functions and . Fix the synchronous update mode. A trajectory of this BCN under the control —i.e. a trajectory of —is . A trajectory of is ; remark that is still a stable state of . A trajectory of is . We can now glue together the trajectories , , and by identifying their respective ending and starting states, and we obtain the following trajectory of the BCN under the control sequence :
[TABLE]
It follows from this construction that is a solution for the CoFaSe problem . Remark that is not reachable from under the synchronous mode in the uncontrolled case, as Figure 2 illustrates. ∎
Remark 2
We follow the approach from [24] which decorrelates the length of the control sequence from the length of the trajectories it yields. Thus, can yield trajectories of different lengths greater or equal to 3. From the modeling standpoint, this represents the fact that the time scale on which control inputs are emitted is not necessarily the same as the time scale of the controlled system.
3 Boolean P Systems
In this section we introduce a new variant of P systems—Boolean P systems—tailored specifically to capture sequential controllability of Boolean networks with as little descriptional overhead as possible. We further tackle the differences between evolution modes in Boolean networks and P systems by introducing quasimodes.
Rather than trying to be faithful to the original model of P systems as recalled in Section 2, we here invoke the intrinsic flexibility of the domain to design a variant fitting to our specific use case.
3.1 Formalism
Boolean P systems are set rewriting systems. A Boolean state is represented as the subset of obtained by considering as an indicator function: . By abuse of notation, we will sometimes use the symbol to refer both to the Boolean state and to the corresponding subset of .
A Boolean P system is the following construct:
[TABLE]
where is the alphabet of symbols, and is a set of rewriting rules with propositional guards. A rule is of the form
[TABLE]
where and is the guard—a propositional formula with variables from . The rule is applicable to a set if and , where by abuse of notation we use the same symbol to indicate the set of subsets of which satisfy . Formally, for , we denote by the truth value of the formula obtained by replacing all variables appearing in by 1 in , and by 0 all variables from . Then the set of subsets satisfying is .
Applying the rule to a set results in the set . Applying a finite set of separately applicable rules to results in the new set
[TABLE]
Note how this definition excludes competition between the rules, as only individual applicability is checked. Further note that applying a rule multiple times to the same configuration has exactly the same effect as applying it once.
In P systems, the set of multisets of rules of applicable to a given configuration is usually denoted by [11]. Since in Boolean P systems multiple applications of rules need not be considered, we will only look at the set of sets of rules applicable to a given configuration of a Boolean P system , and use the same notation . A mode of will then be a function assigning to any configuration of a set of sets of rules applicable to : . If for any , the mode is called deterministic111More precisely, this is the definition of strong determinism, see [3].. Otherwise it is called non-deterministic.
An evolution of under the mode is a sequence of states with the property that is obtained from by applying one of the sets of rules prescribed by the mode in state . This is usually written as . If no rules are applicable in state , it is called the halting state, and is called a halting evolution.
Example 4
Take and consider the following rules and , where is the Boolean tautology. Construct the Boolean P system . Informally, removes from a configuration which contains and , and removes from the configuration which does not already contain . A possible trajectory of under the maximally parallel mode—which applies non-extendable applicable sets of rules—is . Note that only is applicable in the first step, since requires the configuration to not contain . ∎
Remark 3
Boolean P systems as defined here are very close to other set rewriting formalisms, and in particular to reaction systems [8]. A reaction system over a set of species is a set of reactions (rules) of the form , in which is called the set of reactants, the set of inhibitors, and the set of products. For to be applicable to a set , it must hold that and . Applying such a reaction to yields , i.e. the species which are not explicitly sustained by the reactions disappear.
We claim that despite their apparent similarity and tight relationship with Boolean functions, reaction systems are not so good a fit for reasoning about Boolean networks as Boolean P systems. In particular:
Reaction systems lack modes and therefore non-determinism, which may appear in Boolean networks under the asynchronous Boolean mode. 2. 2.
The rule applicability condition is more powerful in Boolean P systems, and closer to Boolean functions than in reaction systems. 3. 3.
Symbols in reaction systems disappear unless sustained by a rule, which represents the degradation of species in biochemistry, but which makes reaction systems harder to use to directly reason about Boolean networks.
We recall that our main goal behind introducing Boolean P systems is reasoning about Boolean networks in a more expressive framework. This means that zero-overhead representation of concepts from Boolean networks is paramount. ∎
Remark 4
Reaction systems [8] are intrinsically interesting for discussing controllability, because they are defined as open systems from the start, via the explicit introduction of context. Note however that contexts only allow adding symbols to the configuration, not removing them. We refer to [15] for an in-depth discussion of controllability of reaction systems. ∎
3.2 Quasimodes
An update function in a Boolean network can always be computed, but a rule in a Boolean P system need not always be applicable. This is the reason behind the difference in the way modes are defined in the two formalisms: in Boolean networks a mode is essentially a set of subsets of update functions, while in Boolean P systems a mode is a function incorporating applicability checks. This means in particular that Boolean network modes are not directly transposable to Boolean P systems.
To better bridge the two different notions of modes, we introduce quasimodes. A quasimode of a P system is any set of sets of rules: . The mode corresponding to the quasimode is derived in the following way:
[TABLE]
Given a configuration of , picks only those sets of rules from which are also applicable to . Thus, instead of explicitly giving the rules to be applied to a given configuration of a P system , a quasimode advises the rules to be applied.
In the rest of the paper, we will say “evolution of under the quasimode ” to mean “evolution of under the mode derived from the quasimode ”.
4 Boolean P Systems Simulate Boolean Networks
Consider a Boolean network over the set of variables , and take a variable with its corresponding update function . The update function can be simulated by two Boolean P systems rules: the rule corresponding to setting to 1, i.e. introducing into the configuration, and the rule corresponding to setting to 0, i.e. erasing from the configuration:
[TABLE]
Consider now the following Boolean P system:
[TABLE]
We claim that faithfully simulates .
Theorem 4.1
Take a Boolean network and a Boolean mode . Then the Boolean P system constructed as above and working under the quasimode faithfully simulates : for any evolution of under there exists an equivalent evolution of under , and conversely, for any evolution of under there exists an equivalent evolution of under .
Proof
Take two arbitrary states and of such that is reachable from by the update prescribed by an element . Consider now the subsets of variables defined by and taken as the respective indicator functions. It follows from the construction of that it contains an element including the update rules for all the variables of : . Therefore, can derive from under the quasimode .
Conversely, consider two subsets of variables such that can derive from under the update prescribed by an element . By construction of , there exists a subset such that . Take now the indicator functions describing and respectively. Then can derive from by updating the variables in .
We conclude that the transitions of exactly correspond to the transitions of , which proves the statement of the theorem. ∎
Example 5
Consider the Boolean network from Example 2:
[TABLE]
This Boolean network can be translated to the Boolean P system with and the following rules:
[TABLE]
The first rule in ensures that is introduced whenever the current state satisfies , and the second rule ensures that is removed whenever the current state does not satisfy . Similarly, the rules in introduce or remove depending on whether the current state satisfies .
To simulate under the Boolean synchronous mode, should run under the quasimode , i.e. the quasimode allowing all rules in to be applied at all times. To simulate under the Boolean asynchronous mode, should run under the quasimode , i.e. the quasimode allowing the application of either both rules in , or both rules in , but not all 4 rules at a time. ∎
Remark 5
Incidentally, Boolean P systems also capture reaction systems (see also Remarks 3 and 4). Indeed, consider a reaction with the reactants , inhibitors , and products . It can be directly simulated by the Boolean P system rule , where . The degradation of the species in reaction systems can be simulated by adding a rule for every species , where is the Boolean tautology. ∎
5 Composition of Boolean P Systems
In this section, we define the composition of Boolean P systems in the spirit of automata theory. Consider two Boolean P systems and . We will call the union of and the Boolean P system . Note that the alphabets and , as well as the rules and are not necessarily disjoint.
To talk about the evolution of , we first define a variant of Cartesian product of two sets of sets and : . We remark now that
[TABLE]
Indeed, since the rules of Boolean P systems do not compete for resources among them, the applicability of any individual rule is independent of the applicability of the other rules. Therefore, the applicability of a set of rules of to a configuration is independent of the applicability of a set of rules of to .
For a mode of and a mode of , we define their product as follows:
[TABLE]
The union of Boolean P systems together with the product mode implement parallel composition of the two P systems. In particular, if the alphabets of and are disjoint, the projection of any evolution of under the mode on the alphabet will yield a valid evolution of under (modulo some repeated states), while the projection on will yield a valid evolution of under the mode (modulo some repeated states). Note that this property may not be true if the two alphabets intersect .
Quasimodes fit naturally with the composition of modes, as the following lemma shows.
Lemma 1
If the mode can be derived from the quasimode and from the quasimode , then the product mode can be derived from :
M_{1}\times M_{2}$$\tilde{M}_{1}\mathop{\dot{\times}}\tilde{M}_{2}$$\tilde{M}_{1}$$\tilde{M}_{2}$$M_{1}$$M_{2}
where a dashed arrow
from a quasimode to a mode indicates that the mode is derived from
the quasimode, and the arrows are
the respective projections.
Proof
Consider the mode derived from :
[TABLE]
Pick an arbitrary element and remark that it can be seen as a union where is a subset of applicable rules with the property that , and is a subset of applicable rules with the property that . Thus and , implying that
[TABLE]
Consider on the other hand an arbitrary and an arbitrary . By definition of , . Remark that every rule in and is individually applicable, meaning that they are also applicable together and that . Combining this observation with the reasoning from the previous paragraph we finally derive:
[TABLE]
which implies that and concludes the proof. ∎
6 Boolean P Systems for Sequential Controllability
Underlying sequential controllability of Boolean control networks (Section 2.5) is the implicit presence of a master dynamical system emitting the control inputs to the network and thereby driving it. This master system is external with respect to the controlled BCN. The framework of Boolean P systems is sufficiently general to capture both the master system and the controlled BCN in a single homogeneous formalism. In this section, we show how to construct such Boolean P systems for dealing with questions of controllability.
Any BCN can be written as a system of propositional formulae over . First, note that a control can be described by the conjuction . Now fix an and consider the formula
[TABLE]
in which enumerates all the conjuctions corresponding to the controls in and is the propositional formula of the update function which associates to . Using (1), we can translate any BCN into the system of Boolean functions and use the set from Section 4 to further translate the individual components of to pairs of Boolean P system rules. Denote the Boolean P system whose set of rules is precisely the union of the sets mentioned above, for . Finally, construct the Boolean P system with the following rules whose guards are always true:
[TABLE]
Suppose now that the original BCN runs under the mode , and consider the corresponding quasimode , as well as the quasimode
[TABLE]
Every element is a union of and a subset of , meaning that enables all rules removing the control inputs, and enables some of the rules adding back control inputs.
We claim that the Boolean P system running under the quasimode faithfully simulates the BCN running under the mode . The following theorem formalizes this claim.
Theorem 6.1
Consider a BCN running under the mode . Then the Boolean P system constructed as above and running under the quasimode faithfully simulates :
For any evolution of under there exists an equivalent evolution of under . 2. 2.
For any evolution of under there exists an equivalent evolution of under .
Proof
(1) Consider two states and a control such that reaches from in one step. Take and by respectively taking , , and as indicator functions. Then, as in Theorem 4.1, there exists an such that reaches from in one step. This follows directly from the construction of the rules in and from the fact that contains exactly the symbols corresponding to the control inputs activated by .
Take now and remark that its elements are of the form , where and . Under such an element , reaches a state from in one step, where contains the symbols from introduced by the rules selected by . Further note that all elements of are always erased by the rules , but may be immediately reintroduced by .
Suppose now that reaches from in multiple steps. Then reaches from in the same number of steps, provided that is always chosen such that the rules it selects reintroduce exactly the subset . If reaches from in multiple steps, but the control evolves as well, it suffices to choose such that it introduces the correct control inputs before each step. Finally, the control applied in the first step of a trajectory of must be introduced by setting the starting state of to , where corresponds to the initial state of the trajectory of .
(2) The converse construction is symmetric. A state of is translated into the state and the control corresponding to . A step of under is translated to applying to and updating the variables corresponding to the rules activated by . In this way, for any trajectory of under the quasimode there exists a corresponding trajectory in the controlled dynamics of . ∎
We now give an extensive example showing how the composite system from the proof above is constructed for a concrete BCN, and detailing how simulates its sequentially controlled trajectories.
Example 6
Consider the BCN from Example 3 with the following update functions modified to include the control inputs:
[TABLE]
and recall that and . Since the control inputs are already explicitly present in the propositional formulae, we can put these together directly to obtain , bypassing equation 1.
Construction of
First construct the Boolean P system with the following rules:
[TABLE]
Now, define with the following rules:
[TABLE]
Suppose that runs under the synchronous mode. This is translated into the quasimode for the Boolean P system . The quasimode for will be as follows:
[TABLE]
Finally, the composite P system will run under the following quasimode:
[TABLE]
Simulation of sequential control.
The 3 controls introduced in Example 3 can be written as sets in the following way:
[TABLE]
The trajectory of will be simulated as the following evolution of :
[TABLE]
where the rules to be applied in each transition are picked from the set . Note how is explicitly included as a set of symbols in the configuration of the composite Boolean P system .
Similary, the trajectory of will be simulated as follows:
[TABLE]
where the rules to be applied in each transition are picked from the set . Note how all symbols corresponding to control inputs are removed at every step, and then specifically the control inputs from are reintroduced.
Finally, the trajectory of will be simulated as follows by :
[TABLE]
To simulate the final trajectory under the control sequence , we glue together the final and the initial states of the above simulations, always anticipating the control from the subsequent simulation:
[TABLE]
Underlined elements are the states in which the control inputs change. Thus, the transition for example is governed by the set of rules already, instead of which was used in the first step. ∎
The component in the composite P system of Theorem 6.1 and Example 6 is an explicit implementation of the master dynamical system driving the evolution of the controlled system . The setting of this theorem captures the situation in which the control can change at any moment, but can be designed to implement other kinds of control sequences. We give the construction ideas for the kinds of sequences introduced in [24]:
- •
Total Control Sequence (TCS): all controllable variables are controlled at all times.
The quasimode of will be correspondingly defined to always freeze the controlled variables: , where with the property that for every every set either introduces or , but not both.
- •
Abiding Control Sequence (ACS): once controlled, a variable stays controlled forever, but the value to which it is controlled may change.
The rules of will be constructed to never erase the control symbols which have already been introduced, but will be allowed to change the value to which the corresponding controlled variable will be frozen: , with the new set of rules defined as follows:
[TABLE]
will able to rewrite some of the control symbols, or to introduce new control symbols: .
7 Reachability in Boolean P Systems
In this section we focus on reachability in Boolean P systems, which we define in the following way: given a Boolean P system , a mode (or a quasimode ), a set of starting states and a set of target states , decide whether an evolution of exists under the mode (or the quasimode ) driving it from each state in to some state in . We refer to such a decision problem by the 4-tuple , where may be a mode or a quasimode. In the rest of the paper, we will mainly deal with reachability under quasimodes.
Remark 6
Unlike the CoFaSe problem in which the synchronous mode is implicitly assumed, we explicitly include here the mode or the quasimode into the reachability problem. Indeed, the size of the quasimode may be as much as exponential in the number of symbols, while the complexity of a mode may be even bigger, since it depends on the current configuration. Furthermore, the mode choice impacts the answer of the problem. For example the problem under the quasimode has a solution if and only if .
In this section we will show that the reachability problem for Boolean P systems is \PSPACE-complete. We start by showing that this reachability problem is at least as hard as \LBAACCEPTANCE.
Lemma 2
* is reducible in polynomial time to reachability for Boolean P systems.*
Proof
We will first show how to construct a Boolean P system simulating a given LBA, and will then evaluate the size complexity of the construction.
Construction.
Let be an LBA and an input word of length . We construct in polynomial time a Boolean P system that simulates the computation of on the input . The alphabet of contains the following symbols
[TABLE]
where the symbols describe which symbols appear in which tape cells of and describes the position and the state of the LBA head. More precisely:
- •
represents the situation in which cell contains the symbol ,
- •
represents the situation in which the head is on cell and in state .
We construct the rules of as the union , where each instruction of is simulated by a set of Boolean P system rules in the following way, depending on the direction of the movement of the head:
[TABLE]
The evolution of is governed by the quasimode . Due the form of the left-hand sides of the rules above, if the current state contains exactly one state symbol of the form , at most one rule in will be applicable.
We finally define the singleton set of target states:
[TABLE]
The only state appearing in therefore corresponds to the halting configuration of in which all tape cells are blank except cells 0 and which contain the left and right end delimiters and respectively, and the head is on cell 0 and in state .
It is a direct consequence of the definition of the rules in that the LBA accepts a word of length if and only if the reachability problem has a solution, where .
Complexity.
The number of symbols in is and the number of rules is , so the Boolean P system can be constructed in time .
Since is a singleton and its only element is of cardinal , the quasimode can be constructed in time —roughly, the number of rules times the number of bits necessary to describe a rule. Because there is only one starting state and one target state, and since a state can be described by a sequence of symbols ( for the tape and for the state of the head), the whole description can be constructed in the following time:
[TABLE]
This expression is polynomial in the size of the specification of and in the length of the input , which concludes the proof. ∎
We will now show the symmetrical statement that reachability in Boolean P systems is at most as hard as \LBAACCEPTANCE.
Lemma 3
Reachability for Boolean P systems is in \PSPACE.
Proof
We will prove that reachability for Boolean P systems is in \NPSPACE, which implies the required statement by Savitch’s theorem [29].
Let , with , be an instance of the reachability problem. Algorithm 1 is a non-deterministic algorithm that solves this problem in polynomial space.
The function takes a state of and an element of a quasimode , and returns the state updated according to the rules of and the chosen element of the quasimode, as defined in Section 3.2.
Since the number of possible states of is , the shortest evolution between two states is of length at most , if it exists. Algorithm 1 therefore non-deterministically tests all possible evolutions of length at most , starting from all states in . At the end Reachable gets the value true if and only if a state in can be reached from every state in , which ensures the correctness of the algorithm.
This algorithm runs in polynomial space in the size of the reachability problem. Note that several states, a counter up to , and Boolean flags are stored, all of which takes up space. Furthermore, the function can be evaluated in polynomial space. Indeed, to determine the set of applicable rules in a state , one needs to check for each rule if the guard is true and if the left part of the rule is present in . Both operations, the evaluation of a Boolean function and a comparison, can be carried out in polynomial space with respect to . Only the rules in are then applied, and these applications can be carried out in polynomial space with respect to and . ∎
Remark 7
The argument of Lemma 3 focuses on reachability under quasimodes. This argument can be trivially extended to modes derivable from quasimodes, and more generally to any mode for which non-deterministically picking a set of rules to apply can be done in polynomial space.
The following theorem brings together Lemmas 2 and 3 to show the main result with respect to the complexity of reachability.
Theorem 7.1
Reachability for Boolean P systems is \PSPACE-complete
8 Complexity of Sequential Controllability
In this section we first extend the CoFaSe problem with some additional details necessary to properly reason about its complexity, and then show that sequential controllability of BCN is \PSPACE-complete.
8.1 CoFaSe and Control Modes
Theorem 6.1 shows that Boolean P systems can directly simulate Boolean networks together with the master control system, and Theorem 7.1 shows that reachability for Boolean P systems is \PSPACE-complete. Nevertheless, we cannot immediately conclude that CoFaSe is \PSPACE-complete because of the role modes and quasimodes play in evaluating the size of the problem.
Consider a BCN with the variables and the control inputs , and recall that the CoFaSe problem is given by the triple , where are the sets of starting and target states respectively. The simulating Boolean P system constructed in Theorem 6.1 uses the quasimode
[TABLE]
for which , meaning that size of the reachability problem for is always exponential in the size of , independently of the sizes of the individual elements of the triple 222In general, the description of is of size , because some Boolean functions may require an exponential number of Boolean connectors , , to be represented. and are of size by their definition. In practice, however, the sizes of these entities are often well under the respective upper bounds [6, 24].. As a consequence, directly combining Theorems 6.1 and 7.1 is not guaranteed to yield a polynomial bound on space in terms of the size of the CoFaSe problem .
We believe that the correct way to deal with this issue is to include a specification of the master system emitting the controls into the description of the problem of sequential controllability. Indeed, CoFaSe is formulated for the situation in which the control can change at any moment [24], and this information is not explicitly included in its definition, while it is explicitly present in the P system from Theorem 6.1.
We propose to describe the possible changes in controls by defining a relation on —the control mode. A control mode for a BCN is a relation describing the possible evolutions of control inputs. More precisely, consider the following trajectory of the BCN :
[TABLE]
This trajectory complies with the control mode if and only if , for every .
Example 7
Control modes naturally capture the types of control sequences given at the end of Section 6 and initially discussed in [23]. To streamline the definitions of the corresponding control modes, we introduce the following helper function:
[TABLE]
In other words, idx produces the set of control input indices appearing in a control , irrespectively of the nature of the control input (freeze to 0 or freeze to 1).
We can now define the control mode capturing Total Control Sequences (TCS) as follows:
[TABLE]
Informally, includes all those pairs of controls which act on every single controlled variable by activating one of the corresponding control inputs.
Similarly, in the case of Abiding Control Sequences (ACS), the control mode can be defined as follows:
[TABLE]
Thus, only allows to transition from to if acts at least on the same controlled variables as . Note that is allowed to change the value to which a controlled variable is frozen by replacing by or vice versa. ∎
We now define an extension of CoFaSe to capture sequential controllability of BCN in a more general framework. The problem is given by the 5-tuple and consists in deciding whether for every starting state in there exists an initial control and a trajectory of the BCN under the mode and the control mode ending up in a target state from . must appear as a the first term in at least a pair of : .
Example 8
Consider the Boolean network described in Figure 3, as well as the controls , freezing both and to 1, and . If then the following trajectory is possible:
[TABLE]
Suppose now that only freezing or separately is permitted, i.e. , for any appearing in a pair in . In this case, can reach or from by respectively controlling or to 1. The following 3 scenarios are possible afterwards:
maintain the control of or and stay in the same state; 2. 2.
freeze the other variable— if was controlled or if was controlled, and switch to the other state— or respectively; 3. 3.
release all controls and go back to .
In any of these cases, is not able to reach with the above restriction on the control mode.
Finally, suppose that once is employed, it must be maintained for the rest of the trajectory, i.e. . The previous paragraph shows that the only way for to leave the connected component consisting of the states while starting from is to apply . On the other hand, since cannot be deactivated once applied, this means that cannot reach from with this restriction on the control mode. ∎
8.2 and CoFaSe Are \PSPACE-complete
We start by combining Theorems 6.1 and 7.1 to characterize the complexity of .
Theorem 8.1
* is \PSPACE-complete.*
Proof
is \PSPACE-hard, since by taking it is reduced to the problem of reachability for Boolean networks, known to be \PSPACE-complete [7, 23].
Let now be an instance of and consider the following set of rules:
[TABLE]
as well as the quasimode . The Boolean P system running under the quasimode will therefore simulate the changes in controls allowed by the control mode .
We can now construct the reachability problem in the same way as in Theorem 6.1. The entire construction, including that of , happens in polynomial time with respect to the size of the initial instance of . This allows us to conclude the proof by invoking the fact that reachability in Boolean P systems is \PSPACE-complete (Theorem 7.1). ∎
As explained in the previous section, being \PSPACE-complete does not immediately imply that CoFaSe is \PSPACE-complete, since translating from CoFaSe to may require exponential increase in space. However, it is possible to directly prove that CoFaSe is in by using a variation of Algorithm 1 from Lemma 3.
Theorem 8.2
CoFaSe is PSPACE-complete.
Proof
Similarly to the proof of Lemma 3, we show here a non-deterministic polynomial-space algorithm solving the instance of CoFaSe given by the triple : Algorithm 2.
Algorithm 2 has very similar properties to Algorithm 1. Note that no requirements on the values of control inputs are imposed in the CoFaSe problem, meaning that only the state space needs to be explored, excluding the control inputs. Since , exploring trajectories of length at most is sufficient to conclude about the reachability of a state in for all states in .
Algorithm 2 stores a constant number of intermediate states and controls, a counter up to , and Boolean flags, all of which takes up space. Furthermore, can be computed in polynomial space in and , meaning that Algorithm 2 requires polynomial space in the size of the triple . Finally, we conclude the proof by invoking Savitch’s theorem [29], stating that . ∎
9 Conclusion and Discussion
We structure the conclusion into three subsections, focusing on three main takeaways and future research directions stemming from this paper.
9.1 Complexity of Sequential Controllability
The central technical result of this work is proving that sequential controllability of Boolean control networks (BCN) is \PSPACE-complete, thereby closing the question left open in [24]. One important intuition that this result yields is that sequential controllability of BCN is not in fact harder computationally speaking than simple reachability, in spite of the much heftier two-level setup with a master dynamical system driving the Boolean network. While no explicit construction is given, it is to be expected that the evolution of a BCN under a control sequence may be simulated by a Boolean network, modulo a polynomial transformation. This implies that reasoning about sequential controllability is as hard as reasoning about pure reachability in Boolean networks, opening a promising direction of future work about using the most permissive semantics [26] for sequential controllability of BCN.
We stress nevertheless that sequential controllability and reachability being in the same complexity class does not necessary imply that the techniques for efficiently solving reachability in practical situations can be immediately transposed to controllability. Exploring such possibilities is an important direction for future research on sequential controllability of BCN.
While we extensively deal with CoFaSe in this work, it should be noted that the ConEvs semantics explored in [24] is not treated. The ConEvs semantics of the control sequence constraints the moments at which the control may change to the stable states of the driven Boolean network. This places the master system in a feedback loop with the driven network and changes the architecture substantially. In particular, the computational complexity of sequential controllability under the ConEvs semantics still remains to be characterized.
9.2 Boolean P Systems
Most of the technical results presented in this paper are obtained via Boolean P systems, a framework specifically designed for dealing with sequential controllability in Boolean networks. We particularly emphasize one of our central goals: designing ad hoc formalisms very tightly suited for a specific problem and thereby giving new relevant viewpoints.
One of the advantages in relying on Boolean P systems is that the language of individual rules is more flexible than that of propositional formulae in Boolean networks. In particular, having set rewriting directly available allows for naturally expressing the notions of adding, removing, or depending on resources, while the propositional guards allow for easy checking of Boolean conditions whenever necessary. These two ingredients shine in Section 6, in which we show how a Boolean P system can capture both the BCN and the master dynamical system emitting the control inputs. On the other hand, we construct Boolean P systems without indulging too much into computationally expensive ingredients, which keeps the complexity of reachability in \PSPACE.
We would like to dwell specifically on the difference between Theorems 8.1 and 8.2, in particular on the fact that the latter directly shows that CoFaSe is in \PSPACE, completely eliding Boolean P systems. First, remark that Theorem 8.1 showing that is -complete is in fact more general, as it holds for any mode and for any control mode, incorporating en passant different kinds of control sequences such as TCS, ACS, etc. Secondly, remark that Algorithm 2 in Theorem 8.1 is directly derived from (and is a special case of) Algorithm 1 in Lemma 3, which arguably needed some general framework like Boolean P systems to be conceived.
Going back to the ConEvs semantics mentioned in the previous subsection, we expect that considering it in the framework of Boolean P systems will bring new valuable insight both concerning the characterization of its complexity and its other properties, as well as possible optimizations for specific use cases. Observe that ConEvs cannot be captured as a control mode, because it introduces a backward dependency of the control sequence on the state of the BCN. Boolean P systems on the other hand should allow to express this feedback elegantly, since the master system and the driven system are both part of the same composite system (Theorem 6.1), and can therefore communicate both ways. In fact, just from this informal reasoning we can make a conjecture with respect to the upper bound on the complexity of sequential controllability under the ConEvs semantics.
Conjecture 1
Sequential controllability of BCN under the ConEvs semantics is in \PSPACE.
Finally, we stress once again the point of Remark 3: while Boolean P systems are very closely related to reaction systems [8], they have distinctive features which make them a much better fit for reasoning about sequential controllability—specifically, explicit Boolean guards and permanency of the resources.
9.3 Lineage of (Polymorphic) P Systems, Homoiconicity, and
Lisp
As we have already insisted, one central point that we bring forward with this work is conceiving ad hoc formalisms specialized for solving particular problems. This approach is partially inspired by the venerable Lisp family of programming languages, and more particularly by language-oriented programming—a methodology proposing to start solving problems by developing specifically-tailored programming languages—domain-specific languages or DSLs [10, 32].
When adopting this approach, it is important that such bespoke constructions be done within a particular general framework, lest the design costs grow too high and the new formalisms too obscure. In this paper, we promote P systems as such a general framework. The community around this model of computing has been producing a wide spectrum of variants, a far-from-exhaustive glimpse of which can be seen in [2, 14, 21, 28]. The rich body of literature provides many ingredients and various tools for easily assembling different new formalisms. This is why we believe that P systems are particularly well suited for the ad hoc formalism methodology.
We conclude this work by underlining that Boolean P systems are far from being a frontier of how far one can go in designing specialized formalisms. We recall as an example polymorphic P systems [4], in which the rules are given by pairs of membranes rather than being part of the static description of the system, as is classically done in automata and language theory. Polymorphic P systems thus implement a form of homoiconicity—code-as-data, similarly to the Lisp languages. A lot more can be done in terms of customizing P systems, and we expect to see and invest further effort into the ad hoc formalism methodology.
Acknowledgements
All authors are grateful to Laurent Trilling from Université Grenoble Alpes for fruitful discussions.
Artiom Alhazov acknowledges project 20.80009.5007.22 “Intelligent information systems for solving ill-structured problems, processing knowledge and big data” by the National Agency for Research and Development.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Artiom Alhazov, Rudolf Freund, and Sergiu Ivanov. P versus B: P systems as a formal framework for controllability of boolean networks. In Henning Bordihn, Géza Horváth, and György Vaszil, editors, Proceedings 12th International Workshop on Non-Classical Models of Automata and Applications, NCMA 2022, Debrecen, Hungary, August 26-27, 2022 , volume 367 of EPTCS , pages 28–48, 2022.
- 2[2] Artiom Alhazov, Rudolf Freund, and Sergiu Ivanov. Polymorphic P systems: A survey. Technical report, Bulletin of the International Membrane Computing Society, December 2016.
- 3[3] Artiom Alhazov, Rudolf Freund, and Kenichi Morita. Sequential and maximally parallel multiset rewriting: reversibility and determinism. Nat. Comput. , 11(1):95–106, 2012.
- 4[4] Artiom Alhazov, Sergiu Ivanov, and Yurii Rogozhin. Polymorphic P systems. In Marian Gheorghe, Thomas Hinze, Gheorghe Păun, Grzegorz Rozenberg, and Arto Salomaa, editors, Membrane Computing - 11th International Conference, CMC 2010, Jena, Germany, August 24-27, 2010. Revised Selected Papers , volume 6501 of Lecture Notes in Computer Science , pages 81–94. Springer, 2010.
- 5[5] Albert-László Barabási, Natali Gulbahce, and Joseph Loscalzo. Network medicine: a network-based approach to human disease. Nature reviews. Genetics , 12:56–68, 2011.
- 6[6] Célia Biane and Franck Delaplace. Causal reasoning on Boolean control networks based on abduction: Theory and application to cancer drug discovery. IEEE ACM Transactions on Computational Biology and Bioinformatics , 16(5):1574–1585, 2019.
- 7[7] Allan Cheng, Javier Esparza, and Jens Palsberg. Complexity results for 1-safe nets. Theoretical Computer Science , 147(1):117 – 136, 1995.
- 8[8] Andrzej Ehrenfeucht and Grzegorz Rozenberg. Reaction systems. Fundamenta Informaticae , 75(1):263–280, 2007.
