Engineering Basal Cognition: Minimal Genetic Circuits for Habituation, Sensitization, and Massed–Spaced Learning
Jordi Pla-Mauri, Ricard Solé

TL;DR
This paper explores how simple genetic circuits in single cells can mimic basic learning behaviors like habituation and sensitization, using synthetic biology.
Contribution
The study introduces minimal synthetic genetic circuits that replicate nonassociative learning behaviors in unicellular organisms.
Findings
Synthetic circuits were designed to reproduce habituation and sensitization in unicellular systems.
The circuits also replicate the massed–spaced learning effect using regulatory elements and quorum-sensing molecules.
The work highlights differences in temporal dynamics between gene-based and neural learning systems.
Abstract
Cognition is often associated with complex brains, yet many forms of learningsuch as habituation, sensitization, and even spacing effectshave been observed in single cells and aneural organisms. These simple cognitive abilities, despite their cost, offer evolutionary advantages by allowing organisms to reduce environmental uncertainty and improve survival. Recent studies have confirmed early claims of learning-like behavior in protists and slime molds, pointing to the presence of basal cognitive functions long before the emergence of nervous systems. In this work, we adopt a synthetic biology approach to explore how minimal genetic circuits can implement nonassociative learning in unicellular systems. Building on theoretical models and using well-characterized regulatory elements, we design and simulate synthetic circuits capable of reproducing habituation, sensitization, and the…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
1
2
3
4
5- —Ministerio de Ciencia, Innovaci?n y Universidades10.13039/100014440
- —Ag?ncia de Gesti? d'Ajuts Universitaris i de Recerca10.13039/501100003030
- —European Social Fund Plus10.13039/501100004895
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSlime Mold and Myxomycetes Research · Plant and Biological Electrophysiology Studies · Planarian Biology and Electrostimulation
Introduction
1
Life on Earth has evolved in multiple directions, with major evolutionary events defining the rise of novelties, such as the transition from unicellular to multicellular or the emergence of language. ?,? A general trait shared by most of these transitions is the emergence of new types of agents capable of dealing with environmental uncertainties in novel ways. Among others, the evolution of neural agents represents a revolutionary path toward neural networks and brains.? The rise of cognitive structures and brains largely begins with the Cambrian explosion event, where we can see the rapid evolution of animals equipped with mobile parts and sensors.? Using a special class of cells, the neuron, it was possible to develop mechanisms to store and process information reliably.? In this context, learning might have played a crucial role in the development of complex brains.?
Cognitive structures are costly, from sensors and actuators to the whole brain. How can evolution favor their emergence? The answer is that mastering time, i.e., storing past information that can be used to predict the environment, can have a high return. In other words, reducing uncertainty has a high pay-off. Not surprisingly, memory is a widespread feature of most multicellular life, and several kinds of learning mechanisms have been described.? These include two well-known processes, namely habituation and sensitization, which have been known for thousands of years? as well as associative learning. However, these features are not limited to multicellular systems and were identified by several authors regarding the behavioral responses of individual cells.? This is the case of Stentor (Figurea), a single-cell protozoan ?,? which was shown to exhibit an enhanced response to subsequent milder stimuli, that is, habituation. This indicated that it was learning to associate the intense stimulus with a potential danger, resulting in increased sensitivity. Similarly, Stentor was also shown to exhibit a reduced response under repeated exposure to the same stimulus, indicating that it was learning to recognize the stimulus as nonthreatening or irrelevant. In addition, sensitization, when an organism becomes more sensitive to a stimulus after experiencing an intense or aversive event, was also observed.
Embodiments for basal cognition. Simple forms of learning (associative and nonassociative) in simple biological systems can be observed in free-living unicellular life forms, such as the protozoan Stentor (a), the Physarum polycephalum slime mold (b), but can also be found in non-neural tissues, such as hepatic cells (c), and has been traditionally reported in neural systems on the small, synaptic scale (d).
Although Jennings’ work was questioned due to reproducibility issues, recent studies confirmed (and expanded) his key results, uncovering a hierarchy of decision-making actions. ?−? ? ? Both habituation and sensitization, along with other forms of learning, have also been found in Physarum polycephalum (Figureb). This aneural organism is a giant multinucleated cell that can extend up to hundreds of square centimeters. It has been shown to exhibit complex, optimal self-organized patterns and solve a wide range of tasks that involve complex computational decisions. ?−? ? ?
Basal cognition encompasses the fundamental processes and mechanisms that enable organisms to detect certain environmental states and respond appropriately to ensure survival (such as finding food and avoiding danger) and reproduction long before the evolution of nervous systems. Modern cells have complex molecular networks that can participate in cognitive tasks, from detecting and responding to chemical cues to actively exploring their worlds in space and time. An important aspect of their behavioral repertoires is the presence of memory and learning characteristics. They are, in fact, strong prerequisites to build a behavior. In recent years, renewed interest has emerged in this area, including dedicated efforts to operationally reframe cognition.?
Theoretical and computational studies on these simple forms of learning have recently been developed, searching for potential circuits capable of implementing them within cells, particularly in terms of habituation. ?,? There is an alternative approach to this problem from an engineering perspective. In general, while much understanding of major evolutionary transitions has been gained through molecular phylogenetics or comparative analysis and paleobiological data, we can also address the problem by recreating these evolutionary events and their precursors using (among other approaches) synthetic biology.? In this context, the design of genetic circuits and the potential of tissue bioengineering can help understand the constraints associated with biological complexity. This includes morphology, ?−? ? multicellularity, ?,? ecology, ?,? biorobots? or collective intelligence,? among other problems.
Previous studies have addressed the problem of classical conditioning (associative learning) circuits based on transcriptional network systems, including several proposals for candidate molecular circuits. ?−? ? ? ? In this paper, we take the “synthetic” approach to consider how simple genetic circuits could reliably implement nonassociative learning, thus involving a single type of stimulus. This will include the two previous case studies (habituation and sensitization) and the so-called massed–spaced learning effect. The latter has traditionally been discussed within the psychology literature and involves scenarios in which long-term memory is enhanced when learning events are spaced apart in time, rather than occurring in immediate succession. However, recent studies have shown that it can also be present in single human cells,? suggesting that this learning effect might also operate on a small scale within whole bodies.
Our implementations rely on well-characterized genetic parts and regulatory mechanisms, making them amenable to experimental realization. The designed circuits proposed here will include repressors, fluorescent reporters, and quorum-sensing molecules as key components, allowing external control of the input signal and measurement of the average output at the population level. While the specific examples are tailored for E. coli , the underlying regulatory motifs are abstract, modular designs rooted in transcriptional logic and are not inherently limited to bacterial systems. By following our synthetic approach, there is a marked difference between the gene networks that implement learning and their synaptic counterparts: a much slower time scale. This difference will be relevant to our discussion on the role that learning plays in single-cell organisms and the constraints imposed on engineering designs.
Results
2
Habituation
2.1
Habituation represents one of the simplest and most ubiquitous forms of nonassociative learning, characterized by a gradual decrease in the behavioral or physiological response after repeated exposure to a stimulus that is perceived to be neither harmful nor beneficial.? This phenomenon is not merely passive fatigue or sensory adaptation, but an active process. It has been extensively studied across various speciesfrom Aplysia to humansdemonstrating its evolutionary conservation and functional importance in adaptive behavior.?
This fundamental mechanism of behavioral adaptation allows organisms to efficiently filter out irrelevant or nonthreatening stimuli from their environment, such as background noise or persistent visual cues. By reducing responses to familiar inputs, habituation enables the organism to conserve cognitive and energetic resources, which can then be redirected toward detecting and responding to novel or potentially significant environmental changes.? In this way, habituation plays a crucial role in attentional modulation and information processing, helping organisms prioritize survival-relevant stimuli.
At its core, habituation requires a responsive system capable of detecting external stimuli and modulating its sensitivity over time. This modulation typically involves negative feedback mechanisms that accumulate with repeated stimulation. For example, repeated activation of postsynaptic receptors may lead to internalization of these receptors or alterations in second-messenger signaling pathways, effectively raising the threshold required to elicit a response.? Such mechanisms enable the system to adaptively ignore persistent signals while remaining sensitive to new or changing inputs.
Figurea sketches the logic of our proposed minimal circuit, and in Figureb a genetic design is shown as a realization of an incoherent feed-forward loop (I-FFL), a motif previously discussed in the literature. ?,?,? Alternative designs based on negative feedback topologies for the receptor are discussed in the Supporting Information (Subsections S3.2 and S3.3), where they are shown to perform less optimally.
Synthetic circuit design and behavior for habituation in a single cell. In (a), the basic logic requirements for habituation are outlined in a causal diagram. In (b), an explicit synthetic circuit design is shown. A constitutively expressed LuxR-SsrA receptor binds to C6–HSL input molecules. This complex activates LacI repressor expression. The output, GFP-SsrA, requires both an absence of the repressor and the presence of the input-receptor complex for activation. In (c), a parameter space showing habituation strength, measured as a log2-fold change (FC) between the last and the first peak. Each shift in color corresponds to a unit change in FC. A dashed contour line is included to enhance visual contrast and delineates the threshold FC = −1, which signifies that the amplitude of the last peak has decreased to at most half that of the initial peak. In (d), a sample time-series simulation showing habituation response following eq . Vertical gray regions indicate the presence of an external stimulus x. Key molecular components of the circuit are shown on the right, along with their interactions. Protein structures were rendered using Illustrate. Parameters: α = 1.5, β = 5, γ = 10–2, λ = 1, μ = 1, Δτon = 1, Δτ = 10.
The proposed synthetic circuit senses and reacts to an external input signal x, such as the quorum-sensing molecule C_6_–HSL, which diffuses into the cell and binds to a constitutively expressed transcriptional regulator LuxR-SsrA, denoted X. The resulting ligand-bound complex X–x activates transcription of two key components: a memory I and an output G.
The memory component I, the repressor protein LacI in the example, is expressed under an inducible promoter activated by X–x, and its concentration accumulates over successive input pulses, serving as a biochemical integrator of past input activity. Output G, represented for simplicity as the fluorescent reporter GFP-SsrA, is expressed under a hybrid promoter that requires the presence of complex X–x for activation while simultaneously being repressed by I, which approximates N-IMPLY logic (x ∧ ¬I). However, rather than enforcing a sharp binary switch, intermediate concentrations of I lead to partial suppression of promoter activity, effectively reducing the output production rate.
All proteins except memory I contain a degradation tag that enhances their proteolytic turnover, resulting in a higher degradation–dilution rate. This difference in stability between sets of proteins establishes a difference of time scales, ideally allowing only untagged proteins to persist between input pulses.
The equations governing the system are as follows:
The key parameters include the maximal expression rates α, β, and μ. These parameters encapsulate the combined strength of a promoter and its RBS, allowing each to be independently tuned. The basal degradation–dilution rate γ governs the turnover of untagged proteins, while λ denotes the higher degradation rate of tagged proteins. Biologically plausible parameter values for the nondimensionalized model, along with a detailed discussion of their interpretability, are provided in the Supporting Information (Subsection S1.1).
Functions θ^±^(·) are normalized transfer functions of the promoters, defined as standard Hill form with half activation constant K 1/2 = 1, namely:
To assess the circuit’s capacity to habituate to a periodic pulsating signal, simulations are conducted starting from a steady state with no external input. A periodic external input x is introduced as an instantaneous pulse lasting for a duration Δτ_on_, followed by an input-free interval of duration Δτ_off_, allowing the system to relax. The total period of each cycle is therefore Δτ = Δτ_on_ + Δτ_off_. The trajectory is partitioned into a collection of n consecutive intervals T = where each t _ i _ corresponds to the onset of the i-th input pulse, and each interval has a constant duration Δτ = t _ i+1_ – t _ i _.
An example of the time-series resulting from this minimal circuit is shown in Figured, where the time-series for variables x, I, and G are displayed. The concentration of repressor I shows a growing trend over time, decreasing slightly during relaxation periods as its production halts while continuously decaying with rate γ. A habituation pattern is displayed by G with regular peaks displaying a decrease in amplitude with each successive input pulse.
The conditions required to guarantee the growth of the memory component can be analytically derived, given some approximations. Since a memory component is also relevant for other motifs discussed, a basic derivation is warranted (see also Supporting Information Section S2).
To this end, trajectories are partitioned into periodic intervals, each consisting of an active phase of duration Δτ_on_, during which an external input is applied, followed by a relaxation phase of duration Δτ_off_, where no input is present.
Assuming that the memory component does not approach saturationspecifically, that I(t) ≪ α/γ holds throughout the entire trajectoryand further assuming that the duration of each pulse is much shorter than the relaxation period (Δτ_on_ ≪ Δτ_off_), the dynamics of memory accumulation can be effectively approximated as a sequence of instantaneous increments occurring at the onset of each pulse, followed by continuous exponential decay between events.
Under the impulsive approximation, the temporal evolution of the memory component is governed by the following differential equation:
where δ(·) denotes the Dirac delta function, representing a sequence of N instantaneous inputs applied at discrete times t = kΔτ.
Assuming negligible initial conditions, memory dynamics are given by
where denotes the Heaviside step function, a piecewise function defined as 0 for negative arguments and 1 otherwise, and t _ k _ = k Δτ are the times where input pulses are applied.
The change in memory over a single interval can be expressed as
where I 0 represents the initial memory value just before the input is applied.
Since the net change in memory per cycle, ΔI, decreases monotonically as the relaxation duration Δτ_off_ increases, and since ΔI > 0 when Δτ_off_ = 0 (memory accumulates in the absence of extended downtime), it follows that there exists a critical threshold for Δτ_off_ beyond which no net memory gain occurs over successive stimulation cycles. Therefore, memory accumulation is possible only if the relaxation period satisfies the inequality:
As the number of events increases, N → ∞, the periodic steady state solution approximates:
oscillating with period Δτ ≈ Δτ_off_.
In Figurec a two-dimensional parameter space (α, γ), showing the adaptation strength, quantified as a fold change, FC, defined as the log_2_-transformed ratio between the peak response following the final stimulus and the peak response following the first stimulus:
where τ_1_ ∈ [t 1, t 2) denotes the time interval from the arrival of the first stimulus up to the arrival of the second stimulus, and τ_ n _ ∈ [t _ n _, ∞) denotes the time interval beginning with the arrival of the final stimulus and extending thereafter.
Two domains are clearly defined, with a boundary separating a phase where habituation occurs from another where habituation is not possible.
Robust habituation depends on memory I being able to hold its state between stimuli, which can be achieved by ensuring it degrades slowly (low γ). At the same time, increasing the memory production rate α makes habituation stronger by causing the system’s final response to be a smaller fraction of its initial output. However, a higher production rate also reduces the system’s peak response, creating a key trade-off: a stronger habituation strength comes at the cost of a lower maximum output. A more in-depth analysis of this trade-off is provided in Supporting Information Subsection S3.1.
Sensitization
2.2
Sensitizationalso called facilitationis another fundamental form of nonassociative learning that stands in direct contrast to habituation. While habituation involves a decrease in response after repeated exposure to a stimulus, sensitization is characterized by an amplification of the response after repeated or intense stimulation.?
This heightened responsiveness serves as an essential adaptive mechanism across a wide range of species, enabling organisms to prioritize and react more vigorously to potentially harmful or biologically significant stimuli.? Sensitization has been extensively studied in the context of defensive behaviors and nociceptive responses, where it enhances survival by promoting rapid responses to environmental threats. Beyond behavioral responses, sensitization manifests itself at multiple levels of biological organization, from whole-organism reflexes to synaptic plasticity and cellular signaling pathways.?
The molecular mechanisms that underlie sensitization are generally based on positive feedback loops or cascade amplification systems, which progressively strengthen in response to repeated stimulation. In this work, a simple synthetic design that recapitulates this behavior is presented, drawing upon elements from our earlier design for habituation. The key to this system lies again in leveraging the molecular memory provided by repressor I, which generates a signal that accumulates in the presence of input and decays slowly during the relaxation phase; this dynamic enables the circuit to effectively “remember” prior stimulations and respond more strongly upon repeated exposure. It is also worth noting that, in the context of synaptic plasticity, sensitization is often achieved through a basic motif derived from the habituation mechanism, with the addition of an extra interneuron.?
Building on the basic logic of sensitization illustrated in Figurea, a functional genetic circuit can be derived. This proposed circuit (Figureb) largely reuses the gene regulatory interactions from the previous habituation design but incorporates a few key modifications to establish a positive feedback loop.
Synthetic circuit design and behavior for sensitization in a single cell. In (a), the basic logic requirements for sensitization are outlined in a causal diagram. In (b), an explicit synthetic circuit design is shown. When the input molecule (C6–HSL) is present, it binds to its receptor (LuxR-SsrA), which then simultaneously activates both the repressor (LacI) and the circuit’s output signal (GFP-SsrA). Repressor TetR-SsrA, which controls LuxR-SsrA expression, is itself repressed by LacI, creating an indirect positive regulatory feedback loop that enables the sensitization behavior. In (c), a parameter space showing sensitization strength, measured as a log2-fold change (FC) between the last and the first peak. Each shift in color corresponds to a unit change in FC. In (d), a sample time-series simulation showing sensitization response following eq . Vertical gray regions indicate the presence of an external stimulus x. Key molecular components of the circuit are shown on the right, along with their interactions. Parameters: α = 1.5, β = 5, γ = 10–2, λ = 1, μ = 1, ρ = 1.75, Δτon = 1, Δτ = 10.
The circuit creates a positive feedback loop on receptor X (LuxR-SsrA) through a double repression cascade. In the presence of input x, the X–x complex drives expression of repressor I (LacI), which in turn represses expression of R (TetR-SsrA). Since R itself represses the production of X, the net effect is that X indirectly promotes its own expression and amplification. This design decouples the feedback strength through intermediate regulators, allowing tunable and graded amplification that depends on the stimulation history. By using this indirect mechanism, the system avoids some limitations inherent to a direct self-activation loop (see Supporting Information Subsection S4.2).
The output G (GFP-SsrA), which is expressed from a promoter directly activated by the X–x complex, is consequently also amplified. Thus, the production of G is dependent upon both the presence of the external input and the time-dependent concentration of X, ensuring input-gated activation and a sensitized response that reflects the system’s history of stimulation. An alternative design based on a coherent feed-forward loop (C-FFL), which offers similar performance and complexity, is discussed in the Supporting Information (Subsection S4.3).
The key to the memory effect is that only the repressor I has a slower degradation–dilution rate (γ). Its continued presence represses R, which in turn relieves repression of the receptor X promoter. The fast degradation rate (λ) of both X and R ensures their rapid turnover, with their concentrations rapidly approaching a quasi-steady-state set by the slower dynamics of I, enabling time scale separation and effective memory encoding.
The equations governing the system are as follows:
where α, β, μ, ρ represent independently tunable maximal expression rates, γ is the basal rate at which proteins dilute–degrade, and λ is the degradation rate of short-lived (tagged) proteins. As before, the constraint λ ≫ γ is assumed, creating a difference of time scales. This ensures that untagged proteins degrade rapidly, while other proteins accumulate within the cell over successive pulses.
An example of a time-series that displays sensitization is shown in Figured. As in the habituation example, the memory component I accumulates over timebut here, this leads to a decrease in repressor R, which otherwise inhibits production of X, effectively boosting receptor levels in proportion. This double inhibition loop effectively increases sensitivity to input x, yielding progressively larger peaks in output G.
Figurec shows the parameter space (α, γ) for the sensitization model, where the fold change from eq is used to display a well-defined boundary of sensitization responses (blue domain). The decrease in the fold change as the production rate of the memory molecule (α) increases is due to the faster accumulation of memory I, which more quickly saturates its corresponding transfer function. When α is high, the system further approaches this saturation limit during the first peak response, thereby limiting any further increase in the levels of the input receptor and the output protein (see Supporting Information Subsection S4.1 for details).
Combining
Sensitization and Habituation
2.3
Cognition is not limited to specific kinds of response, and a relevant question is how we can combine or modify our minimal motifs in such a way that multiple learning responses can be at work. This occurs with some simple neural circuits that can encode both increased and decreased responsiveness. This is the case of classic experiments in Aplysia californica, where a slight touch to the siphon triggers a gill-withdrawal reflex. ?,? After a strong stimulus such as a tail shock, this response becomes sensitized, with even light touches causing a stronger reaction. However, if light touch is repeated without reinforcement, the reflex gradually habituates, weakening over time. While sensitization involves enhanced neurotransmitter release, habituation results from reduced synaptic activity.
In this section, we briefly illustrate the possibility of combining sensitization and habituation responses within a single circuit to produce a hybrid response. This results in an output where an initial period of sensitization is followed by habituation.
Although one might expect this construct to require some nontrivial combination of the previous motifs, surprisingly, this more complex behavior does not require a fundamentally new design (Figurea). It can be achieved simply by reintroducing a single repressor interaction from the habituation circuit, the hybrid promoter from Figureb, into the sensitization design from Figureb. In this hybrid circuit, output G (GFP-SsrA) expression depends on both the presence of the input-bound receptor X–x and the absence of repressor I. The behavioral transition occurs when the inhibitory effect of I on output expression dominates over the activation pathway mediated by X–x.
Synthetic circuit design and behavior for sensitization–habituation in a single cell. In (a), an explicit synthetic circuit design is shown. In (b, c), the corresponding parameter spaces for the habituation and sensitization regimes are shown. The same indicator (FC) is used, but in this case, the first peak is compared to the highest peak to measure sensitization, and the last peak is compared to the highest peak to measure habituation. Each shift in color corresponds to a unit change in FC. In (d), the time-series for the model is shown, with sensitization followed by habituation, following eqs . The vertical gray shading denotes temporal intervals during which external stimulus x is present. Key molecular components of the circuit are shown on the right, along with their interactions. Parameters: α = 1.5, β = 10, γ = 10–2, λ = 1, μ = 1, ρ = 1.75, K = 1.5, Δτon = 1, Δτ = 10.
The number of pulses required to transition from sensitization to habituation is tuned by the relative sensitivity to I for the two promoters regulated by it. If the hybrid promoter requires a higher repressor concentration, a clearer separation between the two phases can be observed. This sensitivity can be experimentally engineered through mutagenesis or predicted in silico.
An alternative implementation, tweaking a C-FFL sensitization motif, is also possible by using two distinct sources for repressor I, with different dynamics: a tagged version, indirectly repressed by the input complex for sensitization; and an untagged version, directly expressed by the same complex for habituation (see Supporting Information S5.3).
The equations governing the system are as follows:
where α, β, μ, ρ represent independently tunable maximal expression rates, γ is the basal rate at which proteins dilute–degrade, and λ is the degradation rate of short-lived (tagged) proteins.
The function represents the repressor curve for the output. It is analogous to the repressor transfer function presented in eq, but with an increased half activation constant K,
This increase in K 1/2 shifts the system’s dynamic response to a longer characteristic time scale. Consequently, the point at which the system’s behavior shifts from sensitization to habituation occurs more slowly.
Figured illustrates an example time-series in which the memory component I accumulates progressively over time, as observed in earlier cases. As I builds up, receptor X increases, leading to an amplified response and successively larger peaks of output G. However, this positive feedback eventually saturates, even as I continues to accumulate. Beyond a critical threshold, further increases in I have a stronger inhibitory effect on G than the activation mediated by X, leading to a progressive decline in the peak amplitude of output G.
Figureb,c shows that the parameter regions for sensitization and habituation in the hybrid circuitdefined by a doubling or halving of the output peakresemble those of the individual circuits (Figuresc and ?c). Although these regions show partial overlap, the two behaviors cannot be simultaneously maximized. This stems from their divergent parameter dependencies: while both behaviors require a low memory degradation–dilution rate γ, habituation strengthens monotonically with the maximal expression rate α, whereas sensitization weakens beyond an optimal α value. This inherent trade-off is a direct consequence of sharing core circuit components for both motifs within a single, shared pathway.
Massed–Spaced
Learning
2.4
Our last example is the so-called massed–spaced learning (MSL). It refers to a cognitive strategy in which learning sessions are distributed over time (spaced learning) rather than concentrated over a short period (massed learning). This approach improves memory retention and recall by allowing time for consolidation between learning episodes. Massed–spaced learning could involve synaptic plasticity mechanisms at the single-cell level, where neurons strengthen or weaken their connections based on repeated but temporally spaced stimulation. Key processes such as long-term potentiation and synaptic tagging can be the basis for this effect, allowing individual neurons to encode and maintain learned information more effectively when spaced intervals optimize molecular and structural changes. It can be characterized on multiple scales, from synaptic changes to individual behavior, and manifests itself at both the behavioral and molecular levels.?
Although MSL is traditionally considered a neural process, a recent study suggests that similar behaviors may also be present in aneural systems, including single cells within a given tissue.? Their findings indicate thatunder controlled conditionsthese non-neural cells seem to retain information better when they are exposed to spaced intervals rather than all at once. The authors used an engineered non-neuronal reporter cell line capable of exhibiting spacing-dependent responses, offering not only increased experimental throughput to model memory formation, but also a platform to study cellular cognition outside the nervous system.
The logic architecture of MSL can be captured by a cascade of positive effects, as sketched in Figurea. Once again, a memory component is a requirement for learning capacity, but now the output response can be decoupled from the immediate presence of input events. This decoupling arises because the input stimulus does not directly drive output expression; instead, it triggers the synthesis of an intermediate transcriptional activator that persists transiently after stimulus withdrawal. Consequently, sustained output gene expression can occur as long as sufficient concentrations of this inducer remain, even in the absence of ongoing input. This behavior differs fundamentally from all previously described motifs, in which output synthesis occurs only while the input signal is present, and intermediate components modulate the amplitude of the response rather than directly drive output production.
Synthetic circuit design and behavior for massed–spaced learning in a single-cell. In (a), the basic logic requirements for massed–spaced learning are outlined in a causal diagram. In (b), an explicit synthetic circuit design is shown. Receptor LuxR senses the external input, which is then integrated by a downstream transcriptional activator (AraC) to drive expression of an output reporter gene (GFP). In (c), an example time-series of a response to a spaced signal with a delay Δt = 50 between consecutive signals, and N = 5 repetitions. Note that the response to the spaced signal reaches a higher maximum than that of the massed response (dashed line). In (d), the log2-transformed fold change (FC) of peak response with respect to the peak response for a massed signal. A signal of duration T = 10 is split into N equal pulses, separated by a delay Δt. When the signal is massed (N = 1), the interpulse delay does not influence the signal, as there is no sequence of pulses to space out. The colormap is centered at FC = 0, the reference (massed peak) value, with positive deviations in blue and negative deviations in red. Slices of the Fold Change over increasing delay and increasing repetitions are displayed in the insets (e, f). Parameters: α = 1, β = 1, γ = 10–2, μ = 1.
The MSL phenomenon can now be observed (under the right conditions) by dividing a sustained stimulus into a series of shorter stimuli separated by a delay Δt, in such a way that the cell can accumulate inducer molecules over time, leading to a higher overall response. However, as will be shown below, this enhancement works only when the intervals between stimuli are shorter than the characteristic decay time of the intermediate inducer. If the delay between stimuli exceeds this critical duration, the concentration of the inducer will diminish substantially before the arrival of the subsequent stimulus, thus negating the cumulative effect.
A minimal genetic implementation of this circuit, as shown in Figureb, consists of a linear feed-forward chain where a receptor protein X detects an external signal molecule x, leading to the production of an inducer A, which in turn controls the expression of the output protein G. Specifically, we suggest the input signal C_6_–HSL is integrated by accumulating an inducer, AraC, resulting in prolonged GFP expression that lingers after the input vanishes.
The equations governing the system are now:
where α, β, μ are different promoter strengths that can be tuned separately, and γ is the rate at which proteins dilute–degrade.
The dynamical response of the circuit is summarized in the time-series in Figurec. For a single, continuous input pulse (left column), the intermediate activator A accumulates rapidly during the stimulus period, followed by an exponential decay afterward. This transient accumulation produces a single maximum peak in the output, which quantifies the response efficiency. In contrast, when the same total input duration is divided into several evenly spaced pulses (right column), the concentration of A accumulates incrementally and decays only partially during the interpulse intervals. This pattern of spaced reinforcement results in a higher maximum peak amplitude of the output compared to the massed case, demonstrating the expected enhancement in response. The circuit is therefore capable of distinguishing between continuous (massed) and temporally spaced input patterns, analogous to the way neural systems differentiate between massed learning and learning reinforced through repeated, spaced exposures over time.
A parameter space illustrating the relative efficiency of distributing the input into N evenly spaced pulses, separated by interpulse intervals Δt, is shown in Figured.
The log_2_-transformed fold change in the peak response relative to the massed input case is used to quantify the enhancement due to input spacing:
where the case N = 1 corresponds to massed input, and no interpulse delay is therefore applicable. Positive values of FC indicate an increased peak response under spaced stimulation, with the magnitude reflecting the extent of enhancement compared to the massed case.
As expected, a region exists in which dividing and spacing the input pulses yields a positive gain; however, both parameters are interdependent and must be balanced.
For very short interpulse delays (Δt → 0), the difference becomes negligiblein this limit, the signal effectively behaves as a continuous (massed) input, provided N remains finite and physiologically realistic.
However, if either the number of pulse repetitions or the interpulse delay is excessively large, the peak response decreases. This reduction arises because an excessive number of pulses (large N) allocates insufficient time within each pulse for the accumulation of the intermediate activator, while a prolonged interpulse delay (large Δt) allows the activator concentration to decay significantly before the next pulse arrives.
To gain further insight into how the spacing of input pulses affects memory accumulation and downstream output, we consider a simplified version of the presented model, in which each input pulse fully saturates the receptor response and the output protein G is produced in an all-or-nothing manner, driven by a step activation threshold.
This can be modeled by the following simplified system:
where δ(t – t _ k ) denotes the Dirac delta function representing an impulsive input at discrete times t _ k _ = k Δτ_off. Consequently, the memory variable A exhibits instantaneous integration of each input pulse, followed by an exponential decay in the interval before the next input arrives. The expression of output G is governed by a Heaviside transfer function , which acts as an abrupt switch that yields 1 when memory is above a normalized threshold , A(t) ≥ 1, and 0 otherwise. Thus, production halts when the memory decays below the threshold.
Let N identical pulses be delivered instantly at equally distant intervals of size Δτ ≈ Δτ_off_, with total input strength α Δτ_on_. The state of A immediately after the k-th pulse, denoted A _ k _, follows from the balance between input and decay:
The duration of output production during the k-th interval, denoted , is determined by how long A(t) remains above the threshold, bounded by the arrival of the next pulse for all but the final interval.
Let t _ k _ ^*^ = γ^–1^ log(A _ k _) be the time expected for A _ k _ > 1 to decay to the threshold. Then
Thus, output dynamics are given by the addition of individual pulse contributions.
The duration of output production from the k-th pulse up to time t, denoted τ_ k _ ^+^(t), and the time elapsed since production halted, denoted τ_ k _ ^–^(t), are respectively defined as
The Heaviside function ensures that each component contributes only after its onset time t _ k _.
With β > γ, the output increases during each production interval and peaks at the end of each pulse . Given equally spaced pulses and zero initial output in the first interval, each subsequent pulse starts with a positive baseline due to incomplete decay from previous inputs. This leads to a nondecreasing sequence of peak outputs. Hence, the global maximum must be at the end of the final pulse, , since no prior peak can be higher due to the cumulative effect of residual output from earlier pulses.
Under the restriction that the input added in a single pulse is strong enough to sustain output production for the full relaxation phase. That is, if
then, splitting the input into N equally spaced pulses will yield a higher total output than a single, massed pulse. This assumption simplifies the following analysis, though it should be noted that it defines a strict subset of the parameter space where spaced pulses are advantageous. For a more detailed analysis, see Supporting Information (Subsection S6.1).
For a massed pulse (N = 1), the output production halts when A(t 0 ^*^) = 1, giving an output production duration:
which yields a maximum output:
When, instead, the input is split into two equal pulses (N = 2), the first half-strength pulse decays for Δτ_off_ until the second pulse arrives (per the assumption in eq), after which the memory jumps to
Production then halts at:
Thus, the combined active phases produce:
Under the previous assumptions, the region where spaced pulses are strictly better than a massed pulse is
which simplifies algebraically to the condition:
This holds when γ Δτ_off_ > 0, ensuring that e^–γΔτ_off_ ^ < 1. Results showing that this property also holds for the general case N ≥ 2 under the same assumptions are provided in the Supporting Information (Subsection S6.1).
Discussion
3
Cognition can be defined as the acquisition, processing, storage, and use of information to generate or modulate behavior. Some single-cell organisms, from bacteria to protozoans or Physarum molds, are known to be able to perform a diverse range of computational tasks, including simple forms of learning. The study of basal cognition has deeply enlarged our comparative analysis of possible cognitions and raises some deep questions: How common is cellular cognition? How complex can it be? If cognition is defined narrowly in terms of neural representations, it is necessarily rare at the cellular level. However, if instead it is defined operationally as the capacity to modify responses based on stimulus history, then cognition-like behavior is likely widespread in bacterial and unicellular systems. Many regulatory networks naturally integrate past inputs over time via slow protein turnover, transcriptional delays, and molecular memory, making forms of nonassociative learning an intrinsic consequence of cellular regulation rather than an exceptional specialization. From this perspective, the synthetic circuits explored here do not introduce cognition into cells but isolate, formalize, and amplify mechanisms that are already common, highlighting how basal cognitive functions can emerge from minimal biochemical dynamics.
A central tenet in defining cognitive complexity has to do with how organisms cope with time. When dealing with the fundamental components of basal cognition, one first aspect of this problem is how to respond to potential sources of information (from the environment or other individuals) using previous experiences. In neural systems, many of these issues are resolved at the synaptic level, whereas aneural agents have to deal with time and timing using other design principles. Circadian rhythms, for example, define a very important innovation in unicellular organisms, which internalizes night–day cycles and allows anticipation. ?−? ?
In this paper, we have considered a relevant problem regarding the minimal complexity necessary for nonassociative learning based on designed genetic circuits. The regulatory circuits presented, while implemented using components from Gram-negative bacteria, are based on abstract and modular control motifssuch as incoherent feed-forward loops for habituation and double-repression cascades for sensitizationthat arise from general principles of transcriptional regulation. These motifs are therefore not intrinsically tied to bacterial physiology and, in principle, can be ported to other biological systems, including yeast or mammalian cells, by replacing organism-specific elements (e.g., transcription factors, promoters, and degradation or inhibition mechanisms) while preserving the underlying regulatory logic. By exploring this within the context of habituation, sensitization, and mass-spaced learning, we aimed to rigorously define the key conditions that allow for their efficient implementation. A central component common to all our circuits is a memory element that provides the way to keep time and modulate the final response, directly or by means of repressor elements. Moreover, habituation and sensitization responses, as well as their combination, have been designed by means of very minor changes in transcription motifs. This suggests a high accessibility among diverse learning circuits, including potential combinations.
Our pursuit of minimal circuits centers on a transcription factor–based genetic toolkit; however, this choice of genetic motifs does not preclude alternative approaches to studying and engineering cognition through other components of cellular networks. Theoretical research in systems biology has demonstrated that biochemical networks involved in cell signaling can generate context-dependent dynamic behaviors.? More broadly, computational circuits have also been shown to be embeddable within metabolic networks. ?−? ?
Within this broader landscape, an important open question emerging from our findings is whether the circuit motifs investigated here might already be presentalbeit implicitlywithin natural regulatory networks. Indeed, while core topological elements like incoherent feed-forward loops and double-repression cascades are ubiquitous in bacterial networks,? topology alone does not ensure learning-like behavior. Such implementation of nonassociative learning demands specific dynamical properties, most critically a “molecular memory” that decays slowly enough to span multiple stimuli. In natural systems, such persistence is constrained by factors including dilution due to cell division, and evolutionary trade-offs that prioritize metabolic efficiency over the costs of maintaining high protein concentrations required for an analog memory, especially when a simpler, switch-like response suffices. While synthetic circuits may overcome these limitations through tunable, high-level expressionoften with little regard for metabolic costsuch energetically expensive states are seldom sustainable within native cellular networks. Consequently, if such learning motifs occur naturally, their functional expression is likely context-dependent and restricted to specific ecological regimes. Cells may instead exploit alternative memory mechanismssuch as genetic mutations? or DNA methylation?whereas microbial communities can encode past environmental exposures through ecological shifts;? these forms of memory typically outlast transient, inheritance-based memories mediated by metabolites or proteins.? Moreover, in complex environments, adaptive behaviors can also emerge through population-level strategies that favor probabilistic solutionssuch as stochastic phenotype switching ?,? thereby rendering cellular-level “memory” unnecessary.
By using synthetic biology to engineer cognitive functions, we also gain a powerful lens to investigate cellular complexity and its potential constraints. A long-term objective in this endeavor is to map the structure of the “cognitive space” in which various types of cognitive agents may reside. ?,? This raises a fundamental question: to what extent can individual cells, whether existing independently or as part of a multicellular organism, perform cognitive tasks such as learning or anticipation? The role of gene networks in these processes may differ substantially between prokaryotic and eukaryotic systems. For example, a Stentor cell is approximately ∼10^5^ times larger than an E. coli cell, with a reproduction time of τ_r_ ≈ 2–3 days under optimal laboratory conditions, compared to the ∼2 h replication cycle of the bacterium when grown in minimal medium. More generally, protozoan life cycles span 1 to 3 orders of magnitude longer than those of E. coli , allowing molecular mechanismsincluding gene networkssufficient time to influence behavior throughout the life cycle of the organism. In contrast, bacterial replication occurs on a time scale comparable to that of the expression dynamics of candidate synthetic circuits, which significantly constrains the integration of such mechanisms into cellular behavior. Understanding how circuit complexity and organismal complexity are intertwined will be essential for charting the limitsand possibilitiesof cellular cognition.
Supplementary Material
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Szathmáry E.Smith J. M.The major evolutionary transitions Nature 199537422723210.1038/374227 a 07885442 · doi ↗ · pubmed ↗
- 2Lane, N. Life ascending: the ten great inventions of evolution; WW Norton: New York, NY, 2010.
- 3Ginsburg S.Jablonka E.The evolution of associative learning: A factor in the Cambrian explosion Journal of theoretical biology 2010266112010.1016/j.jtbi.2010.06.01720558182 · doi ↗ · pubmed ↗
- 4Hsieh S.Plotnick R. E.Bush A. M.The Phanerozoic aftermath of the Cambrian information revolution: sensory and cognitive complexity in marine faunas Paleobiology 20224839741910.1017/pab.2021.46 · doi ↗
- 5Monk T.Paulin M. G.Predation and the origin of neurones Brain Behavior and Evolution 20148424626110.1159/00036817725472692 · doi ↗ · pubmed ↗
- 6Ginsburg S.Jablonka E.Evolutionary transitions in learning and cognition Philos. Trans. R. Soc. B 20213762019076610.1098/rstb.2019.0766 PMC 793513333550955 · doi ↗ · pubmed ↗
- 7Gunawardena J.Learning outside the brain: Integrating cognitive science and systems biology Proceedings of the IEEE 202211059061210.1109/JPROC.2022.3162791 · doi ↗
- 8Blumstein D. T.Habituation and sensitization: new thoughts about old ideas Animal behaviour 201612025526210.1016/j.anbehav.2016.05.012 · doi ↗
