Evolutionary bioenergetics of sporulation

Canan Karakoç; William R. Shoemaker; Jay T. Lennon

PMC · DOI:10.1073/pnas.2524274123·February 6, 2026

Evolutionary bioenergetics of sporulation

Canan Karakoç, William R. Shoemaker, Jay T. Lennon

PDF

Open Access

TL;DR

This paper calculates the energy cost of bacterial sporulation and explains when it is evolutionarily advantageous.

Contribution

Quantifies the full ATP cost of sporulation and links it to evolutionary trade-offs and trait loss.

Findings

01

Sporulation requires nearly 10^10 ATP and 10% of the bacterial energy budget.

02

Translation, membrane synthesis, and protein turnover account for most of the energy cost.

03

Sporulation is favored when harsh conditions last for months or longer.

Abstract

Evolutionary bioenergetics examines how energetic constraints influence the origin, maintenance, and evolution of cellular components and organismal traits. It takes a bottom–up approach to quantifying the ATP required to assemble biological structures ranging from genes and membranes to virus particles, whole cells, and multicellular organisms. Here, we apply this framework to estimate the full energetic cost of making a bacterial endospore, one of the most abundant and persistent forms of life on Earth. By accounting for macromolecule synthesis, regulatory checkpoints, maternal investment, and subcellular coordination, we identify conditions under which sporulation is favored rather than alternative strategies and reveal the evolutionary forces that have driven the repeated loss of this complex trait across lineages over billions of years. Energy is required for the expression and…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Chemicals1

ATP

Figures5

Click any figure to enlarge with its caption.

Fig. 1 — Energetic cost of spore formation. Top: Major stages of endospore development. Bottom: Energetic cost of spore formation over time reflecting transcriptional and translational investments in units of ATP. Total costs (PT) at each time interval (gray bars) represent the sum of opportunity costs (PO), which are linked to biosynthesis (blue bars) and direct costs (PD) associated with polymerization (orange bars). The total cost of spore formation declines exponentially over time. The embedded pie chart illustrates the proportional contribution (% of PT) associated with key processes, including replication of the whole genome and septum synthesis, along with transcription and translation.

Fig. 2 — Energetic cost of spore revival. Top: Major stages of spore revival, which include germination followed by outgrowth. Bottom: Energetic costs of spore revival over time reflecting transcriptional and translational investments units of ATP. Gray bars represent total costs (PT) for each time interval. Total costs (PT) at each time interval (gray bars) represent the sum of opportunity costs (PO), which are linked to biosynthesis (blue bars) and direct costs (PD) associated with polymerization (orange bars). The total cost of spore revival declines exponentially over time. The embedded pie chart illustrates the proportional contribution (% of PT) associated with key processes, including transcription, translation, and membrane synthesis.

Fig. 3 — Energetic costs of sporulation and other cellular processes and traits. Bars represent build costs (transcription + translation) in units of ATP molecules. The spore life cycle, with colored segments denoting germination, spore formation, and outgrowth, represents the total cost of one complete spore generation (11.5 h). All other bars show costs per vegetative generation (1.16 h at 20°C). For developmental programs (biofilm formation and competence), total costs were divided by program duration; for other traits, we assumed per-generation resynthesis. Genome replication and membrane lipid synthesis are included as reference benchmarks. The secondary axis shows costs as a percentage of the per-generation cellular budget (CG+tgenCM). See Materials and Methods and SI Appendix, Table S3 for calculation details.

Fig. 4 — Energetic constraints on sporulation efficiency. (A) The frequency of sporulation efficiency (fraction of cells that sporulate in a population) derived from publicly available data (SI Appendix, Table S2). (B) In a dynamical model, resource depletion reduces cell growth and triggers spore formation. (C) An increase in spore costs relative to cell costs results in a monotonic decline in sporulation efficiency. The dashed line represents the empirical estimate of the spore-to-cell cost ratio. The sharp decrease in efficiency as costs rise suggests that sporulation becomes energetically unfavorable with increasing costs.

Fig. 5 — Evolutionary maintenance of sporulation. (A) Conservation of sporulation-related genes across spore-forming and non-spore-forming lineages in the Bacillota phylum based on the frequency of COGs (Clusters of Orthologous Genes). There are 237 COGs in total (23). (B) The loss of sporulation genes in Bacillus can be examined by combining energetic cost estimates, which inform the strength of selection (s), with empirical mutation rates (61). Here, we calculate the fixation rate of beneficial deletions relative to that of neutral substitutions (analogous to dN/dS in the population genetics literature; Supporting Information). The black dashed line shows the neutral expectation (s=0), where the ratio is determined solely by the mutation rates of deletions and substitutions. The solid blue line shows the average nonneutral case using published estimates of Ne (61, 62). For deletion sizes where the ratio is greater than 1, the energetic cost of maintaining sporulation genes is sufficient for selection to favor their loss. The sharp peaks reflect the empirical distribution of deletion sizes.

Funding5

—National Science Foundation (NSF)100000001
—National Science Foundation (NSF)100000001
—DOD | USA | AFC | CCDC | Army Research Office (ARO)100000183
—National Aeronautics and Space Administration (NASA)100000104
—Alexander von Humboldt-Stiftung (AvH)100005156

Keywords

energymutationdevelopmentselectiondormancy

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMicrobial Metabolic Engineering and Bioproduction · Bacterial Genetics and Biotechnology · RNA and protein synthesis mechanisms

Full text

Life requires energy to support growth, homeostasis, and long-term persistence. Energy is stored and transferred by molecules such as adenosine triphosphate (ATP), which fuel catabolic and anabolic reactions that govern basal metabolism, biosynthesis, and cellular repair (1, 2). These energy carriers also power gene regulation and translation, processes essential for building and maintaining cellular structures and organismal traits. In the early history of life, energetic constraints shaped the emergence of self-replicating systems and the evolution of core molecular functions (3). Today, the availability of energy continues to influence both physiological and developmental processes, from relatively simple biochemical pathways to more elaborate features such as secretion systems, sensory networks, and multicellular differentiation (4 ?–6). While it is generally assumed that traits are maintained when fitness benefits outweigh their energetic costs, such trade-offs are rarely quantified, particularly under conditions that limit metabolism (7, 8). As such, the role of bioenergetics in determining the evolutionary fate of complex traits remains largely unresolved.

In nature, organisms often inhabit environments where energy is insufficient to support sustained growth and reproduction (9). To endure these unfavorable conditions, many species have evolved mechanisms of persistence that buffer against demographic variance and environmental stochasticity (10). Dormancy is one such strategy, whereby individuals enter a reversible state of reduced metabolic activity. In some cases, dormancy arises passively as metabolism slows in response to environmental stress. In others, it is complex and actively regulated, requiring precise gene expression and cellular remodeling (11). One particularly elaborate and well-characterized form of dormancy is endospore formation, which is found among bacteria like Bacillus and Clostridia. Endospores are among the most metabolically inert and long-lived biological entities (12). They are remarkably resistant to harsh environmental conditions, including elevated temperatures, high doses of ionizing radiation, extreme energy limitation, and the vacuum of space (13, 14). Consequently, endospores are widely distributed across environmental, engineered, and host-associated ecosystems. With an estimated $[eqn]$ individuals in marine sediments alone (15), endospores rank among the most abundant cell types on Earth.

Despite its protective benefits, sporulation comes with an energetic cost. Building a spore demands sustained investment of cellular resources over a prolonged developmental timeline. Roughly, 5% of the Bacillus subtilis genome is devoted to this process, encoding genes involved in the synthesis of spore structures (e.g., cortex and coat proteins), DNA packaging and repair, and signaling pathways that guide cell fate decisions (16). Sporulation is orchestrated by a precisely timed cascade of sigma factors, a family of proteins that enable RNA polymerase to bind specific gene promoters and initiate transcription (17, 18). Each sigma factor is activated in one of two cellular compartments, directing stage-specific gene expression during development (18). This tightly regulated sequence is punctuated by checkpoints that ensure the mother cell adequately provisions the developing forespore (18). However, sporulation alone does not guarantee evolutionary success. To realize a fitness benefit, the resting cell must ultimately be revived through the processes of germination and outgrowth. This developmental transition is regulated by mechanisms that integrate prior germinant and nutrient exposures, producing a memory-like influence on environmental evaluation and decision-making (19, 20). Although germination and outgrowth are faster than sporulation, they entail substantial proteome remodeling (21), adding further costs to the full dormancy cycle.

The energetic demands of spore formation and revival may help explain the distribution and maintenance of this ancient trait. An ancestral feature within the phylum Bacillota, sporulation is thought to have originated nearly three billion years ago (22). Over this geological timespan, it has been conserved in some groups, partially retained in others, and entirely eliminated in major clades such as Staphylococcaceae and Lactobacillaceae, with consequences for molecular evolutionary dynamics (23). Although costly traits can be undermined by cheaters (24), the benefits of sporulation are largely privatized because each cell’s investment directly secures its own survival, which may explain its stability in some lineages despite apparent energetic demands. These dynamics are reflected in phylogenomic comparisons showing that non-spore-forming bacteria diversify more rapidly than their spore-forming counterparts (25), while experimental evolution demonstrates that the loss of sporulation ultimately reduces genetic diversity within populations (26).

The evolutionary loss of sporulation may occur through neutral processes, in which random mutations accumulate simply because the trait is no longer maintained by purifying selection. Over time, this can lead to pseudogenization, frame-shift mutations, or partial pathway degradation. Because the underlying sporulation network represents a large mutational target, models predict that neutral decay of this kind could lead to loss of sporulation within approximately $[eqn]$ generations (27). In contrast, selective pressures can drive the loss of sporulation, favoring deletions that reduce the energetic costs of maintaining unused machinery. These adaptive changes may be reflected in genome streamlining and biased gene loss in nonexpressed regions (28). Distinguishing between neutral and selective mechanisms is crucial for determining whether trait loss reflects passive genetic drift or adaptive optimization to new ecological conditions, thereby illuminating how complex life-history traits are maintained or discarded over evolutionary timescales.

Accurately quantifying the energetic costs of complex traits may clarify the conditions that favor their maintenance or loss. However, standard measurements of energy demand, such as oxygen consumption, heat production, or metabolite turnover, often lack precision, use units that are difficult to interpret in cellular terms, and are challenging to compare across biological scales. To address these limitations, we used a quantitative bioenergetic framework grounded in ATP equivalents, the universal currency of energy transfer in all forms of life (29).

By integrating genomic information with temporally resolved transcriptomic and proteomic measurements, we estimated the energetic demands of sporulation and revival (germination and outgrowth). This approach enabled us to quantify both opportunity costs ( $[eqn]$ ) and direct costs ( $[eqn]$ ) associated with precursor synthesis, gene expression, and proteome turnover across developmental time. With these estimates, we compared the energetic costs of dormancy to other components of the total cellular budget, including baseline energy demands and other persistence traits that may confer advantages under fluctuating or suboptimal conditions. Finally, we incorporated the estimated costs into a mechanistic model to examine how energetic constraints influence sporulation efficiency. By combining these results with genome-scale mutation accumulation data, we tested predictions about how bioenergetics shape the relative contribution of neutral and selective processes to the long-term maintenance of sporulation.

Results and Discussion

We begin with a detailed accounting of the energetic costs underlying the spore life cycle of the model bacterium B. subtilis, resolving patterns of energy expenditure over time, across developmental transitions, and within subcellular compartments. We use an empirical, demand-based approach that is grounded in measured ATP fluxes, biomass synthesis rates, and proteome composition (29), as opposed to supply-limited frameworks that emphasize constraints imposed by respiratory membrane area and mitochondrial amplification (3) (SI Appendix, Table S4). Our analysis shows that the complete spore life cycle demands a major energetic investment, consuming nearly $[eqn]$ ATP, roughly 10% of the total cellular energy budget. Most of this cost arises from opportunity costs ( $[eqn]$ ), including the diversion of metabolic precursors, along with the direct synthesis of spore-specific macromolecules. By incorporating these estimates into a mechanistic model, we identify thresholds beyond which dormancy is no longer advantageous and demonstrate how energetic constraints limit sporulation efficiency, defined as the proportion of cells that complete the transition to an endospore. Based on genome-scale comparisons, our findings indicate that the energetic burden of sporulation can generate positive selection for mutations that disable this pathway, acting alongside neutral decay to drive its evolutionary loss. The bioenergetic framework developed here provides a conceptual and quantitative foundation for understanding how resource allocation strategies shape the ecological and evolutionary dynamics of complex traits.

Energetics of Spore Formation.

We estimate the total cost of producing a single spore to be ∼2.4 $[eqn]$ ATP ( $[eqn]$ ). The majority of this cost (80%) is attributable to opportunity costs ( $[eqn]$ ), which reflect the diversion of precursor metabolites away from growth. The remaining 20% consists of direct costs ( $[eqn]$ ), representing ATP hydrolysis events required for the biosynthesis and polymerization of molecular building blocks. Across both $[eqn]$ and $[eqn]$ , we found that translation accounts for the largest share of energy expenditure (68%), followed by genome replication (17%), and transcription (3%). Upon initiation of sporulation, the cell must duplicate its entire genome (30). This requires $[eqn]$ $[eqn]$ , on the same order as the $[eqn]$ $[eqn]$ needed to transcribe spore-related genes. Approximately 20% of B. subtilis genes participate in or are expressed during sporulation, contributing 18% ( $[eqn]$ $[eqn]$ ) of the total replication cost. Although membrane synthesis has been hypothesized to impose a substantial energetic burden on cells (31), we estimate that the lipid required for the 1 μm septum that separates the mother cell from the forespore only amounts to 12% of $[eqn]$ incurred during sporulation.

Our findings reveal that spore formation is characterized by a pronounced asymmetry in energetic investment, with the mother cell bearing the majority of the biosynthetic burden. We estimate that the mother cell accounts for roughly 87% of the total costs, while the forespore contributes the remaining 13%. Approximately 67% of the total energy expenditure, including the cost of genome replication, occurs within the first hour of development. This early demand drops off exponentially as development progresses, a pattern reminiscent of maternal investment during animal embryogenesis (32). Following initiation, septum formation creates two unequally sized cellular compartments, establishing the physical basis for asymmetric investment. More than one hundred genes are activated within the forespore to support differentiation and maintain coordination with the mother cell (18). Throughout development, the mother cell delivers precursors and recycled building blocks to the forespore through a tubular intercellular channel (33, 34), reducing the need for de novo biosynthesis. This structure also mediates continued signaling between compartments during intermediate and late stages of development (35). Ultimately, the mother cell lyses, releasing the mature spore into the environment (Fig. 1).

Energetic cost of spore formation. Top: Major stages of endospore development. Bottom: Energetic cost of spore formation over time reflecting transcriptional and translational investments in units of ATP. Total costs (PT) at each time interval (gray bars) represent the sum of opportunity costs (PO), which are linked to biosynthesis (blue bars) and direct costs (PD) associated with polymerization (orange bars). The total cost of spore formation declines exponentially over time. The embedded pie chart illustrates the proportional contribution (% of PT) associated with key processes, including replication of the whole genome and septum synthesis, along with transcription and translation.

Spore formation poses a risk for cells navigating unpredictable environments. Development unfolds slowly, requiring 8 to 10 h to complete compared to the 0.5 to 1.0 h division cycle that is typical during vegetative growth. As a result, cells may engage in a costly process that offers no benefit if conditions improve. Our accounting shows that by the time the cell reaches the commitment point, ∼2 h after initiation, 85% of the total transcriptional and translational costs of sporulation have already been incurred. If precursor molecules are fully recycled, then only 20% of these costs are truly nonrecoverable because polymerization and activation are accounted for in $[eqn]$ , whereas full recycling is credited in $[eqn]$ .

The initial phase of sporulation nonetheless represents a significant upfront energetic commitment. Before reaching the developmental checkpoint, cells can still abort the process in response to environmental cues, but such reversal incurs a partial energy loss (35). Once SpoIIE activates asymmetric septum formation and $[eqn]$ is engaged (36), development becomes irreversible (Fig. 1). Beyond this threshold, the cell continues to invest heavily in spore-specific proteins, including structural components of the coat, small acid-soluble proteins that protect DNA from damage, and enzymes such as proteases that are required for revival (37 ?–39).

Energetics of Spore Revival.

Spore revival, which includes germination and subsequent outgrowth, is essential to the success of dormancy. Our analyses reveal that revival is even more energetically costly than spore formation, with total expenditures of $[eqn]$ $[eqn]$ (Fig. 2). Although germination (15 min) and outgrowth (3.5 h) proceed more rapidly than sporulation, they demand significant energy to reestablish vegetative growth, though the timing and extent of ATP accumulation during early spore revival remain debated (40). During the initial 15 min of revival, transcription and translation require ∼7.6 $[eqn]$ $[eqn]$ ( $[eqn]$ $[eqn]$ and $[eqn]$ $[eqn]$ ), representing 12% of the total revival cost. The outgrowth phase, in contrast, accounts for the vast majority (88%) of the total energetic cost. Transcription and translation require ∼5.4 $[eqn]$ $[eqn]$ ( $[eqn]$ $[eqn]$ and $[eqn]$ $[eqn]$ ), and membrane biogenesis adds another ∼6.9 $[eqn]$ $[eqn]$ ( $[eqn]$ $[eqn]$ and $[eqn]$ $[eqn]$ ). Importantly, these estimates already account for the substantial recycling of spore membrane lipids during outgrowth: roughly one sixth of the membrane is reused, and only ∼3% is newly synthesized at this stage (41, 42).

Energetic cost of spore revival. Top: Major stages of spore revival, which include germination followed by outgrowth. Bottom: Energetic costs of spore revival over time reflecting transcriptional and translational investments units of ATP. Gray bars represent total costs (PT) for each time interval. Total costs (PT) at each time interval (gray bars) represent the sum of opportunity costs (PO), which are linked to biosynthesis (blue bars) and direct costs (PD) associated with polymerization (orange bars). The total cost of spore revival declines exponentially over time. The embedded pie chart illustrates the proportional contribution (% of PT) associated with key processes, including transcription, translation, and membrane synthesis.

Our analysis reveals a highly uneven allocation of energy across core molecular processes during revival, with transcription accounting for just 4%, translation for 87%, and membrane assembly for 9% of total energy expenditure (Fig. 2). This elevated demand during revival reflects the rapid reactivation of biosynthetic pathways and reconstruction of the vegetative-cell proteome, in contrast to spore formation, which functionally relies on a smaller, specialized protein subset. Although revival-specific genes contribute an estimated ∼4.6 $[eqn]$ ( $[eqn]$ ), equivalent to approximately 11% of the energy required for genome replication, this cost is paid in advance at the onset of sporulation.

Despite their ability to rapidly reawaken, dormant spores contain remarkably little internal energy. Our bioenergetic accounting shows that a total of ∼7.6 $[eqn]$ ATP are required for germination alone. However, ATP and GTP concentrations within Bacillus spores are extremely low, at around 2 nmol/g, as estimated using magnetic resonance spectroscopy with ^31^P NMR (43). This corresponds to roughly $[eqn]$ molecules per spore. Additional reserves including AMP (∼400 nmol/g) and ADP (∼100 nmol/g) (44), are also present in low concentrations. Assuming a spore mass of ∼200 femtograms (45), total endogenous stores amount to just $[eqn]$ ATP equivalents. These estimates reveal that the energy required for germination exceeds internal reserves by nearly five orders of magnitude. While endogenous pools may be sufficient for initiating the early steps of germination, they are far from adequate to support the full transition from dormancy to active growth.

The apparent energy shortfall presents a fundamental problem concerning how a dormant spore generates enough ATP to complete germination. Our analysis suggests that spores overcome this limitation by mobilizing prepackaged molecular reserves that are metabolized immediately upon rehydration. Within 5 min of germination onset, essential components such as enzymes, ribosomes, amino acids, and nucleotides are activated to initiate core metabolic processes (44). A major source of amino acids is the rapid degradation of small acid-soluble proteins, which account for 10 to 20% of the spore’s proteome (46). In addition, stored carbon sources, including 3-phosphoglyceric acid (∼2,700 nmol/g) and malate (∼3,000 nmol/g), fuel glycolysis and related metabolic pathways, yielding ∼2.1 $[eqn]$ ATP in total (43, 47). These reserves are substantial but insufficient to cover the full germination demand; they likely bridge the earliest reactivation steps until exogenous carbon is taken up during outgrowth. However, full outgrowth and the return to vegetative growth requires additional energy from the external environment. Increased glucose uptake during outgrowth likely supports the elevated ATP production needed for biosynthesis and cell expansion (48). Because both spore formation and revival impose substantial energetic costs, their evolutionary persistence must be evaluated relative to alternative cellular stress responses, which we explore in the next section.

Head-to-Head Bioenergetics of Microbial Survival.

Spore-forming bacteria like Bacillus deploy tiered stress responses that vary in reversibility, energy cost, and benefits over time. The early responses, coordinated by the sigma factor $[eqn]$ , involve low-cost physiological adjustments such as DNA repair, redox homeostasis, and osmoprotection (49, 50). These transient changes help cells buffer short-term environmental fluctuations without altering their developmental trajectory. If conditions worsen, cells may adopt facultative strategies such as motility (51), cannibalism (52), oligotrophic survival (53), or competence (54), which offer moderate costs and reversible outcomes. When these options are insufficient, cells initiate spore formation, a developmental program that requires a major upfront investment but offers long-term protection.

We define the total build cost as the ATP required for transcription and translation. By this measure, we estimate that the Bacillus spore life cycle requires about $[eqn]$ ATP. When additional infrastructure costs are included, such as genome replication (∼4.0 $[eqn]$ ATP), septum synthesis (∼2.8 $[eqn]$ ATP), and membrane remodeling during revival (∼6.9 $[eqn]$ ATP), the complete energetic investment rises to ∼9.2 $[eqn]$ ATP, nearly 10 billion overall (Fig. 3).

Energetic costs of sporulation and other cellular processes and traits. Bars represent build costs (transcription + translation) in units of ATP molecules. The spore life cycle, with colored segments denoting germination, spore formation, and outgrowth, represents the total cost of one complete spore generation (11.5 h). All other bars show costs per vegetative generation (1.16 h at 20°C). For developmental programs (biofilm formation and competence), total costs were divided by program duration; for other traits, we assumed per-generation resynthesis. Genome replication and membrane lipid synthesis are included as reference benchmarks. The secondary axis shows costs as a percentage of the per-generation cellular budget (CG+tgenCM). See Materials and Methods and SI Appendix, Table S3 for calculation details.

To facilitate meaningful comparisons across different life history strategies, we express all costs on a per-generation basis. For vegetative growth, one generation corresponds to 1.16 h at 20^°^ C, whereas a full spore life cycle, which includes formation, germination, and outgrowth, spans 11.5 h. For developmental programs such as biofilm formation (∼12 h) and competence development (∼3 h), we expressed total costs on a per generation basis by normalizing them to program duration. For other traits such as flagella and stress response proteins, we assumed complete resynthesis each generation, providing conservative upper bounds. In reality, many vegetative structures persist across generations (55), which further emphasizes the high relative cost of sporulation. Maintenance energy (∼1.3 $[eqn]$ ATP) represents the per-generation operating cost of core housekeeping metabolism and is distinct from the build costs of homeostasis proteins, which are synthesized once to provide stress-response capacity. For reference, genome replication (∼4.0 $[eqn]$ ATP) and membrane bilayer synthesis (∼2.8 $[eqn]$ ATP) are shown on the same per-generation scale. Together, these comparisons indicate that the spore life cycle is among the most energy-intensive processes in the bacterial cell (Fig. 3).

Although seemingly expensive, spore formation is a front-loaded strategy whose benefits accumulate with time. We illustrate this by comparing a cell that sporulates with one that remains vegetative but does not grow. Using a maintenance rate of ∼1.2 $[eqn]$ at $[eqn]$ , the break-even time is

[eqn]

If a cell can reduce its maintenance energy requirements tenfold to (∼1.2 $[eqn]$ ), the break even time ( $[eqn]$ ) becomes $[eqn]$ (3 d). With a hundredfold reduction to (∼1.2 $[eqn]$ ), the break even time is $[eqn]$ (32 d). Thus, without committing to sporulation, Bacillus can remain viable for extended periods at low power draw (53). However, under prolonged or unpredictable stress, the one-time cost of dormancy becomes the energetically optimal investment. Note that maintenance and growth conditions are temperature dependent, so these values will change accordingly.

Although commonly viewed as a last resort, sporulation can represent a bioenergetically optimal strategy in fluctuating environments. Such conditions are widespread in nature, including energy-limited soils and sediments where resource inputs are both variable and vanishingly small (9). By comparing the energy costs of microbial stress responses in a common currency, our analysis reframes dormancy not as a passive fallback but as a strategic investment shaped by ecological, evolutionary, and developmental constraints.

Costs on Collective Outcomes: Sporulation Efficiency.

Understanding the energetic costs of sporulation at the cellular level has the potential to explain how dormancy strategies scale up to shape population-level phenomena. Excluding transitional states, an individual Bacillus exists either as a metabolically active vegetative cell or as a dormant spore (Figs. 1 and 2). Yet within a given environment, populations rarely exhibit uniform behavior. Instead, the fraction of individuals that undergo sporulation, known as sporulation efficiency (φ), is highly variable.

In batch culture, where rapid physiological shifts and unbounded population growth eventually lead to resource exhaustion, our analysis of literature-reported data shows that the median sporulation efficiency is roughly 30%, with values ranging from 0 to 100% (Fig. 4A). Some of this variation likely reflects details of experimental design, including differences in strain background, resources, temperature, and sampling methodology. Incomplete sporulation has also been interpreted as the outcome of stochastic decision-making or bet-hedging strategies, in which subsets of cells commit to dormancy while others remain vegetative to exploit residual resources or potential improvements in environmental conditions. While such strategies are evolutionarily important, they modulate the long-term fitness consequences of existing costs but do not reduce the cost of an individual spore.

Energetic constraints on sporulation efficiency. (A) The frequency of sporulation efficiency (fraction of cells that sporulate in a population) derived from publicly available data (SI Appendix, Table S2). (B) In a dynamical model, resource depletion reduces cell growth and triggers spore formation. (C) An increase in spore costs relative to cell costs results in a monotonic decline in sporulation efficiency. The dashed line represents the empirical estimate of the spore-to-cell cost ratio. The sharp decrease in efficiency as costs rise suggests that sporulation becomes energetically unfavorable with increasing costs.

Because all cellular features require energy to build and maintain, we examined how our estimates of energetic costs influence sporulation efficiency. To investigate this relationship, we developed a population dynamics model in an environment where resources decline over time:

[eqn]

[eqn]

[eqn]

Here, $[eqn]$ and $[eqn]$ represent the concentrations of vegetative cells and spores, respectively. The functions $[eqn]$ , $[eqn]$ , and $[eqn]$ represent the per capita rates of cellular growth, sporulation, and the initiation of sporulation, each dependent on the concentration of available resources R. The yields $[eqn]$ and $[eqn]$ describe the efficiency of resource use for producing cells and spores, respectively, with ATP used as the unit of energetic currency (see Methods and SI Appendix).

Our model predicts that sporulation efficiency declines sharply when energetic costs exceed empirically derived estimates (Fig. 4C). This outcome reflects a trade-off between resource availability and the energy required for spore formation, which limits the number of cells that can successfully sporulate. By integrating empirical measurements with mechanistic modeling, we show that the relative cost of producing a spore compared to a vegetative cell can constrain the success of dormancy as a survival strategy. As a point of comparison, we developed the chemostat equivalent of our model and found that steady-state sporulation efficiency does not depend on the cost of building a spore or a cell (SI Appendix).

Evolutionary Maintenance of an Energetically Costly Trait.

Sporulation is an ancient survival strategy that originated billions of years ago. It confers resistance to harsh conditions, enables long-distance dispersal, and protects against viral infection (14, 56, 57). These advantages have contributed to the prevalence of spore-forming lineages across the globe. Yet despite its benefits, this complex form of dormancy remains susceptible to evolutionary decay. Repeated and independent losses of sporulation have occurred across the Bacillota phylogeny (26, 58), often attributed to relaxed selection in environments that favor continuous growth (58). In such settings, sporulation genes may no longer provide a fitness benefit and become prone to mutational degradation (59). In addition to neutral decay, the energetic burden of unused genetic material may drive the adaptive deletion of sporulation loci, as even nonexpressed genes and regulatory elements can impose fitness costs (60). The relative importance of neutral vs. selective forces in the loss of sporulation, however, remains unresolved.

When growth is favored, sporulation genes are typically not expressed, rendering their transcriptional and translational costs negligible. However, DNA replication still requires energy and resources. If even small costs are visible to selection, sporulation genes may be lost through the fixation of beneficial deletions. This contrasts with trait loss driven by neutral processes, in which mutations accumulate gradually through nucleotide substitutions. In Bacillus, the estimated rate of deletions is $[eqn]$ per site per generation, while the rate of nucleotide substitution is $[eqn]$ per site per generation, making substitutions nearly three times more frequent (61). Whether the energetic cost of maintaining unused sporulation genes is sufficient to favor deletions over neutral decay depends on the strength of selection and the size of a deletion. To examine this, we compared the fixation rate of a beneficial deletion to that of a neutral substitution:

[eqn]

where $[eqn]$ is the effective population size, s is the selection coefficient for a deletion of a given size, $[eqn]$ is the rate of deletions, and $[eqn]$ is the rate of substitutions.

To examine how the relative fixation rate depends on deletion size, we combined our energetic cost estimates with published values for $[eqn]$ and the empirical distribution of deletion sizes from mutation accumulation experiments (61, 62) (Fig. 5A). Although deletions occur less frequently than substitutions, selection acting on large deletions (∼1 kb) is strong enough to outweigh the rate of neutral substitutions. This length corresponds to the typical size of a gene, providing evidence that selection can drive the loss of sporulation in bacterial populations (Fig. 5B). While previous studies have emphasized mutational degradation under relaxed selection (58), our analysis shows that the energetic burden of maintaining unused sporulation genes can be sufficient to favor their adaptive deletion. Even small metabolic costs, if persistent, may become visible to selection (27, 63). Observations from host-associated Bacillota, where sporulation loss often coincides with genome streamlining and metabolic specialization, support the idea that energy-driven processes shape trait loss in natural populations under real-world conditions (64, 65).

Evolutionary maintenance of sporulation. (A) Conservation of sporulation-related genes across spore-forming and non-spore-forming lineages in the Bacillota phylum based on the frequency of COGs (Clusters of Orthologous Genes). There are 237 COGs in total (23). (B) The loss of sporulation genes in Bacillus can be examined by combining energetic cost estimates, which inform the strength of selection (s), with empirical mutation rates (61). Here, we calculate the fixation rate of beneficial deletions relative to that of neutral substitutions (analogous to dN/dS in the population genetics literature; Supporting Information). The black dashed line shows the neutral expectation (s=0), where the ratio is determined solely by the mutation rates of deletions and substitutions. The solid blue line shows the average nonneutral case using published estimates of Ne (61, 62). For deletion sizes where the ratio is greater than 1, the energetic cost of maintaining sporulation genes is sufficient for selection to favor their loss. The sharp peaks reflect the empirical distribution of deletion sizes.

Theoretical treatments of sporulation and dormancy often emphasize their role as bet hedging strategies that promote survival under environmental uncertainty (66). Other modeling approaches have cast sporulation as an optimal switching problem (67), dissected the genetic circuitry that governs commitment (68, 69), and explored trade-offs between spore quality and quantity (70). These perspectives illuminate the conditions under which sporulation is expected to be advantageous, but they generally treat its costs only abstractly or implicitly. By quantifying the energetic demands of the spore life cycle, our analysis shows how these costs influence the selective conditions under which sporulation is maintained, and importantly, the circumstances in which it can be lost.

The loss of complex traits is a recurring theme in evolution, often attributed to relaxed selection and the accumulation of neutral mutations (71, 72). Yet even in classic examples such as eye reduction in cave-dwelling fish or flight loss in birds, growing evidence suggests that the regression of biological function can be favored by selection, especially when a trait imposes physiological burdens or energetic costs (73, 74). Our study provides a first-principles demonstration of how the energetic cost of a complex trait can influence its evolutionary fate. More broadly, the trade-offs we identify may apply to other metabolically expensive traits across the tree of life, including bioluminescence, motility, and secondary metabolite production in microbes, symbiotic nodulation and inducible defenses in plants, and elaborate courtship displays or environmentally contingent morphologies in animals. Although our model focuses on sporulation, a widespread form of dormancy that supports the persistence and dispersal of a globally abundant group of bacteria, the broader approach of quantifying energetic investment and linking it to evolutionary dynamics provides a scalable framework for understanding the distribution and loss of complex traits in nature.

Materials and Methods

Bioenergetic Accounting: Definitions and Assumptions.

We estimated bioenergetic costs using glucose as the sole carbon and energy source. In bacteria, glucose is metabolized through the tricarboxylic acid (TCA) cycle, yielding ATP. The hydrolysis of ATP to ADP and inorganic phosphate ( $[eqn]$ ) releases approximately 30 kJ/mol of free energy ( $[eqn]$ ) under standard conditions. While some ATP is used to power energy-consuming processes, other hydrolysis events support the biosynthesis of macromolecular building blocks. Following established conventions (7, 29), we estimate cellular energy expenditure in ATP (or ATP-equivalents) and use P to denote one hydrolysis event (GTP, etc., counted 1:1 as P). Following refs. 29 and 75, we partition costs into direct ( $[eqn]$ ) and opportunity ( $[eqn]$ ). Direct costs capture ATP-powered steps such as monomer activation/charging and polymerization (e.g., aminoacyl-tRNA charging, chain elongation). Opportunity costs capture the energetic value of precursor synthesis (e.g., NADPH-consuming reductions) that forgoes growth if diverted to a trait. Total cost is $[eqn]$ . We report $[eqn]$ unless stated otherwise, because it best reflects resource allocation under balanced growth. However, $[eqn]$ is useful when considering heat dissipation and instantaneous power, and $[eqn]$ is useful when considering that some monomers can be recycled during processes. We assume 20^°^C for budget references and use the growth ( $[eqn]$ ) and maintenance ( $[eqn]$ ) entries of ref. 29 where indicated.

It is important to note that biosynthetic costs vary across species due to differences in metabolic pathways and environmental factors such as resource availability (76). Our analysis focused on cellular processes with sufficient quantitative data, including those related to the central dogma and membrane synthesis. We excluded minor costs below approximately $[eqn]$ ATP, such as posttranslational modifications, and protein folding (29), although even these small expenditures may be evolutionarily significant in certain regulatory contexts (60).

Replication costs.

Costs associated with the replication of the mother cell genome are incurred early in the spore cycle, before the commitment stage ∼2 h. Although DNA replication is a complex process involving unwinding, primer synthesis, Okazaki fragment ligation, and proofreading, most of the energy is spent on synthesizing nucleotide building blocks and polymerizing them into DNA.

Let $[eqn]$ be the genome length (bp). With $[eqn]$ the direct synthesis cost of a DNA nucleotide (excluding polymerization; $[eqn]$ ), $[eqn]$ the polymerization cost ( $[eqn]$ ), $[eqn]$ for helicase unwinding, and $[eqn]$ for lagging-strand primers (29, 75), we write:

[eqn]

The opportunity cost uses the average opportunity per DNA nucleotide, $[eqn]$ (precursor synthesis $[eqn]$ plus conversion), giving

[eqn]

Ligation and proofreading are $[eqn]$ of $[eqn]$ and are neglected.

Transcription costs.

Transcription is also affected by complex mechanisms such as activation, initiation, termination, proofreading, and RNA processing. From a bioenergetic perspective, however, most of the transcription budget is dedicated to nucleotide synthesis and polymerization (29). Therefore, the expression costs of genes can be estimated as the sum of the costs associated with protein-coding genes involved in spore formation and spore revival. Like replication, transcription costs have two components: direct costs associated with the polymerization and the opportunity costs, which are the energy needed to synthesize the ribonucleotide building blocks. Assuming that the length of genes and mRNAs (premature and mature) are approximately the same as in other bacteria (29), we estimate the direct costs of transcription as follows:

i) One-time costs: We assume efficient nucleotide recycling, so nucleotide synthesis for each transcript is charged once. Let $[eqn]$ be the length of transcript j, $[eqn]$ the number of transcripts produced over the program, and $[eqn]$ the direct synthesis cost per RNA nucleotide (i.e., RNA direct cost minus polymerization; $[eqn]$ ). The corresponding opportunity cost uses $[eqn]$ per nucleotide:

[eqn]

[eqn]

Here $[eqn]$ is the average direct costs of synthesizing an RNA nucleotide (10 $[eqn]$ ), and $[eqn]$ is the average opportunity cost of synthesizing an RNA nucleotide, which is $[eqn]$ $[eqn]$ .

ii) Time-resolved polymerization: Polymerization is paid whenever a transcript is made. We distribute the transcript production of gene j across hours t using weights $[eqn]$ proportional to the measured expression at hour t, with $[eqn]$ . If $[eqn]$ transcripts are made at hour t, the hourly direct cost is

[eqn]

where $[eqn]$ per nucleotide is the RNA polymerase elongation cost, and $[eqn]$ is a per nucleotide small direct overhead for relieving transcription-induced supercoils, motivated by the twin-domain model of Liu and Wang (77) and the mechanochemistry of DNA gyrase (≈ 2 ATP per negative supercoil); we parameterize this as $[eqn]$ , typically <5% of polymerization. The totals over the program are $[eqn]$ and $[eqn]$ from Eq. 8.

To connect to protein demand, we estimate $[eqn]$ from protein abundances (parts per million, ppm) and a yield Y of proteins per mRNA lifetime: $[eqn]$ , $[eqn]$ . We use $[eqn]$ proteins per cell (78) and a representative $[eqn]$ . Because this yield-based accounting already fixes the number and timing of transcript synthesis via $[eqn]$ , we do not introduce an explicit mRNA degradation rate in the cost formulas to avoid double-counting.

Translational costs.

Although mRNA has a shorter half-life and ribonucleotide polymerization is relatively expensive, proteins are 100 to 1,000 times more abundant than transcripts (29). In addition, nucleotides are already activated molecules, whereas tRNA must be charged with amino acids, making chain elongation more energetically demanding (29). As a result, most of the energetic cost of protein synthesis is attributed to translation, while processes such as initiation, termination, and posttranslational modification contribute minimally to total expenditure (29). We estimated the per-cell translation costs as follows:

[eqn]

[eqn]

where $[eqn]$ includes $[eqn]$ per incorporated amino acid for aminoacylation and elongation, and $[eqn]$ uses amino acid opportunity costs [bacterial mean (∼25 $[eqn]$ )] (29). Protein degradation during the sporulation window is slow relative to synthesis, so we neglect turnover terms.

Beyond elongation and aminoacylation, we add a constant $[eqn]$ per completed protein to account for one GTP at initiation (IF2) and one GTP during termination/recycling (RF3), consistent with standard bacterial translation cycles (79). Other small terms (e.g., proofreading, chaperone cycles) were not included, consistent with our $[eqn]$ ATP cutoff.

Membrane synthesis and remodeling.

We estimated the energetic cost of synthesizing and remodeling lipid bilayers for the cell envelope and sporulation-specific structures. The number of lipids is the bilayer area divided by the head-group area $[eqn]$ (80 ?–82). We modeled the vegetative cell as a spherocylinder (cylinder + hemispherical caps) with length $[eqn]$ and diameter $[eqn]$ (41). Writing $[eqn]$ and $[eqn]$ , and using bilayer thickness $[eqn]$ (3.0 nm hydrophobic core + 0.5 nm headgroup radius per leaflet) (83, 84), the outer and inner leaflet areas are

[eqn]

Assuming a membrane protein occupancy of $[eqn]$ , the lipid count is

[eqn]

We take unit lipid costs as $[eqn]$ and $[eqn]$ (75), giving

[eqn]

The septum is modeled as a flat circular bilayer (thickness effects negligible relative to b), with total bilayer area:

[eqn]

The costs follow analogously with the same $[eqn]$ and unit costs.

During revival (0 to 3.5 h), we assume that only a fraction $[eqn]$ of the full vegetative membrane is newly synthesized and that a fraction $[eqn]$ of lipid cost is offset by recycling. Net revival membrane cost is

[eqn]

We use $[eqn]$ in the main analysis.

Empirical Data Used for Bioenergetic Accounting.

We used various publicly available cellular and molecular information to support this theoretical framework.

Gene sets and lengths.

Gene and protein metadata (including lengths and SubtiWiki symbols) were obtained from SubtiWiki (85). To unify identifiers, we normalized gene tokens and propagated locus tags across synonym lists.

Reference genome and counting conventions.

All genome-level quantities were referenced to B. subtilis subsp. subtilis strain 168 (NCBI RefSeq NC_000964.3; assembly GCF_000009045.1, ASM904v1). The genome length used in our calculations is $[eqn]$ bp. Unless stated otherwise, gene counts refer to protein-coding loci (CDS); the total number used here is $[eqn]$ . Noncoding RNAs were excluded from gene-count totals but may appear in expression sets where relevant. All percentages “of the genome” and “of genes” are computed relative to these values.

Protein sequences and per-protein costs.

Reviewed UniProt sequences for B. subtilis were mapped deterministically to locus tags via BSU identifiers in name/gene fields, then through unambiguous SubtiWiki primary/alias tokens. Per-protein direct and opportunity costs ( $[eqn]$ ) were computed by summing amino acid costs from ref. 75 over each sequence.

Protein abundances.

PaxDb abundances (ppm) (86, 87) were tokenized and mapped to locus tags using the same symbol map. When a PaxDb row matched multiple tags, ppm was split evenly (mass-conserving). Proteins without a match received a tiny floor (0.1 ppm), then all ppm were renormalized to the known total protein count $[eqn]$ (78) to obtain copies per cell.

Sporulation time series.

Expression heatmaps from SporeWeb (88, 89) were converted to long format ( $[eqn]$ – $[eqn]$ ). For each gene j, hourly weights $[eqn]$ normalized expression were used to i) assign one-time transcript synthesis at first appearance and ii) distribute polymerization across hours (Transcription Costs).

Revival (germination and outgrowth).

Newly synthesized proteins from Swarge et al. (21) were converted to interval scores as successive differences (H0.25, H0.5, H1, H1.5, H2.5, H3.5). We excluded H5.5 (210 to 330 min) to avoid vegetative growth. For each protein, interval weights (normalized positive scores) distribute copies and thereby transcription/translation costs over time. Revival membrane remodeling used $[eqn]$ with $[eqn]$ as above.

Handling of missing fields.

When gene or protein length was unavailable in SubtiWiki or the mapped UniProt entry, we substituted the dataset median (gene length and protein length medians computed over the available entries). If a protein lacked sequence-derived per-protein cost totals, we estimated them as (length × median per-amino acid cost) computed across proteins; if length was also missing, we used protein-level medians. For proteins without a PaxDb match we assigned a small abundance floor (0.1 ppm) and then renormalized all ppm values to the total protein count, ensuring mass conservation.

Head-to-head bioenergetics.

To compare energetic costs of sporulation with other cellular processes, we expressed all trait costs on a per-generation basis. For vegetative traits, one generation corresponds to the division time of approximately 1.16 h at 20^°^C. For the spore life cycle, one generation corresponds to the complete developmental program (spore formation, germination, and outgrowth), which spans ∼11.5 h. This framework enables direct comparison of the metabolic burden imposed by different life history strategies operating on distinct timescales.

We define build costs as the sum of ATP expenditures for transcription and translation required to synthesize the proteins associated with each trait, excluding maintenance, protein turnover, and operational costs. For the spore life cycle, build costs were calculated by integrating time-resolved gene expression data for sporulation (88) and revival (21) over the developmental program. For other cellular traits, build costs were estimated from steady-state protein abundances during growth (86, 87).

Gene sets for biofilm formation, motility, chemotaxis, competence, and stress responses were curated from SubtiWiki (85), mapped to locus tags, and merged with protein abundances. Transcription costs were calculated assuming steady-state transcript abundance given by a protein-to-mRNA yield ratio of $[eqn]$ . Translation costs were computed as protein copy number multiplied by per-amino acid synthesis costs (direct + opportunity) (75).

For developmental programs operating over multiple vegetative generations (biofilm formation, competence development), total build costs were divided by program duration to obtain per-generation equivalents. For other traits, we assumed complete per-generation resynthesis (i.e., costs calculated directly from steady-state abundances represent per-generation values), providing conservative upper bounds given that some cellular structures may persist across cell divisions. For traits exhibiting population heterogeneity (e.g., biofilm formation, competence), bulk proteomic measurements yield population-averaged costs. These values represent the mean energetic burden across all cells, including both producers and nonproducers, and may underestimate per-producer investment. However, population-averaged costs remain appropriate for evolutionary analyses, as selection shapes population-level fitness.

All costs are expressed relative to the per-generation budget $[eqn]$ ATP at 20^°^C (29), where $[eqn]$ is the growth cost and $[eqn]$ is the maintenance energy. Detailed calculation methods, data sources, and assumptions for each trait are provided in SI Appendix, Table S3.

Bioenergetic Model of Spore Efficiency.

To investigate how energy limitation influences sporulation efficiency, we developed a population dynamics model of Bacillus that incorporates the metabolic demands of endospore formation and vegetative cell production. We used bioenergetic yield parameters to capture how efficiently resources are converted into either spores or cells. The yields of vegetative cells ( $[eqn]$ ) and spores ( $[eqn]$ ) are defined as

[eqn]

[eqn]

Using our ATP estimates, we can parameterize these yields:

[eqn]

where ϵ represents the ATP produced per (μg) of resource, reflecting the efficiency of energy extraction from a given substrate. These expressions link the yields of cells and spores, reducing the degrees of freedom in our model by one. Given our definition of yield and the focus of our study on bioenergetics, it is important to explicitly account for the consumption of a limiting resource (R). To do so, we apply a minimal model of microbial growth in batch culture, defined as follows:

[eqn]

[eqn]

[eqn]

[eqn]

[eqn]

where $[eqn]$ is the maximum growth rate, and K is the half-saturation constant, specifying the value of R where $[eqn]$ is half of $[eqn]$ . The rate of spore initiation is modeled as the following function:

[eqn]

The sporulation initiation function exhibits intuitive properties. The sensitivity parameter σ governs the steepness of the transition from a low to high rate of spore formation. When $[eqn]$ , the function approximates a Heaviside step function (90):

[eqn]

The inclusion of a spore germination term is unnecessary for a deterministic system where the resource concentration only decreases over time.

Evolutionary Maintenance of Sporulation.

In this section, we use a minimal model of natural selection to illustrate the selective advantage conferred by the loss of a single nonfunctional nucleotide via a mutation-induced deletion. We focus on a scenario in which a population of spore-forming Bacillus inhabits an environment that supports continuous growth, making sporulation unnecessary. In this context, sporulation genes are not expressed and become effectively nonfunctional. As a result, they fall under relaxed selection (91), allowing mutations to accumulate over time, including those that would be deleterious in environments where sporulation is required.

However, nonfunctional DNA can still impose a metabolic cost. A cell that loses a single nucleotide through a mutation-induced deletion may gain a fitness advantage relative to its ancestor due to having been relieved of this slight, though existent, metabolic burden. This advantage can be calculated as the number of ATP required to build nucleotides ( $[eqn]$ ATP), expressed relative to the cell’s total energetic budget (29). In the case of Bacillus, the selective advantage of the mutant can be calculated as

[eqn]

Under the strong-selection, weak-mutation evolutionary limit, the probability that a mutation of frequency f will fix can be derived as (92):

[eqn]

A de novo mutation has an initial frequency of $[eqn]$ , from which we obtain for $[eqn]$ :

[eqn]

where N is the effective population size. Deletions are then successfully fixed at the following rate:

[eqn]

[eqn]

We have identified how positive selection can reduce the energetic demands imposed by a nonfunctional region of the genome. The function $[eqn]$ , however, is somewhat unwieldy because it depends on the number of elapsed generations. To simplify interpretation, it is useful to normalize this function by the rate of putatively neutral substitutions. Since deletions have been defined in terms of fitness gains, it is reasonable to treat substitutions at the same sites as neutral. Therefore, we consider only substitutions within the region encoding endospore formation as neutral. The rate of substitution in this region can be defined as:

[eqn]

By taking the ratio, we can acquire a time-independent quantity:

[eqn]

This equation is effectively the classic ratio of nonsynonymous to synonymous substitutions, adjusted for differing mutation rates. This adjustment is important because $[eqn]$ is typically lower than $[eqn]$ (61, 93 ?–95). In Bacillus, mutation accumulation experiments estimated the rate of insertions and deletions at $[eqn]$ per event, per individual, per generation, while the rate of substitution was $[eqn]$ , nearly three times higher (61). By counting the fraction of indels that were identified as deletions in each reported mutation accumulation line, we can estimate a deletion rate of $[eqn]$ .

Using these rates, we evaluated the likelihood that a beneficial deletion in spore formation and revival genes becomes fixed relative to a neutral substitution, given the energetic cost of a single nucleotide. Under strict neutrality, this ratio is expected to be less than one due to the lower mutation rate of deletions (SI Appendix, Fig. S3). To examine how this ratio scales with deletion size, we applied two published estimates of the effective population size for Bacillus: $[eqn]$ from ref. 61 and $[eqn]$ from ref. 62.

To better capture how metabolic costs influence the evolution of nonfunctional DNA in Bacillus, additional features can be incorporated into the model. For example, deletions do not always remove a single nucleotide. Large segments of the bacterial genome can be lost through individual deletion events (96). We can define the strength of selection on a deletion of size Δ as the product of the deletion size and the relative metabolic burden of a single nucleotide, $[eqn]$ . However, these larger fitness gains may be rare due to the lower probability of observing large deletions (97).

We can examine the empirical probability distribution of Δ in Bacillus. Although it does not capture all possible deletion sizes, it reflects a general pattern supported by data (SI Appendix, Fig. S3). Smaller deletions are more frequent than larger ones. We incorporated this distribution into our calculation of the ratio $[eqn]$ , defining a size-specific deletion function:

[eqn]

where $[eqn]$ is the empirical probability of acquiring a deletion of size Δ.

Supplementary Material

Appendix 01 (PDF)

Bibliography99

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1S. J. Pirt, The maintenance energy of bacteria in growing cultures. Proc. R. Soc. London. Ser. B, Biol. Sci. 163, 224–231 (1965).4378482 10.1098/rspb.1965.0069 · doi ↗ · pubmed ↗
2C. P. Kempes, L. Wang, J. P. Amend, J. Doyle, T. Hoehler, Evolutionary tradeoffs in cellular composition across diverse bacteria. ISME J. 10, 2145–2157 (2016).27046336 10.1038/ismej.2016.21PMC 4989312 · doi ↗ · pubmed ↗
3N. Lane, W. Martin, The energetics of genome complexity. Nature 467, 929–934 (2010).20962839 10.1038/nature 09486 · doi ↗ · pubmed ↗
4J. E. Niven, S. B. Laughlin, Energy limitation as a selective pressure on the evolution of sensory systems. J. Exp. Biol. 211, 1792–1804 (2008).18490395 10.1242/jeb.017574 · doi ↗ · pubmed ↗
5R. Unni, K. L. Pintor, A. Diepold, D. Unterweger, Presence and absence of type VI secretion systems in bacteria. Microbiology 168, 001151 (2022).10.1099/mic.0.00115135467500 · doi ↗ · pubmed ↗
6M. Lynch, The bioenergetic cost of building a metazoan. Proc. Natl. Acad. Sci. U.S.A. 121, e 2414742121 (2024).39508768 10.1073/pnas.2414742121 PMC 11573499 · doi ↗ · pubmed ↗
7A. Wagner, Energy constraints on the evolution of gene expression. Mol. Biol. Evol. 22, 1365–1374 (2005).15758206 10.1093/molbev/msi 126 · doi ↗ · pubmed ↗
8W. R. Shoemaker, J. T. Lennon, Evolution with a seed bank: The population genetic consequences of microbial dormancy. Evol. Appl. 11, 60–75 (2018).29302272 10.1111/eva.12557 PMC 5748526 · doi ↗ · pubmed ↗