UnifiedGreatMod: a new holistic modelling paradigm for studying biological systems on a complete and harmonious scale

Riccardo Aucello; Simone Pernice; Dora Tortarolo; Raffaele A Calogero; Celia Herrera-Rincon; Giulia Ronchi; Stefano Geuna; Francesca Cordero; Pietro Lió; Marco Beccuti

PMC · DOI:10.1093/bioinformatics/btaf103·March 12, 2025

UnifiedGreatMod: a new holistic modelling paradigm for studying biological systems on a complete and harmonious scale

Riccardo Aucello, Simone Pernice, Dora Tortarolo, Raffaele A Calogero, Celia Herrera-Rincon, Giulia Ronchi, Stefano Geuna, Francesca Cordero, Pietro Lió, Marco Beccuti

PDF

Open Access

TL;DR

UnifiedGreatMod is a new modeling approach that integrates detailed and broad biological data to study complex systems like cancer and bacterial infections.

Contribution

Introduces UnifiedGreatMod, a holistic modeling paradigm combining multi-level biological data into a unified framework.

Findings

01

UnifiedGreatMod was used to simulate E. coli metabolism under nutrient changes and integrate gene expression data.

02

The model was applied to study luminal epithelial cell responses to C. difficile infection.

03

The framework supports both multi-level stable states and fluctuating conditions in biological systems.

Abstract

Computational models are crucial for addressing critical questions about systems evolution and deciphering system connections. The pivotal feature of making this concept recognizable from the biological and clinical community is the possibility of quickly inspecting the whole system, bearing in mind the different granularity levels of its components. This holistic view of system behaviour expands the evolution study by identifying the heterogeneous behaviours applicable, e.g. to the cancer evolution study. To address this aspect, we propose a new modelling paradigm, UnifiedGreatMod, which allows modellers to integrate fine-grained and coarse-grained biological information into a unique model. It enables functional studies by combining the analysis of the system’s multi-level stable states with its fluctuating conditions. This approach helps to investigate the functional relationships…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Species2

Escherichia coli Clostridioides difficile(species)

Diseases2

cancer Clostridium difficile infection

Figures2

Click any figure to enlarge with its caption.

A sketch of the proposed modelling paradigm based on the graphical Petri Net formalism (i.e. ESPN) for stating the harmonization between the mechanistic-based and the constrained-based models. The red colour is associated with the components of the ODEs derived from the ESPN, while the green one with the fluxes obtained from the FBA.

Analysis of C. difficile infection dynamics across different modelling settings. (A) Schematic representation of the CDI-gut system model architecture: illustration of interactions between C. difficile, intestinal epithelial cells (IECs), and gut metabolites during antibiotic treatment. (B) Comparative analysis of model behaviour: The left panels present the median trajectories of the Ablated model compared to the Unified model, while the right panels illustrate the same variables, colour-coded by three different parameter configuration sets. These visualizations highlight distinct behavioural patterns in bacterial population dynamics, IEC survival, and metabolic adaptation. (C) CDI-gut environment system analysis across experimental settings: violin plots with overlaid trajectories showing C. difficile population, leucine and tryptophan concentrations, and haem levels, demonstrating how different modelling approaches capture system heterogeneity and metabolic-dynamic coupling.

Equations1

Funding1

—Ministero dell’Univerisita’ e della Ricerca

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGene Regulatory Network Analysis · Scientific Computing and Data Management · Microbial Metabolic Engineering and Bioproduction

Full text

1 Introduction

Traditionally, biological problems have been approached using a reductionist perspective by thoroughly dissecting biological phenomena into individual components. However, with the advent of deep sequencing technologies, there has been an unprecedented opportunity to comprehensively characterize biological components simultaneously and gain a global sense of the systemic interactions that occur within a cell (Feist et al. 2009).

In computational biology, the pursuit of understanding complex biological systems has led to the development of diverse modelling approaches, each offering unique rationales and challenges. Constraint- and mechanism-based modelling have emerged as powerful tools for interpreting biological phenomena. Specifically, constrain-based approaches, such as Flux Balance Analysis (FBA) (Machado and Herrgård 2014, Opdam et al. 2017), can be employed to reconstruct genome-scale metabolic networks incorporating curated and systematized information about the known small metabolites and metabolic reactions of a cell type based on transcriptomics, proteomics, or metabolomics data (Bordbar et al. 2014). Furthermore, many studies have been trying to predict fluxes via transcriptomics and FBA or other constraint-based modelling techniques; recent approaches, such as Damiani et al. (2019), Di Filippo et al. (2021), and Galuzzi et al. (2022), map from expression profile to constraints for metabolic fluxes adjusting fluxes based on gene expression levels.

Among the available tools for FBA, the COBRA Toolbox is integral to the constraint-based modelling community, which focuses on open-source genome-scale metabolic models (GEMs). Noteworthy projects for high-throughput GEM generation include Path2Models (Büchel et al. 2013), AGORA (Magnúsdóttir et al. 2017), CarveMe (Machado et al. 2018), and BiGG Models (Norsigian et al. 2020). Moreover, human and microbial models and maps are available at the Virtual Metabolic Human website (Noronha et al. 2019), which collects human and gut metabolism data and links this information with nutrition.

These modelling approaches are efficient for the analysis of large-scale metabolic networks, but need detailed descriptions of molecular mechanisms, such as quantitative information, on molecular abundances and reaction kinetics. In contrast, fine-grained biological information is often modelled using systems of ordinary differential equations (ODEs), namely, mechanism-based approaches, to reproduce the dynamics of the system (Kendall et al. 1999). Although these approaches have the potential to reproduce reality more closely by investigating a biological phenomenon at a greater depth, they need help with scalability and parameterization issues, as they require a considerable amount of preliminary quantitative information, which limits their applicability to larger systems.

Recognizing the complementary nature of these approaches, there is growing interest in developing computational models that bridge the gap between constraint-based and mechanistic modelling paradigms.

An attempt to expand the possibilities of FBA is Dynamic Flux Balance Analysis (DFBA) proposed in Maranas and Zomorrodi (2016), Mahadevan et al. (2002), and Yang et al. (2019). DFBA represents a compromise between fully dynamic models—which cannot be simulated on a large scale—and steady state models—in which metabolite fluxes within a biological system remain constant over time and do not involve reaction kinetics (Palsson 2011, Maranas and Zomorrodi 2016). Its novelty lies in its ability to model metabolism under dynamic conditions, combining extracellular dynamics with intracellular steady states.

However, while DFBA allows investigating genome-scale networks under transient conditions, it cannot implement mechanistic knowledge of non-metabolic processes.

Recently, a collective effort has been to hybridize mechanism-based and constraint-based approaches. The goal is to tackle their limitations and merge their capabilities to decrease costs, improve efficiency, and perform more descriptive phenotype predictions.

A first insight was provided in Pernice et al. (2020a), where the authors proposed some initial indications on how this hybridization can be carried out through a high-level graphical formalism. In particular, the paper clearly shows how FBA can be exploited as a global source and sink of the system for specific sets of metabolites. Moreover, it discusses the possibility of using FBA in the network sub-models where there are missing kinetics and/or sloppy parameters and to close the original system.

Another notable example was the recent proof of concept of a hybrid ODE- and constraint-based model proposed in Ben Guebila and Thiele (2021), where an ODE-based glucose–insulin model was combined with a whole-body organ-resolved reconstruction of human metabolism to investigate Type 1 diabetes mellitus. However, the computational methodology and associated framework used in such work are peculiar to the example proposed.

Thus, to the best of our knowledge, a well-formalized methodology that is sufficiently general is still missing. To address such a gap in this work, we advance the hybridization of these two approaches by first formally defining the coupling between ODEs and FBA using a graphical meta-formalism and second by implementing this new modelling paradigm, namely ‘UnifiedGreatMod’, into the well-known open-source and general modelling framework GreatMod (Castagno et al. 2020) with the aim of providing a general-purpose, scalable, reproducible, and easy-to-use modelling tool, allowing researchers to study biological systems on a complete and harmonious scale.

Finally, it is important to emphasize that our framework is designed to work synergistically with the COBRA Toolbox, enabling users to seamlessly switch between or simultaneously utilize mechanistic-based and constraint-based models as needed. This interoperability ensures that users can effectively integrate existing COBRA models and tools with the models defined in ‘UnifiedGreatMod’, creating a cohesive and powerful modelling environment.

2 Methods

The flow of the paradigm involves the integration of heterogeneous data, considering different biological domains of a cell (such as genomics, transcriptomics, and metabolics data) or as distinct entities interacting with each other, including microbes interacting with host cells, facilitating their interchange within a holistic view of biological systems.

The harmonization, a central aspect of our paradigm, facilitates a dialogue among biological data and merges the computational aspects of the paradigm. Specifically, how the combination of mechanism and constraint-based models simulates the entire system to yield temporal evolution of key biological entities. This simulation enables the derivation of new insights into the systems under study.

‘UnifiedGreatMod’ aims to harmonize two different solution techniques: ODEs and FBA. This is achieved through a graphical formalism, which allows the modeller to state (i) the dynamic model represented by an ODEs system, (ii) the metabolic model studied by the FBA, and (iii) the coupling between the dynamic and metabolic models (see Fig. 1).

A sketch of the proposed modelling paradigm based on the graphical Petri Net formalism (i.e. ESPN) for stating the harmonization between the mechanistic-based and the constrained-based models. The red colour is associated with the components of the ODEs derived from the ESPN, while the green one with the fluxes obtained from the FBA.

2.1 Overview of the coupling definition

The innovative contribution of ‘UnifiedGreatMod’ regards the harmonization between the dynamic and metabolic models. Intuitively, the key to this harmonization is the identification of ‘metabolites’ and ‘reactions’ to be coordinated, assuming (i) the existence of a set of metabolites shared between the two models, representing the resources that might be produced or consumed by both models, and (ii) the existence of boundary reactions that allow the sharing of the respective metabolite between the models. The following describes the idea behind the harmonization step, while all the theoretical details are provided in the Supplementary Material S1.1.

As shown in the left-upper corner of Fig. 1, the ‘UnifiedGreatMod’ workflow can be summarized in the following three phases:

definition of the mechanistic-based model and identification of constraint-based model;identification of the reactions encoded in the constraint-based model (i.e. metabolic network) to exploit the harmonization by coordinating the two modelling techniques;simulation the unified system.

Notably, the ‘UnifiedGreatMod’ workflow allows for a flexible architecture that enables users to develop custom pre-processing steps tailored to their specific research needs, provided they adhere to the paradigm established mathematical formalism. The definition of coupling encompasses two steps that enable the complete integration of omics data: the definition of the mechanistic-based model and the identification of the constraint-based model. At these levels, pre-processing modules defined by users can operate within the formal constraints of the FBA and Petri net formalisms, ensuring mathematical consistency while maintaining biological relevance.

2.1.1 Phase 1—Step 1: definition of the mechanistic model

In this first step of the first phase, the user initially draws the mechanistic-based model using the graphical Petri Net (PN) formalism.

In detail, PNs (Marsan et al. 1995) and its generalizations, such as the Extended Stochastic PNs (ESPNs) (Pernice et al. 2020a), are a high-level mathematical formalism that exploits graphical elements to represent system components and their interactions. ESPN formalism has been extensively used to study biological systems (Castagno et al. 2020, Pernice et al. 2020b, Pernice et al. 2023), thanks to its ability to represent reaction systems in a natural graphical manner and provide qualitative and quantitative information on the system’s behaviour under study.

The basic components are (i) ‘places’, corresponding to state variables of the system (graphically represented as circles) and (ii) ‘transitions’, denoting events or activities (graphically represented as boxes). Places can contain ‘tokens’ drawn as black dots (namely marking of the place), representing the modelled entities of the system. Then, the number of tokens in each place defines the state of a PN, called ‘marking’. Places and transitions are connected by ‘arcs’, which express the relation between states and event occurrences. Moreover, by definition, ESPN can successfully integrate complex velocities functions in a unique model, and combine metabolic and regulatory networks (see section 2.1.4).

An example of ESPN is depicted in the first-top layer of Fig. 1, which is characterized by four places (named ‘P1’, ‘P2’, ‘P3’, and ‘P4’) and seven transitions (named ‘T1’, ‘T2’, ‘ $[eqn]$ ’, ‘ $[eqn]$ ’, ‘ $[eqn]$ ’, ‘ $[eqn]$ ’, and ‘ $[eqn]$ ’). For instance, the ‘T1’ transition is connected to two input places ‘P1’ and ‘P4’ and two output places (‘P2’ and ‘P3’). Thus, it can change the marking of the four places through its ‘firing’, i.e. by removing tokens from ‘P1’ and ‘P4’, and adding tokens to ‘P2’ and ‘P3’ with a firing intensity defined by the Mass Action law: $[eqn]$ . Where $[eqn]$ is the kinetic constant of the transition and $[eqn]$ are the number of tokens at time $[eqn]$ in the places $[eqn]$ and $[eqn]$ , respectively.

As shown in Pernice et al. (2020a), starting from an ESPN model, it is possible to derive the system of ODEs describing how the number of tokens changes over time in each place, as graphically represented by the red dashed arrow in Fig. 1. For instance, if we consider the first ODE in the rectangle pointed by the red dashed arrow, its solution describes the evolution in the number of tokens in the place ‘P1’ over time.

Since ‘P1’ has only one transition (‘T1’) that can alter its token count, specifically by removing tokens with an intensity defined as described in the previous paragraph, the changes in token dynamics can be explicitly modeled through this equation.

The same for the remaining three places.

For the model construction and automatic derivation of the low-level mathematical processes characterizing the system dynamics, we exploited and appropriately extended ‘GreatMod’, a general modelling framework, to simulate complex biological systems using an intuitive graphical interface. ‘GreatMod’ is composed of three main modules: (i) a Java GUI based on Java Swing Class, called GreatSPN (Amparore et al. 2016), which allows the user to draw models using PN formalism and its extensions, (ii) the R library epimod (Castagno et al. 2020), and (iii) Docker containerization, a lightweight OS-level virtualization. Further details are reported in the Supplementary Material S1.1.4.

Regarding the parametrization of the mechanistic model, various omics data (e.g. transcriptomics, proteomics) can be integrated to help define reaction activities, map cellular processes under specific conditions, and adjust reaction constraints appropriately. Specifically, transcriptomics data can provide information on the initial values of biological entities or rate constants in the mechanistic model. For instance, gene-protein-reaction (GPR) associations can translate transcriptomics or proteomics data into reaction activities within the network (Di Filippo et al. 2021, Zampieri et al. 2023). Furthermore, omics data can suggest including specific pathways or biological relationships, ensuring the model captures the full range of dynamics in the system under study.

2.1.2 Phase 1—Step 2: identification of the metabolic network

For metabolic network identification, the corresponding model can be encoded in MAT-file (‘.mat’) format as defined by the COBRA Toolbox (Heirendt et al. 2017). Then, the constrained-based modelling approach, called FBA, can be used to study metabolic networks by analysing the flow of metabolites.

A linear programming solver is thus used to find the flux distribution $[eqn]$ of all the $[eqn]$ reactions that maximize or minimize the objective function $[eqn]$ while satisfying two main constraints: (i) the steady-state assumption $[eqn]$ , and (ii) each flux must take value between $[eqn]$ and $[eqn]$ , which are the lower and upper constraints of the $[eqn]$ -th flux. Let us denote $[eqn]$ , the set of all the fluxes $[eqn]$ .

To facilitate the integration of a constrained-based model into our framework, we developed the ‘epimod_FBAfunctions’ R package. This package offers essential functions to read and modify MAT files, translating metabolic network representations into the ‘GreatMod’ internal format. In particular, it is possible to integrate gene expression data and dietary inputs to set the flux constraints properly or define substrate availability (Cooke et al. 2024). Additionally, metabolomics data can play a complementary role by directly influencing reaction bounds, substrate availability, and the initial concentrations of metabolites. We have implemented a strategy to integrate metabolomics and gene expression data. Our computational pipeline can (i) compute differential reaction activities from transcriptomics to predict whether the differential expression of metabolic enzymes directly causes differences in metabolic fluxes and (ii) constrain boundary reactions based on the concentration of metabolites resulting from a given dietary input.

The critical aspect of integrating transcriptomics data lies in selecting the integration method. The package ‘epimod_FBAfunctions’ implements the integration process presented in Kashaf et al. (2017). This method evaluates the GPR rules associated with each reaction to derive reaction expression values from gene expression values.

2.1.3 Phase 2—Step 1: identification of the reactions

The first step of the second phase aims to identify a set of boundary reactions, also known as exchange or transport reactions, $[eqn]$ , so each corresponding flux can be associated with a velocity in the ODEs system, representing the exchange of such metabolites. These reactions define the input and output of metabolites to and from the metabolic network in FBA. These reactions are crucial for accurately modelling the system because they set limits on the availability and removal of metabolites.

First, we need to filter the set of all the fluxes of the FBA model to consider reactions that can be used to define $[eqn]$ . Specifically, we strongly suggest using only the boundary reactions for two reasons: (i) to maintain consistency with the fundamentals of our paradigm and (ii) not to violate the balance assumption of the FBA model. Our new paradigm has to connect two modelling approaches with their assumptions and rules, simulating two different system layers that share a given number of resources (e.g. metabolites). In particular, the FBA model is constrained by the system’s steady state, which supposes no changes in any metabolite concentration. Therefore, by defining a single ODE that models the concentration over time of a metabolite also described in the FBA model, it is straightforward that the steady-state assumption of the FBA model does not hold anymore. Differently, the boundary reactions represent by definition the connection of the metabolic model with an extracellular environment, delineating in such a manner the perfect bridge between an ODE representing the shared metabolite outside the metabolic model, which acts as its resource by modifying dynamically the reaction constraints, respectively.

Among the boundary reactions, the ones with a greater sensibility to perturbations in their constraints play a central role in connecting the two modelling approaches. Given that the system of ODEs is connected to the FBA model in terms of the fluxes constraints, it is straightforward to consider reactions where small perturbations in the concentration of the shared metabolite affect the FBA solution. Otherwise, the two approaches could be solved independently without dynamic communication, which might improve and change the analysis. Therefore, we suggest focusing the modelling efforts on high-sensitive reaction fluxes to minimize the poor call of such boundaries and reduce the likelihood of errors. We focus on fluxes for which a small perturbation entails a substantial variation of the FBA model outcomes (e.g. the objective function value). Specifically, by exploiting Sobol’s variance-based Sensitivity Analysis (SA) (Saltelli et al. 2010), a powerful and widely used method to determine the contribution of input variables to the output variability of a mathematical model, it is possible to detect automatically critical fluxes in genome-scale metabolic, which is particularly useful for understanding how changes in the fluxes constraints influence the objective function. In this context, we developed in ‘epimod_FBAfunctions’ package functionality to perform the SA considering a constrained-based model.

Further details regarding the definition of $[eqn]$ and the definition of the SA approach are reported in the Supplementary Material S1.3 and S1.1.3, respectively.

2.1.4 Phase 2—Step 2: harmonization

Once $[eqn]$ is defined, the next step is defining how to coordinate the two modelling approaches.

The general idea is that the concentrations obtained from solving the ODEs could be used as flux constraints because they represent the availability of the metabolites in the environment and, therefore, the uptake limit. Usually, the upper constraint is set only if a growth limit has to be defined. Otherwise, it should be infinity (see the Supplementary Material S1.4).

Thus, given an ESPN model, we consider $[eqn]$ the set of all transitions (called general transitions) whose firing intensities are defined as continuous real functions, and not through the Mass Action law. Let us refine $[eqn]$ into two disjoint subsets, $[eqn]$ and $[eqn]$ , so that (i) $[eqn]$ and (ii) $[eqn]$ if only if its intensity is obtained by solving the constrained-based model.

Let us observe that, by definition, the transition intensity has to be a positive number, therefore if reversible reactions are considered in $[eqn]$ , then they are decoupled into irreversible reaction pairs, defining two different transitions, denoted with $[eqn]$ if the estimated flux is negative and $[eqn]$ if it is positive. Thus, we can infer that

[eqn]

where, $[eqn]$ is the firing intensity associated with the general transition $[eqn]$ depending on the marking of its inputs places $[eqn]$ at time $[eqn]$ , $[eqn]$ the general transition representing the flux $[eqn]$ , and $[eqn]$ is the indicator function that returns 1 only if the input condition is satisfied.

As depicted with the green colour in Fig. 1, $[eqn]$ is defined by three fluxes $[eqn]$ associated, respectively, with the boundary reactions $[eqn]$ . Let us observe that these three reactions were selected since they represent the exchange of the metabolites represented by the places $[eqn]$ , $[eqn]$ , and $[eqn]$ , respectively. Therefore, for each reaction, one or two general transitions are defined depending on whether it is a reversible reaction or not. For instance, the reaction $[eqn]$ is associated with two general transitions $[eqn]$ and $[eqn]$ , whose intensity will depend on the sign of $[eqn]$ obtained by solving the FBA.

Let us observe that the constraints of the shared fluxes are defined as functions $[eqn]$ and $[eqn]$ (lower and upper bounds, respectively) that depend on the solution of the system of ODEs thought $[eqn]$ , i.e. the input places of the transitions in $[eqn]$ . For instance, they might be defined as a linear transformation of the respective exchanged metabolites concentration (i.e. $[eqn]$ ) when $[eqn]$ .

2.1.5 Phase 3: simulation

Finally, the simulation technique integrates the solution of the system of ODEs, which is based on the Backward Differentiation Formula method (Burden et al. 2016), with the resolution of the Linear Programming Problem (The Backward Differentiation Formula method is implemented using the C++ LSODA library (https://en.smath.com/view/lsoda), while the optimization problem using the glpk library (https://www.gnu.org/software/glpk/).

Specifically, this approach relies on a threshold criterion that compares the elements of the ODEs affecting the FBA (i.e. some lower and upper bounds might change) at the two consecutive solution times. Consequently, the FBA is performed with the new constraints solely when notable alterations occur enabling the ODEs to be adjusted to reflect substantial changes in the FBA model dictated by flux constraints and input specifications. This improves the model’s efficiency by avoiding superfluous computations. Therefore, the solution of the model emerges from two closely linked resolutions of the ODEs and FBA (we refer for formal details to the Supplementary Material S1.5 and to Supplementary Material S1.4 for the performance analysis).

An example of the final system of ODEs is depicted in Fig. 1, in which both the components of the mechanistic-based (in red) and constrained-based (in green) models are present.

3 Results

The use of deep sequencing technology to analyse microbiome profiles across different tissues has prompted the microbiology community to expand their research objectives to include the functional aspects of microorganisms integrating heterogeneous data. For this reason, we propose two applications of ‘UnifiedGreatMod’ in this context.

The first application is related to the carbon catabolite repression phenomenon exhibited by E. coli (Ammar et al. 2018), with all details provided in the Supplementary Materials.

The second application, described hereafter, investigates the interaction between Clostridioides difficile and intestinal epithelial cells (IECs).

Definition of mechanistic and constraint-based models

Clostridioides difficile is a gram-positive bacteria that causes infections typically affecting older adults in hospitals or long-term care. Symptoms of the clinical disease can range from diarrhoea to life-threatening damage to the colon. During infection, C. difficile colonizes the large intestine and causes toxin-mediated disruption of the intestinal epithelial barrier function, which results in increased intestinal permeability and triggers haemorrhage (Kouhsari et al. 2018). Recently, evidence has shown that chronic C. difficile infection contributes to colorectal tumorigenesis (Drewes et al. 2022). To model the interaction between C. difficile and IECs during infection, six interacting biological domains have been considered: (i) C. difficile ‘metabolic network’ incorporates all metabolic reactions of the C. difficile; (ii) C. difficile ‘cell dynamics’ models the proliferation response of a bacteria population upon treatment with antimicrobial therapy; (iii) ‘metronidazole action’ describes the mechanism of action of a drug driving an imbalance between oxidative and antioxidative processes, inducing oxidative stress and cell death; (iv) ‘intestinal lumen’ models the toxin-mediated inflammation and the dietary intake of nutrients; (v) ‘intestinal epithelial cells’ describes the dynamics of the IEC and the absorption of nutrients, and (vi) ‘blood vessels’ describes the transepithelial amino acids transported into the circulation following protein digestion and absorption. The detailed biological description of all modules is provided in Supplementary Material S3.1. The model’s parameterization is primarily based on values extracted from the literature, except for ‘Death4Treat’, ‘Detox’, and ‘IECsDeath’. These parameters are explored using Partial Rank Correlation Coefficients (Fornari et al. 2015) to observe changes in the overall model dynamics (see Supplementary Fig. S5). Further details are available in Supplementary Material S1.2.1. Additionally, Supplementary Material S2 provides a detailed example of how parameters can be derived from transcriptomic data.

Model component identification and harmonization

Harmonizing the low-level ODEs and the low-level metabolic model (exchanges between the external environment and the metabolic model) involves identifying the critical fluxes through Sobol’s variance-based SA (Saltelli et al. 2010). The SA method indicates which transitions in our model have the highest influence on the output.

The process of selection relies on the identification of the metabolic model’s most sensitive exchange reactions (see the Supplementary Material S1.1.3). We extrapolated sensitivity coefficients for exchange reactions and concentrated on the most sensitive reactions. This approach minimizes the impact of poor boundary selection. A filtering threshold was applied, where reactions associated with sensitivity coefficients below 0.01 (this threshold was arbitrary) were considered negligible. The final selection included ‘EX_leu_L_e’ and ‘EX_trp_L_e’, which are the most sensitive reactions that are also aligned with the biological question of the model.

To better capture all dynamics biologically related to the antibiotic-resistant mechanism, (Karasawa et al. 1995, Knippel et al. 2020), we also consider the transitions ‘EX_pro_L_e’, ‘EX_ile_L_e’, ‘EX_val_L_e’, ‘EX_cys_L_e’, and ‘sink_pheme_c’. These reactions are split into ‘in’ and ‘out’ components to retain direction information.

For a detailed formal description of the harmonization process, we refer to the Supplementary Material S1.2.

Modelling antibiotic therapy across experimental settings and conditions

The coupled model runs simulations to predict the system behaviour under different experimental settings and conditions. To assess the significance of ‘UnifiedGreatMod’, we proposed three experiment settings. In the first, called ‘ablation experiment’, the dynamics of the metabolic environment are computed by FBA only at the beginning of the simulation. In the second experiment, named ‘partial-ablation experiment’, the FBA is computed when an external stimulus occurs, i.e. at any drug injection. In the last experiment, namely ‘unified experiment’, the FBA model is continuously solved for the state of the ODEs system (Fig. 2A).

Analysis of C. difficile infection dynamics across different modelling settings. (A) Schematic representation of the CDI-gut system model architecture: illustration of interactions between C. difficile, intestinal epithelial cells (IECs), and gut metabolites during antibiotic treatment. (B) Comparative analysis of model behaviour: The left panels present the median trajectories of the Ablated model compared to the Unified model, while the right panels illustrate the same variables, colour-coded by three different parameter configuration sets. These visualizations highlight distinct behavioural patterns in bacterial population dynamics, IEC survival, and metabolic adaptation. (C) CDI-gut environment system analysis across experimental settings: violin plots with overlaid trajectories showing C. difficile population, leucine and tryptophan concentrations, and haem levels, demonstrating how different modelling approaches capture system heterogeneity and metabolic-dynamic coupling.

Moreover, system dynamics are also responsive according to different systems conditions, where a condition is represented by given systems parameters configuration selected after a calibration step aimed to (i) minimize the IECs damage, (ii) keep the drug concentration closed to the minimum inhibitory concentration value, and (iii) minimize the number of C. difficile cells without complete eradication.

In the unified setting, in a condition of limited drug efficacy, the successful colonization by C. difficile leads to an increment in the number of C. difficile cells after the completion of therapy injections. Consequently, IECs die, the gut lining is damaged, and, following extravasation, red blood cells undergo lysis, which increases heame concentration in the gut micro-environment Fig. 2B (left panels). Furthermore, only the Unified experiment reveals the presence of a haeme-mediated spectrum of antibiotic-resistant strains, ranging from fully susceptible to partially susceptible and fully resistant populations. The model results illustrate how a gradient of conditions fosters diverse antibiotic resistance types (Fig. 2B).

Three representative parameter configurations were identified based on median trajectories of $[eqn]$ . These configurations (Sets 1–3; parameter values in Supplementary Table S3) demonstrate distinct infection phenotypes: Set 1 exhibits the highest bacterial growth and IEC death rate, indicating strong antibiotic resistance, Set 2 shows intermediate behaviour, while Set 3 displays controlled bacterial growth and enhanced IEC survival suggesting higher antibiotic susceptibility (Fig. 2B, right panels).

The drug therapy is administered as injections at specific time intervals (8, 16, 24, 32, 40, and 48 h). In the Unified scenario, a decrease in C. difficile count is observed following each drug injection. Meanwhile, the number of IECs declines due to the lethal effects of the infection, leading to an increase in intracellular haem concentration. A comprehensive overview of the model results across all locations is provided in Supplementary Fig. S4.

The comparison across experimental settings reveals different trajectories in the evolution of the system (Fig. 2C). For instance, in all three settings, the number of IECs changes linearly with the IECsDeath rate, representing the degree of infection virulence. However, the impact of the Unified model becomes particularly evident when analysing cell numbers: as cell death increases, the decline in IEC count is significantly more pronounced in the Unified model compared to the Ablated settings. Additionally, distinct behavioural differences emerge in each experimental setting when examining the trajectories of leucine, tryptophan, and intracellular haem (Fig. 2C).

4 Discussion

Computational modelling is becoming increasingly crucial to formulate new hypotheses and provide plausible explanations of how a real system works. In-silico models may help uncover how system components interact with each other and how they are related to wider biological effects. However, new computational theoretical and practical achievements are needed to model how cell dynamics are intertwined with the mechanics of cell signalling, metabolism, and the influence of extracellular factors.

Here, we propose ‘UnifiedGreatMod’ based on open-source and general framework to integrate formal concepts and different solution techniques and offer the possibility of relating multi-level dynamics. ‘UnifiedGreatMod’ exploits a single easy-to-use graphical meta-formalism ensuring result reproducibility. The most innovative aspect of ‘UnifiedGreatMod’ is the possibility to continuously compute the evolution of the systems considering, at the same time, all the reactions involved in the metabolism, and changes in the micro-environment. Due to its clinical importance as a nosocomial disease and an emerging tumorigenic pathogen, ‘UnifiedGreatMod’ was exploited to model C. difficile infection in the human gut as a case study.

In the gut, the growth medium composition significantly affects C. difficile’s toxin production. These toxins compromise the epithelial cell barrier, causing apoptosis in IECs. Toxin synthesis regulation is a complex bacterial response to specific nutrient availability during infection. Clostridioides difficile virulence is tightly linked to nutrient availability; when nutrients are limited, the bacteria produce toxins. These toxins release host-derived nutrients through cytotoxic activity. When IECs die, they release their intracellular contents, including nutrients such as amino acids, into the gut environment. These nutrients then become available for the bacteria.

Leucine is a key metabolite of C. difficile Stickland fermentation, i.e. it can be used to generate ATP through its paired oxidation and reduction with other amino acids. It has been shown that leucine is one of the main energy sources during bacterial exponential growth and that its pools are depleted before the onset of the stationary phase (Hofmann et al. 2018). The role of tryptophan in the growth of C. difficile is not explicitly defined in the literature. However, C. difficile cannot synthesize tryptophan, as suggested by Neumann-Schaal et al. (2019). It would need to obtain this amino acid from the environment, which could influence its growth dynamics.

The comparison among ablated, partially ablated, and unified experiments demonstrated that simulations depict the connection between nutrient availability in the gut and bacterial growth state during the disease. In all scenarios, tryptophan and leucine dynamics are associated with the model parameter representing the cytotoxic capacity of the bacterium upon IECs. This parameter reflects the degree of toxic gene regulation by nutrient-sensing transcriptional repressors, which are depressed with low levels of environmental nutrients (Theriot and Fletcher 2019).

Nevertheless, by dynamically computing the FBA throughout the entire simulation, we can uncover mechanisms of antibiotic resistance. This mechanism may promote chronic infection by C. difficile, even when metronidazole treatment is ongoing.

Antibiotic injections reduce the number of bacteria at drug administration. Ablation experiments indicate that the model shows substantial variations in haem content at comparable equivalent levels of cellular damage. In contrast, the latest experiments better simulate the system’s behaviour by representing a greater richness, showing C. difficile acquisition of haem and the ability to withstand oxidative stress are crucial factors of antibiotic resistance. Intermediate susceptibility states are visible from the C. difficile dynamics. Highly susceptible bacteria are cleared, preserving IEC barrier integrity and reducing nutrient leakage, while highly resistant bacteria kill IECs, saturating the environment with nutrients. The unified model fanned out trajectories within the intermediate antibiotic resistance category, reflecting real bacterial populations, which typically display a range of antibiotic susceptibilities rather than discrete categories. This gradient allows the model to show how intermediate types are selected and how to establish antibiotic resistance. Additionally, the two FBA integrated experiments capture the dynamic consumption of metabolites due to oscillations in the bacterial population with varying detail.

In conclusion, these results indicate that this new paradigm, harmonizing dynamic and metabolic models, can significantly enhance personalized healthcare in the context of dynamic treatment and patient monitoring. ODEs allow for the precise modelling of temporal changes in biological systems, capturing the dynamic behaviour of disease progression and treatment response. Meanwhile, FBA enables the optimization of metabolic pathways by analysing the flow of metabolites through networks, ensuring that cellular processes are efficiently managed. By integrating these two powerful approaches, healthcare providers can develop highly tailored therapeutic strategies that are continuously adjusted based on real-time patient data. In the current implementation of UnifiedGreatMod, users must define and link the FBA problem to the ODE system at the beginning of the simulation. While this setup allows users to manually modify the code to implement alternative strategies beyond the classical FBA approach, future updates could improve usability by incorporating modular new functionalities. These enhancements will enable users to choose the integration module suitable for the aim of the study. In addition, the user could also define new integration methods to combine bio-omics data without needing significant manual modifications.

Supplementary Material

btaf103_Supplementary_Data

Bibliography39

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Ammar EM , Wang X, Rao CV et al Regulation of metabolism in Escherichia coli during growth on mixtures of the non-glucose sugars: arabinose, lactose, and xylose. Sci Rep 2018;8:609.29330542 10.1038/s 41598-017-18704-0PMC 5766520 · doi ↗ · pubmed ↗
2Amparore E, Balbo G, Beccuti M et al 30 years of great SPN, London: Springer Series in Realibility Engineering, 2016, 227–54.
3Ben Guebila M , Thiele I. Dynamic flux balance analysis of whole-body metabolism for type 1 diabetes. Nat Comput Sci 2021;1:348–61.38217214 10.1038/s 43588-021-00074-3 · doi ↗ · pubmed ↗
4Bordbar A , Monk JM, King ZA et al Constraint-based models predict metabolic and associated cellular functions. Nat Rev Genet 2014;15:107–20.24430943 10.1038/nrg 3643 · doi ↗ · pubmed ↗
5Burden RL , Feires JD, Burden AM. Numerical Analysis, 10th edn. Boston: Cengage Learning, 2016.
6Büchel F , Rodriguez N, Swainston N et al Path 2models: large-scale generation of computational models from biochemical pathway maps. BMC Syst Biol 2013;7:116.24180668 10.1186/1752-0509-7-116PMC 4228421 · doi ↗ · pubmed ↗
7Castagno P , Pernice S, Ghetti G et al A computational framework for modeling and studying pertussis epidemiology and vaccination. BMC Bioinformatics 2020;21:344.32938370 10.1186/s 12859-020-03648-6PMC 7492136 · doi ↗ · pubmed ↗
8Cooke J , Delmas M, Wieder C et al Genome scale metabolic network modelling for metabolic profile predictions. P Lo S Comput Biol 2024;20:e 1011381.38386685 10.1371/journal.pcbi.1011381 PMC 10914266 · doi ↗ · pubmed ↗