An Integrative Genome-Scale Metabolic Modeling and Machine Learning Framework for Predicting and Optimizing Single-Cell Protein Production in Saccharomyces cerevisiae
Neha K. Nair, Aaron D'Souza

TL;DR
This paper introduces a comprehensive computational framework combining genome-scale metabolic modeling, machine learning, and optimization to predict and improve single-cell protein production in Saccharomyces cerevisiae.
Contribution
It integrates multiple advanced computational techniques to accurately predict biomass flux and optimize conditions for SCP yield, demonstrating significant improvements.
Findings
Random Forest and XGBoost achieved near-perfect R2 scores (>0.9999) in flux prediction.
Bayesian optimization increased biomass flux by 12.13-fold, reaching 1.041 gDW/hr.
SHAP analysis identified key metabolic reactions critical for SCP production.
Abstract
Saccharomyces cerevisiae is increasingly recognised as a key source for single-cell protein (SCP) production, a rising solution to global protein-supply challenges. This study presents a computational framework combining the Yeast9 genome-scale metabolic model (GEM) with machine learning and optimisation to predict and enhance biomass flux for SCP yield. The Yeast9 GEM, comprising 4,131 reactions, 2,806 metabolites, and 1,161 genes, was simulated using flux balance analysis (FBA) across 2,000 Latin Hypercube-sampled flux profiles. Random Forest and XGBoost regressors achieved R2 values of 0.9999760 and 0.9997702, respectively. A variational autoencoder (VAE) identified four metabolic clusters with mean biomass fluxes of 0.472, 0.493, 0.527, and 0.505 gDW/hr. SHAP-based feature attribution identified twenty key reactions in glycolysis, the TCA cycle, and amino-acid biosynthesis; 18/20…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
