A generalizable framework for unlocking missing reactions in genome-scale metabolic networks using deep learning
Xiaoyi Liu, Hongpeng Yang, Chengwei Ai, Ruihan Dong, Yijie Ding,, Qianqian Yuan, Jijun Tang, and Fei Guo

TL;DR
CLOSEgaps is a deep learning framework that models metabolic networks as hypergraphs to automatically identify and fill missing reactions, significantly improving the accuracy of genome-scale metabolic models without relying on experimental data.
Contribution
It introduces a novel hypergraph-based deep learning approach for automatic gap-filling in GEMs, capable of characterizing both known and hypothetical reactions.
Findings
Achieves over 96% accuracy in filling artificially introduced gaps
Improves phenotypic prediction accuracy across 24 GEMs
Enhances production of key metabolites in two organisms
Abstract
Incomplete knowledge of metabolic processes hinders the accuracy of GEnome-scale Metabolic models (GEMs), which in turn impedes advancements in systems biology and metabolic engineering. Existing gap-filling methods typically rely on phenotypic data to minimize the disparity between computational predictions and experimental results. However, there is still a lack of an automatic and precise gap-filling method for initial state GEMs before experimental data and annotated genomes become available. In this study, we introduce CLOSEgaps, a deep learning-driven tool that addresses the gap-filling issue by modeling it as a hyperedge prediction problem within GEMs. Specifically, CLOSEgaps maps metabolic networks as hypergraphs and learns their hyper-topology features to identify missing reactions and gaps by leveraging hypothetical reactions. This innovative approach allows for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMicrobial Metabolic Engineering and Bioproduction · Bioinformatics and Genomic Networks · Machine Learning in Bioinformatics
