Estimate Metabolite Taxonomy and Structure with a Fragment-Centered Database and Fragment Network
Hansen Zhao, Xu Zhao, Huan Yao, Jiaxin Feng, Sichun Zhang, Xinrong, Zhang

TL;DR
This paper introduces MSFragDB, a fragment-centered database, and a fragment network approach to improve metabolite structure identification in mass spectrometry, addressing coverage gaps and spectrum variability.
Contribution
The work presents a novel fragment-based database and network model that enhance metabolite identification accuracy and taxonomy estimation, even with incomplete or contaminated spectra.
Findings
MSFragDB has higher hit ratio than existing databases.
The fragment network effectively estimates chemical structures.
The approach improves identification despite spectrum contamination.
Abstract
Metabolite structure identification has become the major bottleneck of the mass spectrometry based metabolomics research. Till now, number of mass spectra databases and search algorithms have been developed to address this issue. However, two critical problems still exist: the low chemical component record coverage in databases and significant MS/MS spectra variations related to experiment equipment and parameter settings. In this work, we considered the molecule fragment as basic building blocks of the metabolic components which had relatively consistent signatures in MS/MS spectra. And from a bottom-up point of view, we built a fragment centered database, MSFragDB, by reorganizing the data from the Human Metabolome Database (HMDB) and developed an intensity-free searching algorithm to search and rank the most relative metabolite according to the users' input. We also proposed the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMetabolomics and Mass Spectrometry Studies · Advanced Proteomics Techniques and Applications · Microbial Metabolic Engineering and Bioproduction
