A Major Update and Improved Validation Functionality in the mwtab Python Library and the Metabolomics Workbench File Status Website
P. Travis Thompson, Hunter N. B. Moseley

TL;DR
This paper describes major updates to the mwtab Python library and the Metabolomics Workbench website to improve validation and data curation for metabolomics datasets.
Contribution
The paper introduces enhanced validation features and improved error handling in the mwtab library and its associated website.
Findings
The mwtab package now supports better error handling and parsing for batch processing.
All datasets in the Metabolomics Workbench were evaluated using the improved validation features.
The mwFileStatusWebsite was updated to align with the new mwtab package features.
Abstract
Background: The Metabolomics Workbench (MW) is a public scientific data repository consisting of experimental data and metadata from metabolomics studies collected with mass spectroscopy (MS) and nuclear magnetic resonance (NMR) analyses. Although not as rapidly as in the past, MW has steadily evolved, updating its mwTab and JSON deposition text file formats and its web-based infrastructure. However, the growth of MW has been exponential since its inception in 2013 and continues to be exponential, with the number of datasets hosted on the repository increasing by 50% since April 2024. As part of regular maintenance to keep up with changes to the mwTab file format and an earnest effort to use MW datasets in meta-analyses, the mwtab Python package has been updated. Methods: Updates include better error handling for batch processing, better parsing to read more files without error, and…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMetabolomics and Mass Spectrometry Studies · Cell Image Analysis Techniques · Advanced Proteomics Techniques and Applications
