An Automatic Pipeline for the Integration of Python-Based Tools into the Galaxy Platform: Application to the anvi'o Framework
Fabio Cumbo, Jayadev Joshi, Daniel Blankenberg

TL;DR
This paper introduces an automated method to convert Python command-line tools into Galaxy-compatible wrappers by parsing argparse interfaces, demonstrated on the anvi'o bioinformatics platform, significantly streamlining tool integration.
Contribution
The authors developed a novel, automated approach that generates Galaxy tool wrappers directly from Python argparse interfaces, reducing manual effort and errors in tool integration.
Findings
Successfully applied to the anvi'o platform with hundreds of tools
Generated functional Galaxy wrappers automatically from argparse definitions
Reduced manual effort and increased scalability of tool integration
Abstract
The integration of command-line tools into the Galaxy platform is crucial for making complex computational methods accessible to a broader audience and ensuring reproducible research. However, the manual development of tool wrappers is a time-consuming, error-prone, and knowledge-intensive process. This bottleneck significantly affects the rapid deployment of new and updated tools, creating a gap between tool development and its availability to the scientific community. We have developed a novel, automated approach that directly translates Python tool interfaces into Galaxy-compliant tool wrappers. Our method leverages the argparse library, a standard for command-line argument parsing in Python. By embedding structured metadata within the metavar attribute of input and output arguments, our system programmatically parses the tool's interface to extract all necessary information. This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Genomics and Phylogenetic Studies · Biomedical Text Mining and Ontologies
