Automated Cache for Container Executables
Vanessa Sochat, Matthieu Muffato, Audrey Stott, Marco De La Pierre,, and Georgia Stuart

TL;DR
This paper introduces automated methods for discovering and deriving important executables within containers, enhancing container registry management and user experience in high-performance computing environments.
Contribution
It presents new automation techniques for identifying key executables in containers and expands the container set by over 8,000 entries for improved management.
Findings
Added over 8,000 containers from BioContainers
Automated discovery of important executables in containers
Software is publicly available on GitHub
Abstract
Linux container technologies such as Docker and Singularity offer encapsulated environments for easy execution of software. In high performance computing, this is especially important for evolving and complex software stacks with conflicting dependencies that must co-exist. Singularity Registry HPC ("shpc") was created as an effort to install containers in this environment as modules, seamlessly allowing for typically hidden executables inside containers to be presented to the user as commands, and as such significantly simplifying the user experience. A remaining challenge, however, is deriving the list of important executables in the container. In this work, we present new automation and methods that allow for not only discovering new containers in large community sets, but also deriving container entries with important executables. With this work we have added over 8,000 containers…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Distributed and Parallel Computing Systems · Cloud Computing and Resource Management
