Asset Management in Machine Learning: A Survey
Samuel Idowu, Daniel Str\"uber, and Thorsten Berger

TL;DR
This survey reviews 17 tools for managing ML assets, highlighting the reliance on traditional version control and the need for more granular asset management features in ML development.
Contribution
It provides a systematic overview of current ML asset management tools, identifying gaps and emphasizing the importance of specialized support for ML assets.
Findings
Most tools rely on traditional version control systems.
Few tools differentiate between key ML assets like datasets and models.
The survey highlights the need for more granular asset management features.
Abstract
Machine Learning (ML) techniques are becoming essential components of many software systems today, causing an increasing need to adapt traditional software engineering practices and tools to the development of ML-based software systems. This need is especially pronounced due to the challenges associated with the large-scale development and deployment of ML systems. Among the most commonly reported challenges during the development, production, and operation of ML-based systems are experiment management, dependency management, monitoring, and logging of ML assets. In recent years, we have seen several efforts to address these challenges as witnessed by an increasing number of tools for tracking and managing ML experiments and their assets. To facilitate research and practice on engineering intelligent systems, it is essential to understand the nature of the current tool support for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
