Empirical Characterization of Logging Smells in Machine Learning Code
Patrick Loic Foalem, Leuson Da Silva, Foutse Khomh, Ettore Merlo, Heng Li

TL;DR
This paper empirically investigates common bad logging practices, known as logging smells, in machine learning systems by analyzing open-source repositories and surveying ML engineers to improve logging effectiveness.
Contribution
It systematically characterizes logging smells in ML code through large-scale repository mining and practitioner surveys, providing new insights into logging practices in ML development.
Findings
Identification of recurring logging smells in ML systems
Assessment of perceived severity and relevance of these smells by practitioners
Empirical data on logging practices in open-source ML projects
Abstract
\underline{Context:} Logging is a fundamental yet complex practice in software engineering, essential for monitoring, debugging, and auditing software systems. With the increasing integration of machine learning (ML) components into software systems, effective logging has become critical to ensure reproducibility, traceability, and observability throughout model training and deployment. Although various general-purpose and ML-specific logging frameworks exist, little is known about how these tools are actually used in practice or whether ML practitioners adopt consistent and effective logging strategies. To date, no empirical study has systematically characterized recurring bad logging practices--or logging smells--in ML System. \underline{Goal:} This study aims to empirically identify and characterize logging smells in ML systems, providing an evidence-based understanding of how…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware System Performance and Reliability · Software Engineering Research · Scientific Computing and Data Management
