Effective Removal of Operational Log Messages: an Application to Model Inference
Donghwan Shin, Domenico Bianculli, Lionel Briand

TL;DR
This paper introduces LogCleaner, a technique that effectively removes operational log messages to improve the accuracy and efficiency of system model inference, demonstrated on multiple datasets.
Contribution
LogCleaner is a novel method combining periodicity and dependency analysis to accurately filter operational messages from logs, enhancing model inference quality.
Findings
Removes 98% of operational messages on average
Preserves 81% of transactional messages
Speeds up model inference and improves accuracy significantly
Abstract
Model inference aims to extract accurate models from the execution logs of software systems. However, in reality, logs may contain some "noise" that could deteriorate the performance of model inference. One form of noise can commonly be found in system logs that contain not only transactional messages---logging the functional behavior of the system---but also operational messages---recording the operational state of the system (e.g., a periodic heartbeat to keep track of the memory usage). In low-quality logs, transactional and operational messages are randomly interleaved, leading to the erroneous inclusion of operational behaviors into a system model, that ideally should only reflect the functional behavior of the system. It is therefore important to remove operational messages in the logs before inferring models. In this paper, we propose LogCleaner, a novel technique for removing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware System Performance and Reliability · Software Reliability and Analysis Research · Software Engineering Research
