Leveraging Code Clones and Natural Language Processing for Log Statement Prediction
Sina Gholamian

TL;DR
This paper presents a novel approach that combines code clone detection and natural language processing to automate the prediction of log statement placement and content in software source code, improving over prior manual methods.
Contribution
It introduces a clone-based method for predicting log locations and descriptions, leveraging source code clones and NLP to enhance automation in logging practices.
Findings
Effective log location prediction using code clones
Improved log description prediction with NLP models
Outperforms prior log prediction approaches
Abstract
Software developers embed logging statements inside the source code as an imperative duty in modern software development as log files are necessary for tracking down runtime system issues and troubleshooting system management tasks. Prior research has emphasized the importance of logging statements in the operation and debugging of software systems. However, the current logging process is mostly manual and ad hoc, and thus, proper placement and content of logging statements remain as challenges. To overcome these challenges, methods that aim to automate log placement and log content, i.e., 'where, what, and how to log', are of high interest. Thus, we propose to accomplish the goal of this research, that is "to predict the log statements by utilizing source code clones and natural language processing (NLP)", as these approaches provide additional context and advantage for log prediction.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software System Performance and Reliability · Software Engineering Techniques and Practices
