PDLogger: Automated Logging Framework for Practical Software Development
Shengcheng Duan, Yihua Xu, Sheng Zhang, Shen Wang, Yue Duan

TL;DR
PDLogger is an innovative end-to-end framework that leverages large language models and program analysis techniques to generate comprehensive, high-quality logs in practical software development scenarios, significantly outperforming prior methods.
Contribution
It introduces the first complete multi-log generation approach combining LLMs with program slicing and variable extraction, addressing limitations of existing isolated log prediction techniques.
Findings
Significantly improves log-position precision by 139%.
Enhances log level accuracy by 82.3%.
Achieves robust performance across different LLMs.
Abstract
Logging is indispensable for maintaining the reliability and diagnosability of modern software, yet developers still struggle to decide where and how to log effectively. Existing automated logging techniques focus on isolated sub-tasks - predicting a single log position, level, or message - and therefore cannot produce complete, high-quality log statements that reflect real-world practice in which multiple logs often appear inside one method. They also neglect deeper semantic dependencies among methods and consider only a narrow set of candidate variables, leading to superficial or incomplete logs. In this paper, we present PDLogger, the first end-to-end log generation technique expressly designed for practical, multi-log scenarios. PDLogger operates in three phases. (1) Log position prediction: block-type-aware structured prompts guide a large language model (LLM) to suggest candidate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
