Privacy-Preserving Methods for Bug Severity Prediction
Havvanur Dervi\c{s}o\u{g}lu, Ru\c{s}en Halepmollas{\i}, Elif Eyvaz

TL;DR
This paper explores privacy-preserving techniques like federated learning and synthetic data generation for bug severity prediction in software engineering, demonstrating comparable performance to traditional methods without data sharing.
Contribution
It introduces and evaluates privacy-preserving approaches for bug severity prediction using source code metrics and Large Language Models, addressing data sharing constraints.
Findings
Federated learning and synthetic data achieve similar accuracy to centralized models.
Privacy-preserving methods enable effective bug severity prediction without sharing raw data.
Experimental results on two datasets validate the approaches' effectiveness.
Abstract
Bug severity prediction is a critical task in software engineering as it enables more efficient resource allocation and prioritization in software maintenance. While AI-based analyses and models significantly require access to extensive datasets, industrial applications face challenges due to data-sharing constraints and the limited availability of labeled data. In this study, we investigate method-level bug severity prediction using source code metrics and Large Language Models (LLMs) with two widely used datasets. We compare the performance of models trained using centralized learning, federated learning, and synthetic data generation. Our experimental results, obtained using two widely recognized software defect datasets, indicate that models trained with federated learning and synthetic data achieve comparable results to centrally trained models without data sharing. Our finding…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Testing and Debugging Techniques · Software Reliability and Analysis Research
