What to Prioritize? Natural Language Processing for the Development of a Modern Bug Tracking Solution in Hardware Development
Thi Thu Hang Do, Markus Dobler, Niklas K\"uhl

TL;DR
This paper explores machine learning techniques, especially text embeddings and classifiers, to predict bug report resolution time, risk, and complexity in hardware development, aiming to improve bug prioritization and management.
Contribution
It introduces a novel approach combining text embeddings and machine learning models to predict bug report attributes, outperforming existing methods.
Findings
Universal Sentence Encoder with MLP yields best performance
Text embeddings significantly improve prediction accuracy
Active learning impacts model effectiveness
Abstract
Managing large numbers of incoming bug reports and finding the most critical issues in hardware development is time consuming, but crucial in order to reduce development costs. In this paper, we present an approach to predict the time to fix, the risk and the complexity of debugging and resolution of a bug report using different supervised machine learning algorithms, namely Random Forest, Naive Bayes, SVM, MLP and XGBoost. Further, we investigate the effect of the application of active learning and we evaluate the impact of different text representation techniques, namely TF-IDF, Word2Vec, Universal Sentence Encoder and XLNet on the model's performance. The evaluation shows that a combination of text embeddings generated through the Universal Sentence Encoder and MLP as classifier outperforms all other methods, and is well suited to predict the risk and complexity of bug tickets.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Testing and Debugging Techniques · Software Engineering Techniques and Practices
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Residual Connection · Softmax · SentencePiece · Dropout · Layer Normalization
