TL;DR
This paper explores the use of transfer learning, particularly BERT models, to improve the classification of feature requests and bug reports from noisy, multilingual user comments on social media and app stores, showing monolingual BERT models outperform traditional methods.
Contribution
It demonstrates the effectiveness of monolingual BERT models over existing approaches for classifying user comments in English and Italian, highlighting the limitations of multilingual models.
Findings
Monolingual BERT models outperform traditional ML methods.
Multilingual BERT models perform worse than traditional methods.
Heavyweight transfer learning models do not always improve performance.
Abstract
Identifying feature requests and bug reports in user comments holds great potential for development teams. However, automated mining of RE-related information from social media and app stores is challenging since (1) about 70% of user comments contain noisy, irrelevant information, (2) the amount of user comments grows daily making manual analysis unfeasible, and (3) user comments are written in different languages. Existing approaches build on traditional machine learning (ML) and deep learning (DL), but fail to detect feature requests and bug reports with high Recall and acceptable Precision which is necessary for this task. In this paper, we investigate the potential of transfer learning (TL) for the classification of user comments. Specifically, we train both monolingual and multilingual BERT models and compare the performance with state-of-the-art methods. We found that monolingual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Dropout · Multi-Head Attention · Layer Normalization · Dense Connections · Linear Warmup With Linear Decay · Residual Connection · Softmax
