Transfer Learning for Mining Feature Requests and Bug Reports from   Tweets and App Store Reviews

Pablo Restrepo Henao; Jannik Fischbach; Dominik Spies; Julian; Frattini; and Andreas Vogelsang

arXiv:2108.00663·cs.CL·August 3, 2021

Transfer Learning for Mining Feature Requests and Bug Reports from Tweets and App Store Reviews

Pablo Restrepo Henao, Jannik Fischbach, Dominik Spies, Julian, Frattini, and Andreas Vogelsang

PDF

1 Repo

TL;DR

This paper explores the use of transfer learning, particularly BERT models, to improve the classification of feature requests and bug reports from noisy, multilingual user comments on social media and app stores, showing monolingual BERT models outperform traditional methods.

Contribution

It demonstrates the effectiveness of monolingual BERT models over existing approaches for classifying user comments in English and Italian, highlighting the limitations of multilingual models.

Findings

01

Monolingual BERT models outperform traditional ML methods.

02

Multilingual BERT models perform worse than traditional methods.

03

Heavyweight transfer learning models do not always improve performance.

Abstract

Identifying feature requests and bug reports in user comments holds great potential for development teams. However, automated mining of RE-related information from social media and app stores is challenging since (1) about 70% of user comments contain noisy, irrelevant information, (2) the amount of user comments grows daily making manual analysis unfeasible, and (3) user comments are written in different languages. Existing approaches build on traditional machine learning (ML) and deep learning (DL), but fail to detect feature requests and bug reports with high Recall and acceptable Precision which is necessary for this task. In this paper, we investigate the potential of transfer learning (TL) for the classification of user comments. Specifically, we train both monolingual and multilingual BERT models and compare the performance with state-of-the-art methods. We found that monolingual…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

prestrepoh/Topic-Mining-From-Tweets-and-App-Reviews
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Dropout · Multi-Head Attention · Layer Normalization · Dense Connections · Linear Warmup With Linear Decay · Residual Connection · Softmax