A Transfer Learning Approach for Dialogue Act Classification of GitHub Issue Comments
Ayesha Enayet, Gita Sukthankar

TL;DR
This paper introduces a transfer learning method to classify dialogue acts in GitHub issue comments, leveraging existing dialogue datasets to analyze team interactions in open source projects.
Contribution
It proposes a transfer learning framework for dialogue act classification on GitHub comments, comparing multiple encoding models to address the lack of labeled data.
Findings
BERT outperforms other models in accuracy
Transfer learning improves classification performance
Effective mapping of comments to dialogue acts achieved
Abstract
Social coding platforms, such as GitHub, serve as laboratories for studying collaborative problem solving in open source software development; a key feature is their ability to support issue reporting which is used by teams to discuss tasks and ideas. Analyzing the dialogue between team members, as expressed in issue comments, can yield important insights about the performance of virtual teams. This paper presents a transfer learning approach for performing dialogue act classification on issue comments. Since no large labeled corpus of GitHub issue comments exists, employing transfer learning enables us to leverage standard dialogue act datasets in combination with our own GitHub comment dataset. We compare the performance of several word and sentence level encoding models including Global Vectors for Word Representations (GloVe), Universal Sentence Encoder (USE), and Bidirectional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Software Engineering Research · Expert finding and Q&A systems
