A Transformer-based Approach for Augmenting Software Engineering   Chatbots Datasets

Ahmad Abdellatif; Khaled Badran; Diego Elias Costa; Emad Shihab

arXiv:2407.11955·cs.SE·July 17, 2024

A Transformer-based Approach for Augmenting Software Engineering Chatbots Datasets

Ahmad Abdellatif, Khaled Badran, Diego Elias Costa, Emad Shihab

PDF

Open Access

TL;DR

This paper introduces a transformer-based method to automatically augment software engineering chatbot datasets, enhancing natural language understanding performance by generating diverse, semantically consistent queries.

Contribution

It presents a novel combination of NLP techniques with BART transformer for dataset augmentation in SE chatbots, improving intent classification accuracy.

Findings

01

Augmentation improves Rasa NLU performance.

02

Generated queries vary in structure but retain semantics.

03

Confidence in intent classification increases with augmentation.

Abstract

Background: The adoption of chatbots into software development tasks has become increasingly popular among practitioners, driven by the advantages of cost reduction and acceleration of the software development process. Chatbots understand users' queries through the Natural Language Understanding component (NLU). To yield reasonable performance, NLUs have to be trained with extensive, high-quality datasets, that express a multitude of ways users may interact with chatbots. However, previous studies show that creating a high-quality training dataset for software engineering chatbots is expensive in terms of both resources and time. Aims: Therefore, in this paper, we present an automated transformer-based approach to augment software engineering chatbot datasets. Method: Our approach combines traditional natural language processing techniques with the BART transformer to augment a dataset…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI in Service Interactions