Chatbot Interaction with Artificial Intelligence: Human Data Augmentation with T5 and Language Transformer Ensemble for Text Classification
Jordan J. Bird, Anik\'o Ek\'art, Diego R. Faria

TL;DR
This paper introduces the CI-AI framework that enhances chatbot task classification by augmenting human data with T5 paraphrasing and ensemble transformer models, achieving near-perfect accuracy.
Contribution
It presents a novel data augmentation approach using T5 and demonstrates improved classification performance with transformer ensembles for chatbot interactions.
Findings
Data augmentation with T5 improves accuracy by 4.01%.
RoBERTa with T5 data achieves 98.96% accuracy.
Ensemble of top models reaches 99.59% accuracy.
Abstract
In this work, we present the Chatbot Interaction with Artificial Intelligence (CI-AI) framework as an approach to the training of deep learning chatbots for task classification. The intelligent system augments human-sourced data via artificial paraphrasing in order to generate a large set of training data for further classical, attention, and language transformation-based learning approaches for Natural Language Processing. Human beings are asked to paraphrase commands and questions for task identification for further execution of a machine. The commands and questions are split into training and validation sets. A total of 483 responses were recorded. Secondly, the training set is paraphrased by the T5 model in order to augment it with further data. Seven state-of-the-art transformer-based text classification algorithms (BERT, DistilBERT, RoBERTa, DistilRoBERTa, XLM, XLM-RoBERTa, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLinear Layer · Linear Warmup With Linear Decay · Weight Decay · Residual Connection · Gated Linear Unit · Attention Is All You Need · Inverse Square Root Schedule · WordPiece · BERT · Adam
