Profitable Trade-Off Between Memory and Performance In Multi-Domain Chatbot Architectures
D. Emre Ta\c{s}ar, \c{S}\"ukr\"u Ozan, M. Fatih Akca, O\u{g}uzhan, \"Olmez, Semih G\"ul\"um, Se\c{c}ilay Kutal, Ceren Belhan

TL;DR
This paper explores a trade-off between memory usage and performance in multi-domain chatbot architectures by comparing a single, multi-task BERT model with multiple specialized models, aiming to optimize server load and classification accuracy.
Contribution
It introduces a masking method for multi-task BERT classification, evaluating its effectiveness against specialized models across diverse datasets in a chatbot context.
Findings
Single BERT model with masking performs comparably to multiple models.
Multi-task BERT reduces server load and memory usage.
Performance varies depending on dataset complexity.
Abstract
Text classification problem is a very broad field of study in the field of natural language processing. In short, the text classification problem is to determine which of the previously determined classes the given text belongs to. Successful studies have been carried out in this field in the past studies. In the study, Bidirectional Encoder Representations for Transformers (BERT), which is a frequently preferred method for solving the classification problem in the field of natural language processing, is used. By solving classification problems through a single model to be used in a chatbot architecture, it is aimed to alleviate the load on the server that will be created by more than one model used for solving more than one classification problem. At this point, with the masking method applied during the estimation of a single BERT model, which was created for classification in more…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Spam and Phishing Detection · Sentiment Analysis and Opinion Mining
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Attention Dropout · WordPiece · Dropout · Weight Decay · Residual Connection · Dense Connections
