Profitable Trade-Off Between Memory and Performance In Multi-Domain   Chatbot Architectures

D. Emre Ta\c{s}ar; \c{S}\"ukr\"u Ozan; M. Fatih Akca; O\u{g}uzhan; \"Olmez; Semih G\"ul\"um; Se\c{c}ilay Kutal; Ceren Belhan

arXiv:2111.03963·cs.CL·December 28, 2021

Profitable Trade-Off Between Memory and Performance In Multi-Domain Chatbot Architectures

D. Emre Ta\c{s}ar, \c{S}\"ukr\"u Ozan, M. Fatih Akca, O\u{g}uzhan, \"Olmez, Semih G\"ul\"um, Se\c{c}ilay Kutal, Ceren Belhan

PDF

Open Access

TL;DR

This paper explores a trade-off between memory usage and performance in multi-domain chatbot architectures by comparing a single, multi-task BERT model with multiple specialized models, aiming to optimize server load and classification accuracy.

Contribution

It introduces a masking method for multi-task BERT classification, evaluating its effectiveness against specialized models across diverse datasets in a chatbot context.

Findings

01

Single BERT model with masking performs comparably to multiple models.

02

Multi-task BERT reduces server load and memory usage.

03

Performance varies depending on dataset complexity.

Abstract

Text classification problem is a very broad field of study in the field of natural language processing. In short, the text classification problem is to determine which of the previously determined classes the given text belongs to. Successful studies have been carried out in this field in the past studies. In the study, Bidirectional Encoder Representations for Transformers (BERT), which is a frequently preferred method for solving the classification problem in the field of natural language processing, is used. By solving classification problems through a single model to be used in a chatbot architecture, it is aimed to alleviate the load on the server that will be created by more than one model used for solving more than one classification problem. At this point, with the masking method applied during the estimation of a single BERT model, which was created for classification in more…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Spam and Phishing Detection · Sentiment Analysis and Opinion Mining

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Attention Dropout · WordPiece · Dropout · Weight Decay · Residual Connection · Dense Connections