Multi-class Text Classification using BERT-based Active Learning
Sumanth Prabhu, Moosa Mohamed, Hemant Misra

TL;DR
This paper investigates the use of active learning strategies with BERT to efficiently classify short, incoherent, and code-mixed customer transaction descriptions in the pickup and delivery industry, reducing labeling costs.
Contribution
It introduces active learning approaches tailored for BERT to improve multi-class text classification on challenging, real-world customer data with minimal manual labeling.
Findings
Active learning improves labeling efficiency for BERT models.
BERT-based active learning outperforms random sampling in accuracy.
Effective in classifying short, incoherent, and code-mixed texts.
Abstract
Text Classification finds interesting applications in the pickup and delivery services industry where customers require one or more items to be picked up from a location and delivered to a certain destination. Classifying these customer transactions into multiple categories helps understand the market needs for different customer segments. Each transaction is accompanied by a text description provided by the customer to describe the products being picked up and delivered which can be used to classify the transaction. BERT-based models have proven to perform well in Natural Language Understanding. However, the product descriptions provided by the customers tend to be short, incoherent and code-mixed (Hindi-English) text which demands fine-tuning of such models with manually labelled data to achieve high accuracy. Collecting this labelled data can prove to be expensive. In this paper, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Machine Learning and Algorithms · Imbalanced Data Classification Techniques
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · WordPiece · Residual Connection · Softmax · Linear Warmup With Linear Decay · Layer Normalization
