Improved Customer Transaction Classification using Semi-Supervised Knowledge Distillation
Rohan Sukumaran

TL;DR
This paper presents a semi-supervised knowledge distillation approach for classifying customer transactions from free text, achieving high accuracy with fewer parameters, and is deployed in production for trend analysis.
Contribution
It introduces a cost-effective semi-supervised framework using knowledge distillation with RoBERTa as teacher and ALBERT as student for transaction classification.
Findings
RoBERTa outperforms other models in categorization tasks.
ALBERT with knowledge distillation achieves comparable performance to RoBERTa.
The approach is effective on large internal and public datasets.
Abstract
In pickup and delivery services, transaction classification based on customer provided free text is a challenging problem. It involves the association of a wide variety of customer inputs to a fixed set of categories while adapting to the various customer writing styles. This categorization is important for the business: it helps understand the market needs and trends, and also assist in building a personalized experience for different segments of the customers. Hence, it is vital to capture these category information trends at scale, with high precision and recall. In this paper, we focus on a specific use-case where a single category drives each transaction. We propose a cost-effective transaction classification approach based on semi-supervision and knowledge distillation frameworks. The approach identifies the category of a transaction using free text input given by the customer. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Web Data Mining and Analysis · Sentiment Analysis and Opinion Mining
MethodsLinear Layer · Knowledge Distillation · Weight Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Attention Dropout · LAMB · Layer Normalization · WordPiece · Dense Connections · Adam
