Building a Few-Shot Cross-Domain Multilingual NLU Model for Customer Care
Saurabh Kumar, Sourav Bansal, Neeraj Agrawal, Priyanka Bhatt

TL;DR
This paper introduces a few-shot learning approach for cross-domain multilingual customer care intent classification, significantly improving accuracy with minimal labeled data across different channels, regions, and languages.
Contribution
It proposes a novel embedder-cum-classifier architecture with domain adaptation strategies for effective few-shot cross-domain multilingual NLU in customer care.
Findings
Achieved 20-23% accuracy improvement over SOTA models.
Effective cross-domain generalization with minimal labeled data.
Validated on e-commerce datasets from Canada and Mexico.
Abstract
Customer care is an essential pillar of the e-commerce shopping experience with companies spending millions of dollars each year, employing automation and human agents, across geographies (like US, Canada, Mexico, Chile), channels (like Chat, Interactive Voice Response (IVR)), and languages (like English, Spanish). SOTA pre-trained models like multilingual-BERT, fine-tuned on annotated data have shown good performance in downstream tasks relevant to Customer Care. However, model performance is largely subject to the availability of sufficient annotated domain-specific data. Cross-domain availability of data remains a bottleneck, thus building an intent classifier that generalizes across domains (defined by channel, geography, and language) with only a few annotations, is of great practical value. In this paper, we propose an embedder-cum-classifier model architecture which extends…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsADaptive gradient method with the OPTimal convergence rate · Knowledge Distillation
