ASR Adaptation for E-commerce Chatbots using Cross-Utterance Context and Multi-Task Language Modeling
Ashish Shenoy, Sravan Bodapati, Katrin Kirchhoff

TL;DR
This paper enhances ASR accuracy for e-commerce chatbots by integrating cross-utterance context, domain adaptation techniques, and multi-task learning to improve content word recognition and slot filling.
Contribution
It introduces a multi-task language model that jointly detects content words and performs language modeling, improving ASR robustness in e-commerce scenarios.
Findings
19.2% reduction in content WER
6.4% improvement in slot labeling F1
Effective domain adaptation with cross-utterance context
Abstract
Automatic Speech Recognition (ASR) robustness toward slot entities are critical in e-commerce voice assistants that involve monetary transactions and purchases. Along with effective domain adaptation, it is intuitive that cross utterance contextual cues play an important role in disambiguating domain specific content words from speech. In this paper, we investigate various techniques to improve contextualization, content word robustness and domain adaptation of a Transformer-XL neural language model (NLM) to rescore ASR N-best hypotheses. To improve contextualization, we utilize turn level dialogue acts along with cross utterance context carry over. Additionally, to adapt our domain-general NLM towards e-commerce on-the-fly, we use embeddings derived from a finetuned masked LM on in-domain data. Finally, to improve robustness towards in-domain content words, we propose a multi-task…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Cosine Annealing · Adaptive Input Representations · Residual Connection · *Communicated@Fast*How Do I Communicate to Expedia? · Linear Warmup With Cosine Annealing · Dense Connections · Softmax
