Contextualized Attention-based Knowledge Transfer for Spoken Conversational Question Answering
Chenyu You, Nuo Chen, Yuexian Zou

TL;DR
This paper introduces CADNet, a novel attention-based distillation method that enhances spoken conversational question answering by improving robustness to ASR errors through contextualized embeddings and knowledge distillation.
Contribution
The paper proposes a new attention-based distillation framework, CADNet, specifically designed to improve ASR robustness in spoken conversational QA tasks.
Findings
CADNet significantly outperforms baseline models on Spoken-CoQA dataset.
The approach effectively mitigates ASR noise impact on QA performance.
Extensive experiments validate the robustness and effectiveness of the proposed method.
Abstract
Spoken conversational question answering (SCQA) requires machines to model complex dialogue flow given the speech utterances and text corpora. Different from traditional text question answering (QA) tasks, SCQA involves audio signal processing, passage comprehension, and contextual understanding. However, ASR systems introduce unexpected noisy signals to the transcriptions, which result in performance degradation on SCQA. To overcome the problem, we propose CADNet, a novel contextualized attention-based distillation approach, which applies both cross-attention and self-attention to obtain ASR-robust contextualized embedding representations of the passage and dialogue history for performance improvements. We also introduce the spoken conventional knowledge distillation framework to distill the ASR-robust knowledge from the estimated probabilities of the teacher model to the student. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsKnowledge Distillation
