An Initial Investigation of Non-Native Spoken Question-Answering
Vatsal Raina, Mark J.F. Gales

TL;DR
This paper explores the application of text-based machine comprehension models to non-native spoken question-answering, addressing challenges like speech recognition errors and grammar mismatches, and demonstrating transfer learning effectiveness.
Contribution
It introduces a transfer-learning approach from text-based MC to spoken QA for non-native speakers, evaluating the impact of ASR errors and grammar mismatches.
Findings
Electra MC model trained on SQuAD2.0 transfers well to SQA.
ASR errors have an approximately linear impact on SQA scores.
Grammar mismatches minimally affect SQA performance.
Abstract
Text-based machine comprehension (MC) systems have a wide-range of applications, and standard corpora exist for developing and evaluating approaches. There has been far less research on spoken question answering (SQA) systems. The SQA task considered in this paper is to extract the answer from a candidates spoken response to a question in a prompt-response style language assessment test. Applying these MC approaches to this SQA task rather than, for example, off-topic response detection provides far more detailed information that can be used for further downstream processing. One significant challenge is the lack of appropriately annotated speech corpora to train systems for this task. Hence, a transfer-learning style approach is adopted where a system trained on text-based MC is evaluated on an SQA task with non-native speakers. Mismatches must be considered between text…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech and dialogue systems · Multimodal Machine Learning Applications
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Layer Normalization · Dropout · Weight Decay · Adam · Attention Dropout · Linear Warmup With Linear Decay
