Technical report on Conversational Question Answering
Ying Ju, Fubang Zhao, Shijie Chen, Bowen Zheng, Xuefeng Yang, Yunfeng, Liu

TL;DR
This paper presents a new conversational question answering system combining multiple techniques, achieving state-of-the-art performance on CoQA without data augmentation.
Contribution
It introduces a novel system integrating rationale tagging, adversarial training, knowledge distillation, and post-processing for improved conversational QA.
Findings
Achieved 90.4 F1 on CoQA test set, surpassing previous single models.
Demonstrated effectiveness of multi-task and adversarial training in conversational QA.
No data augmentation needed for top performance.
Abstract
Conversational Question Answering is a challenging task since it requires understanding of conversational history. In this project, we propose a new system RoBERTa + AT +KD, which involves rationale tagging multi-task, adversarial training, knowledge distillation and a linguistic post-process strategy. Our single model achieves 90.4(F1) on the CoQA test set without data augmentation, outperforming the current state-of-the-art single model by 2.6% F1.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques
MethodsLinear Layer · Knowledge Distillation · Adam · Softmax · Layer Normalization · Dropout · Attention Is All You Need · Multi-Head Attention · Residual Connection · Attention Dropout
