Technical report on Conversational Question Answering

Ying Ju; Fubang Zhao; Shijie Chen; Bowen Zheng; Xuefeng Yang; Yunfeng; Liu

arXiv:1909.10772·cs.CL·September 25, 2019·42 cites

Technical report on Conversational Question Answering

Ying Ju, Fubang Zhao, Shijie Chen, Bowen Zheng, Xuefeng Yang, Yunfeng, Liu

PDF

Open Access

TL;DR

This paper presents a new conversational question answering system combining multiple techniques, achieving state-of-the-art performance on CoQA without data augmentation.

Contribution

It introduces a novel system integrating rationale tagging, adversarial training, knowledge distillation, and post-processing for improved conversational QA.

Findings

01

Achieved 90.4 F1 on CoQA test set, surpassing previous single models.

02

Demonstrated effectiveness of multi-task and adversarial training in conversational QA.

03

No data augmentation needed for top performance.

Abstract

Conversational Question Answering is a challenging task since it requires understanding of conversational history. In this project, we propose a new system RoBERTa + AT +KD, which involves rationale tagging multi-task, adversarial training, knowledge distillation and a linguistic post-process strategy. Our single model achieves 90.4(F1) on the CoQA test set without data augmentation, outperforming the current state-of-the-art single model by 2.6% F1.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques

MethodsLinear Layer · Knowledge Distillation · Adam · Softmax · Layer Normalization · Dropout · Attention Is All You Need · Multi-Head Attention · Residual Connection · Attention Dropout