ParaQA: A Question Answering Dataset with Paraphrase Responses for Single-Turn Conversation

Endri Kacupaj; Barshana Banerjee; Kuldeep Singh; Jens Lehmann

arXiv:2103.07771·cs.CL·March 16, 2021

TL;DR

ParaQA is a dataset of 5000 question-answer pairs for single-turn conversational question answering over knowledge graphs. Each question comes with two to eight unique paraphrased answers, where earlier datasets provide at most one answer verbalization.

Contribution

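The paper introduces ParaQA, a dataset with multiple paraphrased responses per question, created with a semi-automated framework that uses techniques such as back-translation. It addresses a gap in existing conversational QA datasets over knowledge graphs, which focus on question paraphrasing and provide at most one answer verbalization.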

Findings

1. Baseline models, evaluated with BLEU and METEOR, illustrate the advantage of having multiple paraphrased answers per question.

2. Multiple paraphrases enhance the robustness of QA systems.

3. The dataset is publicly available on a persistent URI for research use.

Abstract

This paper presents ParaQA, a question answering (QA) dataset with multiple paraphrased responses for single-turn conversation over knowledge graphs (KG). The dataset was created using a semi-automated framework for generating diverse paraphrases of the answers using techniques such as back-translation. The existing datasets for conversational question answering over KGs (single-turn/multi-turn) focus on question paraphrasing and provide at most one answer verbalization. However, ParaQA contains 5000 question-answer pairs with a minimum of two and a maximum of eight unique paraphrased responses for each question. We complement the dataset with baseline models and illustrate the advantage of having multiple paraphrased answers through commonly used metrics such as BLEU and METEOR. The ParaQA dataset is publicly available on a persistent URI for broader usage and adaptation in the…
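To make the multi-reference evaluation idea concrete, here is a minimal sketch of the clipped (modified) n-gram precision that BLEU is built on, computed against one reference and then against several. It is not the authors' evaluation code: the function names `ngrams` and `clipped_precision` and the example sentences are hypothetical, the tokenizer is a plain whitespace split, and the brevity penalty and the geometric mean over n-gram orders that full BLEU uses are left out.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of the n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def clipped_precision(hypothesis, references, n=1):
    """Modified n-gram precision, the building block of BLEU.

    Each hypothesis n-gram count is clipped to the maximum count of that
    n-gram in any single reference, so adding references can only keep
    the score the same or raise it.
    """
    hyp_counts = ngrams(hypothesis.lower().split(), n)
    if not hyp_counts:
        return 0.0

    # For every n-gram, remember the largest count seen in any one reference.
    max_ref_counts = Counter()
    for ref in references:
        for gram, count in ngrams(ref.lower().split(), n).items():
            max_ref_counts[gram] = max(max_ref_counts[gram], count)

    matched = sum(min(count, max_ref_counts[gram])
                  for gram, count in hyp_counts.items())
    return matched / sum(hyp_counts.values())


# Hypothetical example data, not taken from ParaQA.
hypothesis = "the capital of france is paris"
single_ref = ["paris is the capital city of france"]
multi_ref = single_ref + ["the capital of france is paris"]

p_single = clipped_precision(hypothesis, single_ref, n=2)
p_multi = clipped_precision(hypothesis, multi_ref, n=2)
print(p_single, p_multi)
```

With a single reference, a correct but differently worded answer is penalized for its phrasing; a second paraphrased reference removes that penalty. This is one plausible reading of why several answer verbalizations per question help when scoring generated responses.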

Code & Models

Repositories

barshana-banerjee/ParaQA (official)
