Improving Question Answering with Generation of NQ-like Questions

Saptarashmi Bandyopadhyay; Shraman Pal; Hao Zou; Abhranil Chandra,; Jordan Boyd-Graber

arXiv:2210.06599·cs.CL·October 14, 2022

Improving Question Answering with Generation of NQ-like Questions

Saptarashmi Bandyopadhyay, Shraman Pal, Hao Zou, Abhranil Chandra,, Jordan Boyd-Graber

PDF

Open Access

TL;DR

This paper presents an algorithm that automatically generates NQ-like questions from longer trivia questions, enhancing data availability and improving QA system performance, especially in low-resource settings.

Contribution

The paper introduces a novel method to automatically convert trivia questions into NQ-like questions, increasing data for QA systems while maintaining quality.

Findings

01

Generated data improves QA performance in low-resource settings

02

Neural classifier effectively filters ill-formed questions

03

Method enhances scalability of training data for QA systems

Abstract

Question Answering (QA) systems require a large amount of annotated data which is costly and time-consuming to gather. Converting datasets of existing QA benchmarks are challenging due to different formats and complexities. To address these issues, we propose an algorithm to automatically generate shorter questions resembling day-to-day human communication in the Natural Questions (NQ) dataset from longer trivia questions in Quizbowl (QB) dataset by leveraging conversion in style among the datasets. This provides an automated way to generate more data for our QA systems. To ensure quality as well as quantity of data, we detect and remove ill-formed questions using a neural classifier. We demonstrate that in a low resource setting, using the generated data improves the QA performance over the baseline system on both NQ and QB data. Our algorithm improves the scalability of training data…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques