TL;DR
This paper introduces a contrastive learning framework for multi-document question generation, enabling the creation of more specific questions by distinguishing relevant documents from irrelevant ones.
Contribution
It proposes the MSCQG framework with a dual-stage training process and a novel contrastive regularization to improve question specificity in multi-document settings.
Findings
Outperforms strong baselines in automatic metrics
Achieves higher human evaluation scores for question relevance
Effectively generates more specific questions
Abstract
Multi-document question generation focuses on generating a question that covers the common aspect of multiple documents. Such a model is useful in generating clarifying options. However, a naive model trained only using the targeted ("positive") document set may generate too generic questions that cover a larger scope than delineated by the document set. To address this challenge, we introduce the contrastive learning strategy where given "positive" and "negative" sets of documents, we generate a question that is closely related to the "positive" set but is far away from the "negative" set. This setting allows generated questions to be more specific and related to the target document set. To generate such specific questions, we propose Multi-Source Coordinated Question Generator (MSCQG), a novel framework that includes a supervised learning (SL) stage and a reinforcement learning (RL)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
