TL;DR
This paper introduces CONQUER, a reinforcement learning approach that learns to answer conversational questions over knowledge graphs using noisy feedback from question reformulations, improving performance over existing methods.
Contribution
The paper proposes a novel reinforcement learning model, CONQUER, that learns from implicit reformulation signals in conversational QA over knowledge graphs, addressing the lack of explicit labeled data.
Findings
CONQUER outperforms baseline models in experimental evaluations.
The ConvRef benchmark contains 11k conversations with 205k reformulations.
Reinforcement learning effectively leverages noisy reformulation signals for better QA performance.
Abstract
The rise of personal assistants has made conversational question answering (ConvQA) a very popular mechanism for user-system interaction. State-of-the-art methods for ConvQA over knowledge graphs (KGs) can only learn from crisp question-answer pairs found in popular benchmarks. In reality, however, such training data is hard to come by: users would rarely mark answers explicitly as correct or wrong. In this work, we take a step towards a more natural learning paradigm - from noisy and implicit feedback via question reformulations. A reformulation is likely to be triggered by an incorrect system response, whereas a new follow-up question could be a positive signal on the previous turn's answer. We present a reinforcement learning model, termed CONQUER, that can learn from a conversational stream of questions and reformulations. CONQUER models the answering process as multiple agents…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
