Generative Context Pair Selection for Multi-hop Question Answering
Dheeru Dua, Cicero Nogueira dos Santos, Patrick Ng, Ben Athiwaratkun,, Bing Xiang, Matt Gardner, Sameer Singh

TL;DR
This paper introduces a generative context selection approach for multi-hop question answering that improves reasoning robustness and outperforms baselines on adversarial datasets by modeling question generation from context pairs.
Contribution
It proposes a novel generative model for context selection in multi-hop QA, enhancing reasoning robustness and addressing dataset biases.
Findings
Achieves 4.9% higher performance on adversarial datasets.
Comparable overall answering performance to state-of-the-art.
Improves model robustness against dataset biases.
Abstract
Compositional reasoning tasks like multi-hop question answering, require making latent decisions to get the final answer, given a question. However, crowdsourced datasets often capture only a slice of the underlying task distribution, which can induce unanticipated biases in models performing compositional reasoning. Furthermore, discriminatively trained models exploit such biases to get a better held-out performance, without learning the right way to reason, as they do not necessitate paying attention to the question representation (conditioning variable) in its entirety, to estimate the answer likelihood. In this work, we propose a generative context selection model for multi-hop question answering that reasons about how the given question could have been generated given a context pair. While being comparable to the state-of-the-art answering performance, our proposed generative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
