Teaching Broad Reasoning Skills for Multi-Step QA by Generating Hard Contexts
Harsh Trivedi, Niranjan Balasubramanian, Tushar Khot, Ashish Sabharwal

TL;DR
This paper introduces TeaBReaC, a synthetic dataset created using question decompositions to teach language models broad multi-step reasoning skills, significantly improving their accuracy and robustness in multi-step question-answering tasks.
Contribution
The paper presents a novel pretraining dataset, TeaBReaC, generated with question decompositions to enhance multi-step reasoning in language models, demonstrating substantial performance gains.
Findings
Pretraining on TeaBReaC improves F1 scores by up to 13 points.
Models show 5-8 point improvements on robustness contrast sets.
Pretraining benefits are consistent even with numerate pretrained models.
Abstract
Question-answering datasets require a broad set of reasoning skills. We show how to use question decompositions to teach language models these broad reasoning skills in a robust fashion. Specifically, we use widely available QDMR representations to programmatically create hard-to-cheat synthetic contexts for real questions in six multi-step reasoning datasets. These contexts are carefully designed to avoid reasoning shortcuts prevalent in real contexts that prevent models from learning the right skills. This results in a pretraining dataset, named TeaBReaC, containing 525K multi-step questions (with associated formal programs) covering about 900 reasoning patterns. We show that pretraining standard language models (LMs) on TeaBReaC before fine-tuning them on target datasets improves their performance by up to 13 F1 points across 4 multi-step QA datasets, with up to 21 point gain on more…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗StonyBrookNLP/bart-large-dropmodel· 3 dl3 dl
- 🤗StonyBrookNLP/bart-large-iirc-goldmodel· 2 dl2 dl
- 🤗StonyBrookNLP/bart-large-iirc-retrievedmodel· 4 dl4 dl
- 🤗StonyBrookNLP/bart-large-numgluemodel· 1 dl1 dl
- 🤗StonyBrookNLP/bart-large-tatqamodel· 1 dl1 dl
- 🤗StonyBrookNLP/t5-large-dropmodel· 1 dl1 dl
- 🤗StonyBrookNLP/t5-large-iirc-goldmodel· 3 dl3 dl
- 🤗StonyBrookNLP/t5-large-iirc-retrievedmodel· 1 dl1 dl
- 🤗StonyBrookNLP/t5-large-numgluemodel· 1 dl1 dl
- 🤗StonyBrookNLP/t5-large-tatqamodel· 1 dl1 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
