Teaching Broad Reasoning Skills for Multi-Step QA by Generating Hard Contexts

Harsh Trivedi; Niranjan Balasubramanian; Tushar Khot; Ashish Sabharwal

arXiv:2205.12496·cs.CL·November 7, 2022·1 cites

TL;DR

This paper introduces TeaBReaC, a synthetic dataset created using question decompositions to teach language models broad multi-step reasoning skills, significantly improving their accuracy and robustness in multi-step question-answering tasks.

Contribution

The paper presents a novel pretraining dataset, TeaBReaC, generated with question decompositions to enhance multi-step reasoning in language models, demonstrating substantial performance gains.

Findings

01

Pretraining on TeaBReaC before fine-tuning improves F1 by up to 13 points across 4 multi-step QA datasets.

02

Models show 5-8 point improvements on robustness contrast sets.

03

Pretraining benefits are consistent even with numerate pretrained models.
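
The gains above are stated in F1 points. As a rough illustration, and assuming they refer to the token-overlap F1 commonly used in QA evaluation (this page does not confirm the exact metric or normalization), the score for one prediction can be sketched as follows. The function name `token_f1` and the whitespace tokenization are illustrative choices, not taken from the paper.

```python
from collections import Counter


def token_f1(prediction: str, gold: str) -> float:
    """Token-overlap F1 between a predicted and a gold answer (0.0 to 1.0).

    Assumption: simple lowercasing and whitespace tokenization; real
    evaluation scripts usually also strip punctuation and articles.
    """
    pred_tokens = prediction.lower().split()
    gold_tokens = gold.lower().split()
    if not pred_tokens or not gold_tokens:
        # Both empty counts as a match; one empty counts as a miss.
        return float(pred_tokens == gold_tokens)
    # Multiset intersection counts each shared token at most as often
    # as it appears in both answers.
    overlap = sum((Counter(pred_tokens) & Counter(gold_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)


score = token_f1("the chicago bears", "chicago bears")
```

Under this reading, a 13-point improvement corresponds to a difference of 0.13 in the dataset-level average of such scores.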

Abstract

Question-answering datasets require a broad set of reasoning skills. We show how to use question decompositions to teach language models these broad reasoning skills in a robust fashion. Specifically, we use widely available QDMR representations to programmatically create hard-to-cheat synthetic contexts for real questions in six multi-step reasoning datasets. These contexts are carefully designed to avoid reasoning shortcuts prevalent in real contexts that prevent models from learning the right skills. This results in a pretraining dataset, named TeaBReaC, containing 525K multi-step questions (with associated formal programs) covering about 900 reasoning patterns. We show that pretraining standard language models (LMs) on TeaBReaC before fine-tuning them on target datasets improves their performance by up to 13 F1 points across 4 multi-step QA datasets, with up to 21 point gain on more…
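
To make the idea of a question decomposition paired with a programmatically generated context more concrete, here is a minimal sketch. It is not the authors' pipeline: the step format, the operator names (`select`, `filter`, `count`), the example question, and the helper functions are all hypothetical simplifications. It only shows how a step-by-step program can be executed over synthetic facts so that the answer depends on following every step.

```python
import random

# Hypothetical decomposition of "How many touchdowns did the Bears score?"
# Each step is (operator, *arguments); "#k" refers to the output of step k.
PROGRAM = [
    ("select", "touchdown"),
    ("filter", "#1", "team", "Bears"),
    ("count", "#2"),
]


def make_context(rng: random.Random, n_items: int = 6) -> list:
    """Generate synthetic facts, including distractors from another team."""
    teams = ["Bears", "Lions"]
    return [
        {"id": f"e{i}", "type": "touchdown", "team": rng.choice(teams)}
        for i in range(n_items)
    ]


def execute(program: list, context: list):
    """Run the steps in order and return the output of the last step."""
    outputs = []

    def resolve(ref: str):
        # "#2" -> output of the second step (1-indexed).
        return outputs[int(ref[1:]) - 1]

    for op, *args in program:
        if op == "select":
            result = [fact for fact in context if fact["type"] == args[0]]
        elif op == "filter":
            ref, key, value = args
            result = [fact for fact in resolve(ref) if fact[key] == value]
        elif op == "count":
            result = len(resolve(args[0]))
        else:
            raise ValueError(f"unsupported operator: {op}")
        outputs.append(result)
    return outputs[-1]


rng = random.Random(0)  # fixed seed so the generated context is reproducible
context = make_context(rng)
answer = execute(PROGRAM, context)
```

Because the context contains distractor facts for another team, a model that skips the filter step would count the wrong items, which is the kind of shortcut the abstract says the generated contexts are designed to avoid.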

Code & Models

Repositories

stonybrooknlp/teabreac (PyTorch, official)

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications