Turning Tables: Generating Examples from Semi-structured Tables for   Endowing Language Models with Reasoning Skills

Ori Yoran; Alon Talmor; Jonathan Berant

arXiv:2107.07261·cs.CL·July 16, 2021

Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning Skills

Ori Yoran, Alon Talmor, Jonathan Berant

PDF

Open Access 1 Repo

TL;DR

This paper introduces a method to enhance language models' reasoning abilities by generating synthetic question-answer pairs from semi-structured tables, focusing training on specific reasoning skills for improved performance.

Contribution

The authors propose a novel pre-training approach using automatically generated reasoning questions from tables, with targeted sampling strategies to improve reasoning skills in language models.

Findings

01

Model PReasM outperforms T5 on reasoning tasks.

02

Sampling based on model errors accelerates training.

03

Synthetic data improves reasoning skills in language models.

Abstract

Models pre-trained with a language modeling objective possess ample world knowledge and language skills, but are known to struggle in tasks that require reasoning. In this work, we propose to leverage semi-structured tables, and automatically generate at scale question-paragraph pairs, where answering the question requires reasoning over multiple facts in the paragraph. We add a pre-training step over this synthetic data, which includes examples that require 16 different reasoning skills such as number comparison, conjunction, and fact composition. To improve data efficiency, we propose sampling strategies that focus training on reasoning skills the model is currently lacking. We evaluate our approach on three reading comprehension datasets that are focused on reasoning, and show that our model, PReasM, substantially outperforms T5, a popular pre-trained encoder-decoder model. Moreover,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

oriyor/turning_tables
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Gated Linear Unit · Inverse Square Root Schedule · Refunds@Expedia|||How do I get a full refund from Expedia? · SentencePiece · Residual Connection · Adafactor