Investigating the Robustness of Natural Language Generation from Logical   Forms via Counterfactual Samples

Chengyuan Liu; Leilei Gan; Kun Kuang; Fei Wu

arXiv:2210.08548·cs.CL·October 18, 2022

Investigating the Robustness of Natural Language Generation from Logical Forms via Counterfactual Samples

Chengyuan Liu, Leilei Gan, Kun Kuang, Fei Wu

PDF

Open Access 2 Repos

TL;DR

This paper examines the robustness of logical form-based text generation models, revealing their reliance on spurious correlations and proposing methods to improve genuine logical reasoning through causal analysis and counterfactual data.

Contribution

It introduces a causal perspective on model bias, and proposes hierarchical logical form encoding and counterfactual data augmentation to enhance reasoning capabilities.

Findings

01

Models perform poorly on counterfactual samples, indicating reliance on spurious correlations.

02

Proposed methods improve robustness and reasoning ability on both original and counterfactual datasets.

03

The approach advances the development of logically faithful text generation models.

Abstract

The aim of Logic2Text is to generate controllable and faithful texts conditioned on tables and logical forms, which not only requires a deep understanding of the tables and logical forms, but also warrants symbolic reasoning over the tables. State-of-the-art methods based on pre-trained models have achieved remarkable performance on the standard test dataset. However, we question whether these methods really learn how to perform logical reasoning, rather than just relying on the spurious correlations between the headers of the tables and operators of the logical form. To verify this hypothesis, we manually construct a set of counterfactual samples, which modify the original logical forms to generate counterfactual logical forms with rarely co-occurred table headers and logical operators. SOTA methods give much worse results on these counterfactual samples compared with the results on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques

MethodsTest