PINTO: Faithful Language Reasoning Using Prompt-Generated Rationales
Peifeng Wang, Aaron Chan, Filip Ilievski, Muhao Chen, Xiang Ren

TL;DR
PINTO is a prompt-based reasoning pipeline for language models that improves faithfulness and generalization by learning to reason over generated rationales with counterfactual regularization.
Contribution
It introduces a novel prompt-based learning approach with counterfactual regularization to produce more faithful and generalizable rationales in language reasoning tasks.
Findings
PINTO outperforms baselines on multiple datasets.
Rationales generated by PINTO are more faithful to model decisions.
PINTO enhances both in-distribution and out-of-distribution performance.
Abstract
Neural language models (LMs) have achieved impressive results on various language-based reasoning tasks by utilizing latent knowledge encoded in their own pretrained parameters. To make this reasoning process more explicit, recent works retrieve a rationalizing LM's internal knowledge by training or prompting it to generate free-text rationales, which can be used to guide task predictions made by either the same LM or a separate reasoning LM. However, rationalizing LMs require expensive rationale annotation and/or computation, without any assurance that their generated rationales improve LM task performance or faithfully reflect LM decision-making. In this paper, we propose PINTO, an LM pipeline that rationalizes via prompt-based learning, and learns to faithfully reason over rationales via counterfactual regularization. First, PINTO maps out a suitable reasoning process for the task…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
MethodsTest
