TL;DR
This paper explores training neural models to generate explanations for nonmonotonic reasoning tasks using only distant supervision, aiming to reduce annotation costs and improve generalizability.
Contribution
It introduces methods to automatically generate rationales without human annotations and evaluates their effectiveness on defeasible inference tasks.
Findings
Models can generate rationales explaining inference likelihood changes
Generated rationales are often trivial due to neural language model limitations
Joint prediction of updates and rationales remains a challenging future direction
Abstract
The black-box nature of neural models has motivated a line of research that aims to generate natural language rationales to explain why a model made certain predictions. Such rationale generation models, to date, have been trained on dataset-specific crowdsourced rationales, but this approach is costly and is not generalizable to new tasks and domains. In this paper, we investigate the extent to which neural models can reason about natural language rationales that explain model predictions, relying only on distant supervision with no additional annotation cost for human-written rationales. We investigate multiple ways to automatically generate rationales using pre-trained language models, neural knowledge models, and distant supervision from related tasks, and train generative models capable of composing explanatory rationales for unseen instances. We demonstrate our approach on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
