TL;DR
ParaPattern is a novel method that generates natural language deductions for open-domain reasoning, leveraging automated training data construction and outperforming baselines in accuracy and flexibility.
Contribution
It introduces a BART-based model trained with an automated pipeline to generate logical inferences from natural language without requiring in-domain supervision.
Findings
Achieves 85% validity on EntailmentBank substitution examples
Outperforms baseline systems in accuracy and flexibility
Matches performance of domain-specific fine-tuned models
Abstract
An interpretable system for open-domain reasoning needs to express its reasoning process in a transparent form. Natural language is an attractive representation for this purpose -- it is both highly expressive and easy for humans to understand. However, manipulating natural language statements in logically consistent ways is hard: models must cope with variation in how meaning is expressed while remaining precise. In this paper, we describe ParaPattern, a method for building models to generate deductive inferences from diverse natural language inputs without direct human supervision. We train BART-based models (Lewis et al., 2020) to generate the result of applying a particular logical operation to one or more premise statements. Crucially, we develop a largely automated pipeline for constructing suitable training examples from Wikipedia. We evaluate our models using out-of-domain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
