Crystal: Introspective Reasoners Reinforced with Self-Feedback
Jiacheng Liu, Ramakanth Pasunuru, Hannaneh Hajishirzi, Yejin Choi,, Asli Celikyilmaz

TL;DR
Crystal is an introspective reasoning model that improves commonsense reasoning by self-reflectively identifying relevant knowledge and grounding its predictions, with reinforcement learning tuning its introspection and reasoning processes.
Contribution
It introduces a novel introspective approach with self-feedback for commonsense reasoning, outperforming existing methods and enhancing transparency.
Findings
Crystal significantly outperforms supervised finetuning methods.
It enhances the transparency of the reasoning process.
Reinforcement learning effectively tunes introspection and reasoning modes.
Abstract
Extensive work has shown that the performance and interpretability of commonsense reasoning can be improved via knowledge-augmented reasoning methods, where the knowledge that underpins the reasoning process is explicitly verbalized and utilized. However, existing implementations, including "chain-of-thought" and its variants, fall short in capturing the introspective nature of knowledge required in commonsense reasoning, and in accounting for the mutual adaptation between the generation and utilization of knowledge. We propose a novel method to develop an introspective commonsense reasoner, Crystal. To tackle commonsense problems, it first introspects for knowledge statements related to the given question, and subsequently makes an informed prediction that is grounded in the previously introspected knowledge. The knowledge introspection and knowledge-grounded reasoning modes of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Bayesian Modeling and Causal Inference
