Crystal: Introspective Reasoners Reinforced with Self-Feedback

Jiacheng Liu; Ramakanth Pasunuru; Hannaneh Hajishirzi; Yejin Choi,; Asli Celikyilmaz

arXiv:2310.04921·cs.AI·October 19, 2023

Crystal: Introspective Reasoners Reinforced with Self-Feedback

Jiacheng Liu, Ramakanth Pasunuru, Hannaneh Hajishirzi, Yejin Choi,, Asli Celikyilmaz

PDF

Open Access 1 Repo 1 Models

TL;DR

Crystal is an introspective reasoning model that improves commonsense reasoning by self-reflectively identifying relevant knowledge and grounding its predictions, with reinforcement learning tuning its introspection and reasoning processes.

Contribution

It introduces a novel introspective approach with self-feedback for commonsense reasoning, outperforming existing methods and enhancing transparency.

Findings

01

Crystal significantly outperforms supervised finetuning methods.

02

It enhances the transparency of the reasoning process.

03

Reinforcement learning effectively tunes introspection and reasoning modes.

Abstract

Extensive work has shown that the performance and interpretability of commonsense reasoning can be improved via knowledge-augmented reasoning methods, where the knowledge that underpins the reasoning process is explicitly verbalized and utilized. However, existing implementations, including "chain-of-thought" and its variants, fall short in capturing the introspective nature of knowledge required in commonsense reasoning, and in accounting for the mutual adaptation between the generation and utilization of knowledge. We propose a novel method to develop an introspective commonsense reasoner, Crystal. To tackle commonsense problems, it first introspects for knowledge statements related to the given question, and subsequently makes an informed prediction that is grounded in the previously introspected knowledge. The knowledge introspection and knowledge-grounded reasoning modes of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

liujch1998/crystal
pytorchOfficial

Models

🤗
liujch1998/crystal-11b
model· 6 dl· ♡ 1
6 dl♡ 1

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Bayesian Modeling and Causal Inference