Rainier: Reinforced Knowledge Introspector for Commonsense Question   Answering

Jiacheng Liu; Skyler Hallinan; Ximing Lu; Pengfei He; Sean Welleck,; Hannaneh Hajishirzi; Yejin Choi

arXiv:2210.03078·cs.CL·October 25, 2022·1 cites

Rainier: Reinforced Knowledge Introspector for Commonsense Question Answering

Jiacheng Liu, Skyler Hallinan, Ximing Lu, Pengfei He, Sean Welleck,, Hannaneh Hajishirzi, Yejin Choi

PDF

Open Access 1 Repo 1 Models

TL;DR

Rainier is a reinforcement learning-based system that generates relevant knowledge to improve commonsense question answering, outperforming larger models and enhancing performance across multiple benchmarks.

Contribution

Rainier introduces a novel reinforcement learning approach to generate high-quality, contextually relevant knowledge for QA, surpassing GPT-3 generated knowledge without direct supervision.

Findings

01

Rainier improves QA performance on 9 benchmarks.

02

Generated knowledge from smaller models exceeds GPT-3 quality.

03

Reinforcement learning effectively aligns knowledge generation with QA performance.

Abstract

Knowledge underpins reasoning. Recent research demonstrates that when relevant knowledge is provided as additional context to commonsense question answering (QA), it can substantially enhance the performance even on top of state-of-the-art. The fundamental challenge is where and how to find such knowledge that is high quality and on point with respect to the question; knowledge retrieved from knowledge bases are incomplete and knowledge generated from language models are inconsistent. We present Rainier, or Reinforced Knowledge Introspector, that learns to generate contextually relevant knowledge in response to given questions. Our approach starts by imitating knowledge generated by GPT-3, then learns to generate its own knowledge via reinforcement learning where rewards are shaped based on the increased performance on the resulting question answering. Rainier demonstrates substantial…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

liujch1998/rainier
pytorchOfficial

Models

🤗
liujch1998/rainier-large
model· 6 dl· ♡ 2
6 dl♡ 2

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Dropout · Layer Normalization · {Dispute@FaQ-s}How to file a dispute with Expedia? · Refunds@Expedia|||How do I get a full refund from Expedia? · Cosine Annealing · Residual Connection · Attention Dropout