Zero-shot Commonsense Question Answering with Cloze Translation and Consistency Optimization
Zi-Yi Dou, Nanyun Peng

TL;DR
This paper explores translating natural questions into cloze-style prompts to better extract implicit commonsense knowledge from pre-trained language models for zero-shot question answering, achieving state-of-the-art results.
Contribution
It introduces four translation methods and a consistency optimization technique to enhance zero-shot commonsense question answering without relying on external knowledge bases.
Findings
Effective translation methods improve zero-shot performance.
Consistency optimization enhances model predictions.
Combining methods achieves state-of-the-art results.
Abstract
Commonsense question answering (CQA) aims to test if models can answer questions regarding commonsense knowledge that everyone knows. Prior works that incorporate external knowledge bases have shown promising results, but knowledge bases are expensive to construct and are often limited to a fixed set of relations. In this paper, we instead focus on better utilizing the \textit{implicit knowledge} stored in pre-trained language models. While researchers have found that the knowledge embedded in pre-trained language models can be extracted by having them fill in the blanks of carefully designed prompts for relation extraction and text classification, it remains unclear if we can adopt this paradigm in CQA where the inputs and outputs take much more flexible forms. To this end, we investigate four translation methods that can translate natural questions into cloze-style sentences to better…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques
MethodsBalanced Selection
