Vera: A General-Purpose Plausibility Estimation Model for Commonsense Statements

Jiacheng Liu; Wenya Wang; Dianzhuo Wang; Noah A. Smith; Yejin Choi; Hannaneh Hajishirzi

arXiv:2305.03695·cs.CL·October 19, 2023·2 cites

Open Access · 1 Repo · 1 Model · 1 Dataset

TL;DR

Vera is a versatile plausibility estimation model trained on extensive commonsense data, effectively identifying correct statements, filtering generated knowledge, and detecting errors across diverse domains and unseen tasks.

Contribution

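Introduces Vera, a general-purpose plausibility estimation model for commonsense statements. Vera is trained on ~7M statements created from 19 QA datasets and two large-scale knowledge bases, using a combination of three training objectives, and it outperforms existing models that can be repurposed for commonsense verification.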

Findings

01

Vera outperforms existing models in commonsense verification.

02

Vera generalizes well to unseen tasks.

03

Vera effectively filters and detects erroneous commonsense statements.

Abstract

Despite the much discussed capabilities of today's language models, they are still prone to silly and unexpected commonsense failures. We consider a retrospective verification approach that reflects on the correctness of LM outputs, and introduce Vera, a general-purpose model that estimates the plausibility of declarative statements based on commonsense knowledge. Trained on ~7M commonsense statements created from 19 QA datasets and two large-scale knowledge bases, and with a combination of three training objectives, Vera is a versatile model that effectively separates correct from incorrect statements across diverse commonsense domains. When applied to solving commonsense problems in the verification format, Vera substantially outperforms existing models that can be repurposed for commonsense verification, and it further exhibits generalization capabilities to unseen tasks and provides…
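The "verification format" mentioned above can be illustrated with a minimal sketch: each answer candidate is rewritten as a declarative statement, every statement receives a plausibility score, and the highest-scoring one is selected. The same scores can be used to filter generated statements against a cutoff. Everything named here is an assumption made for illustration: `toy_score` is a lookup-table stand-in and not Vera, the function names are hypothetical, and the 0.5 threshold is arbitrary rather than a value taken from the paper.

```python
from typing import Callable, List, Sequence

# Hypothetical interface: any function mapping a statement to a score in [0, 1].
PlausibilityFn = Callable[[str], float]


def pick_most_plausible(statements: Sequence[str], score: PlausibilityFn) -> int:
    """Return the index of the statement with the highest plausibility score."""
    if not statements:
        raise ValueError("need at least one candidate statement")
    scores = [score(s) for s in statements]
    return max(range(len(scores)), key=scores.__getitem__)


def filter_plausible(
    statements: Sequence[str], score: PlausibilityFn, threshold: float = 0.5
) -> List[str]:
    """Keep statements scoring at or above the threshold (0.5 is an assumed cutoff)."""
    return [s for s in statements if score(s) >= threshold]


def toy_score(statement: str) -> float:
    """Stand-in scorer for illustration only; a real setup would call a trained model."""
    lookup = {
        "Birds have two legs.": 0.9,
        "Birds have four legs.": 0.1,
    }
    # Unknown statements get a neutral score.
    return lookup.get(statement, 0.5)


candidates = ["Birds have four legs.", "Birds have two legs."]
best = pick_most_plausible(candidates, toy_score)
kept = filter_plausible(candidates, toy_score)
print(candidates[best])
print(kept)
```

In practice `toy_score` would be swapped for a call to a trained plausibility model; the selection and filtering logic stays the same.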

Code & Models

Repositories

liujch1998/vera
PyTorch · Official

Models

🤗
liujch1998/vera
model · 32 dl · ♡ 13

Datasets

liujch1998/vera_contrib
dataset · 47 dl

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Speech and dialogue systems