Probing Commonsense Knowledge in Pre-trained Language Models with   Sense-level Precision and Expanded Vocabulary

Daniel Loureiro; Al\'ipio M\'ario Jorge

arXiv:2210.06376·cs.CL·October 13, 2022

Probing Commonsense Knowledge in Pre-trained Language Models with Sense-level Precision and Expanded Vocabulary

Daniel Loureiro, Al\'ipio M\'ario Jorge

PDF

Open Access 1 Repo

TL;DR

This paper introduces SenseLAMA, a sense-level cloze task using an enriched vocabulary from WordNet, enabling more precise evaluation of commonsense knowledge in pre-trained language models like BERT.

Contribution

It presents a method to augment language models with a grounded sense inventory, allowing sense-level queries without additional training, and introduces the SenseLAMA benchmark for evaluation.

Findings

01

BERT with WordNet enrichment (SynBERT) learns non-trivial commonsense knowledge.

02

SynBERT outperforms similarity-based approaches in commonsense tasks.

03

Enrichment enables finer-grained, sense-level commonsense evaluation.

Abstract

Progress on commonsense reasoning is usually measured from performance improvements on Question Answering tasks designed to require commonsense knowledge. However, fine-tuning large Language Models (LMs) on these specific tasks does not directly evaluate commonsense learned during pre-training. The most direct assessments of commonsense knowledge in pre-trained LMs are arguably cloze-style tasks targeting commonsense assertions (e.g., A pen is used for [MASK].). However, this approach is restricted by the LM's vocabulary available for masked predictions, and its precision is subject to the context provided by the assertion. In this work, we present a method for enriching LMs with a grounded sense inventory (i.e., WordNet) available at the vocabulary level, without further training. This modification augments the prediction space of cloze-style prompts to the size of a large ontology…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

danlou/synbert
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems

MethodsAttention Is All You Need · Linear Layer · Refunds@Expedia|||How do I get a full refund from Expedia? · Residual Connection · Dropout · Weight Decay · Adam · Dense Connections · Linear Warmup With Linear Decay · Multi-Head Attention