Modeling Event Plausibility with Consistent Conceptual Abstraction

Ian Porada; Kaheer Suleman; Adam Trischler; and Jackie Chi Kit Cheung

arXiv:2104.10247·cs.CL·April 22, 2021

Modeling Event Plausibility with Consistent Conceptual Abstraction

Ian Porada, Kaheer Suleman, Adam Trischler, and Jackie Chi Kit Cheung

PDF

1 Repo

TL;DR

This paper investigates the inconsistency of Transformer-based models in judging event plausibility across conceptual classes and proposes a post-hoc method to improve their consistency and alignment with human judgments.

Contribution

It identifies the inconsistency issue in plausibility models across conceptual classes and introduces a simple post-hoc technique to enhance model consistency and human correlation.

Findings

01

Transformer models are inconsistent across conceptual classes.

02

Injecting lexical knowledge does not fully resolve inconsistency.

03

Post-hoc adjustment improves model plausibility correlation with humans.

Abstract

Understanding natural language requires common sense, one aspect of which is the ability to discern the plausibility of events. While distributional models -- most recently pre-trained, Transformer language models -- have demonstrated improvements in modeling event plausibility, their performance still falls short of humans'. In this work, we show that Transformer-based plausibility models are markedly inconsistent across the conceptual classes of a lexical hierarchy, inferring that "a person breathing" is plausible while "a dentist breathing" is not, for example. We find this inconsistency persists even when models are softly injected with lexical knowledge, and we present a simple post-hoc method of forcing model consistency that improves correlation with human plausibility judgements.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ianporada/modeling_event_plausibility
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsMulti-Head Attention · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Softmax · Layer Normalization · Label Smoothing · Residual Connection · Byte Pair Encoding · Adam