Pragmatic competence of pre-trained language models through the lens of   discourse connectives

Lalchand Pandia; Yan Cong; Allyson Ettinger

arXiv:2109.12951·cs.CL·September 28, 2021

Pragmatic competence of pre-trained language models through the lens of discourse connectives

Lalchand Pandia, Yan Cong, Allyson Ettinger

PDF

Open Access

TL;DR

This paper evaluates pre-trained language models' pragmatic understanding of discourse connectives, revealing they perform reasonably in natural contexts but lack sensitivity to high-level pragmatic cues and human-like temporal preferences.

Contribution

It introduces a novel framework for assessing pragmatic competence in language models focusing on discourse connectives using psycholinguistic-inspired tests.

Findings

01

Models predict connectives well in natural data

02

Sensitivity to high-level pragmatic cues is low

03

Models lack human-like temporal preferences

Abstract

As pre-trained language models (LMs) continue to dominate NLP, it is increasingly important that we understand the depth of language capabilities in these models. In this paper, we target pre-trained LMs' competence in pragmatics, with a focus on pragmatics relating to discourse connectives. We formulate cloze-style tests using a combination of naturally-occurring data and controlled inputs drawn from psycholinguistics. We focus on testing models' ability to use pragmatic cues to predict discourse connectives, models' ability to understand implicatures relating to connectives, and the extent to which models show humanlike preferences regarding temporal dynamics of connectives. We find that although models predict connectives reasonably well in the context of naturally-occurring data, when we control contexts to isolate high-level pragmatic cues, model sensitivity is much lower. Models…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems