The Goldilocks of Pragmatic Understanding: Fine-Tuning Strategy Matters   for Implicature Resolution by LLMs

Laura Ruis; Akbir Khan; Stella Biderman; Sara Hooker; Tim; Rockt\"aschel; Edward Grefenstette

arXiv:2210.14986·cs.CL·December 5, 2023·21 cites

The Goldilocks of Pragmatic Understanding: Fine-Tuning Strategy Matters for Implicature Resolution by LLMs

Laura Ruis, Akbir Khan, Stella Biderman, Sara Hooker, Tim, Rockt\"aschel, Edward Grefenstette

PDF

Open Access 1 Repo 3 Videos

TL;DR

This paper investigates how different fine-tuning strategies affect large language models' ability to understand pragmatic inferences, revealing that instruction tuning significantly improves their contextual interpretative skills.

Contribution

It demonstrates that instruction-tuned LLMs outperform other models in pragmatic inference tasks, highlighting the importance of fine-tuning strategies for contextual understanding.

Findings

01

Instruction-tuned models perform significantly better on implicature tasks.

02

Most models perform close to random on pragmatic inference without specific tuning.

03

Fine-tuning strategies greatly influence models' ability to interpret language in context.

Abstract

Despite widespread use of LLMs as conversational agents, evaluations of performance fail to capture a crucial aspect of communication: interpreting language in context -- incorporating its pragmatics. Humans interpret language using beliefs and prior knowledge about the world. For example, we intuitively understand the response "I wore gloves" to the question "Did you leave fingerprints?" as meaning "No". To investigate whether LLMs have the ability to make this type of inference, known as an implicature, we design a simple task and evaluate four categories of widely used state-of-the-art models. We find that, despite only evaluating on utterances that require a binary inference (yes or no), models in three of these categories perform close to random. However, LLMs instruction-tuned at the example-level perform significantly better. These results suggest that certain fine-tuning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lauraruis/do-pigs-fly
noneOfficial

Videos

#103 - Prof. Edward Grefenstette - Language, Semantics, Philosophy· youtube

#84 LAURA RUIS - Large language models are not zero-shot communicators [NEURIPS UNPLUGGED]· youtube

The Goldilocks of Pragmatic Understanding: Fine-Tuning Strategy Matters for Implicature Resolution by LLMs· slideslive

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems

Methodsfail