Evidence from counterfactual tasks supports emergent analogical   reasoning in large language models

Taylor Webb; Keith J. Holyoak; Hongjing Lu

arXiv:2404.13070·cs.CL·May 1, 2024

Evidence from counterfactual tasks supports emergent analogical reasoning in large language models

Taylor Webb, Keith J. Holyoak, Hongjing Lu

PDF

Open Access 1 Repo

TL;DR

This paper defends previous findings that large language models can perform analogical reasoning in zero-shot settings, demonstrating their ability to generalize to counterfactual tasks despite critiques.

Contribution

It clarifies misunderstandings about the original tests and provides evidence that language models can generalize to counterfactual analogy tasks.

Findings

01

Language models solve counterfactual analogy tasks.

02

Models generalize beyond training data.

03

Clarification of previous misunderstandings.

Abstract

We recently reported evidence that large language models are capable of solving a wide range of text-based analogy problems in a zero-shot manner, indicating the presence of an emergent capacity for analogical reasoning. Two recent commentaries have challenged these results, citing evidence from so-called `counterfactual' tasks in which the standard sequence of the alphabet is arbitrarily permuted so as to decrease similarity with materials that may have been present in the language model's training data. Here, we reply to these critiques, clarifying some misunderstandings about the test materials used in our original work, and presenting evidence that language models are also capable of generalizing to these new counterfactual task variants.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

taylorwwebb/counterfactual_analogies
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Language and cultural evolution · Topic Modeling