LLM Cognitive Judgements Differ From Human

Sotiris Lamprinidis

arXiv:2307.11787·cs.CL·August 17, 2023

LLM Cognitive Judgements Differ From Human

Sotiris Lamprinidis

PDF

Open Access 1 Repo

TL;DR

This paper investigates GPT-3 and ChatGPT's performance on a cognitive science inductive reasoning task, revealing that their cognitive judgments significantly differ from human reasoning patterns.

Contribution

It provides an empirical comparison showing that LLMs' cognitive judgments are not aligned with human-like reasoning, highlighting limitations in their cognitive modeling.

Findings

01

LLMs' judgments differ from humans in inductive reasoning tasks

02

GPT-3 and ChatGPT show limited human-like cognitive capabilities

03

Results suggest LLMs may not serve as accurate models of human cognition

Abstract

Large Language Models (LLMs) have lately been on the spotlight of researchers, businesses, and consumers alike. While the linguistic capabilities of such models have been studied extensively, there is growing interest in investigating them as cognitive subjects. In the present work I examine GPT-3 and ChatGPT capabilities on an limited-data inductive reasoning task from the cognitive science literature. The results suggest that these models' cognitive judgements are not human-like.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sotlampr/llm-cognitive-judgements
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Artificial Intelligence in Healthcare and Education · Natural Language Processing Techniques

Methods15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Adam · Layer Normalization · Attention Dropout · Linear Layer · Softmax · Cosine Annealing · Dense Connections · Weight Decay