From Words to Worlds: Compositionality for Cognitive Architectures

Ruchira Dhar; Anders Søgaard

arXiv:2407.13419·cs.CL·May 21, 2025

Open Access

TL;DR

This paper investigates the compositional abilities of large language models, revealing that scaling improves these skills but instruction tuning may reduce them, raising questions about aligning models with human cognition.

Contribution

It provides empirical analysis across multiple LLMs and tasks, highlighting the complex effects of scaling and instruction tuning on compositionality.

Findings

01. Scaling enhances compositional strategies in LLMs

02. Instruction tuning can decrease compositional abilities

03. Open issues in aligning LLMs with human cognition

Abstract

Large language models (LLMs) are very performant connectionist systems, but do they exhibit more compositionality? More importantly, is that part of why they perform so well? We present empirical analyses across four LLM families (12 models) and three task categories, including a novel task introduced below. Our findings reveal a nuanced relationship in how LLMs learn compositional strategies: while scaling enhances compositional abilities, instruction tuning often has the reverse effect. This disparity raises open issues regarding the development and improvement of large language models in alignment with human cognitive capacities.

Taxonomy

Topics: Constraint Satisfaction and Optimization