# Assessing the Relational Abilities of Large Language Models and Large Reasoning Models

**Authors:** Matthias Raemaekers, Martin Finn, Jan De Houwer

PMC · DOI: 10.3390/bs16010045 · Behavioral Sciences · 2025-12-25

## TL;DR

This paper evaluates how well large language and reasoning models handle relational tasks using thousands of syllogistic problems.

## Contribution

The study introduces a new framework for assessing general relational abilities in artificial systems using diverse and complex syllogistic problems.

## Key findings

- Models performed well overall in the relational task battery.
- Performance varied across different types of relations and was minimally affected by task variations.
- Model performance remained robust despite randomization of premise order.

## Abstract

We assessed the relational abilities of two state-of-the-art large language models (LLMs) and two large reasoning models (LRMs) using a new battery of several thousand syllogistic problems, similar to those used in behavior-analytic tasks for relational abilities. To probe the models’ general (as opposed to task- or domain-specific) abilities, the problems involved multiple relations (sameness, difference, comparison, hierarchy, analogy, temporal and deictic), specified between randomly selected nonwords and varied in terms of complexity (number of premises, inclusion of irrelevant premises) and format (valid or invalid conclusion prompted). We also tested transformations of stimulus function. Our results show that the models generally performed well in this new task battery. The models did show some variability across different relations and were to a limited extent affected by task variations. Model performance was, however, robust against the randomization of premise order in a replication study. Our research provides a new framework for testing a core aspect of intellectual (i.e., relational) abilities in artificial systems; we discuss the implications of this and future research directions.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12837307/full.md

## Figures

13 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12837307/full.md

## References

72 references — full list in the complete paper: https://tomesphere.com/paper/PMC12837307/full.md

---
Source: https://tomesphere.com/paper/PMC12837307