# Making sense of transformer success

**Authors:** Nicola Angius, Pietro Perconti, Alessio Plebe, Alessandro Acciai

PMC · DOI: 10.3389/frai.2025.1509338 · Frontiers in Artificial Intelligence · 2025-04-01

## TL;DR

This paper explores how neural language models, like Transformers, achieve linguistic competence and compares their explanations to theories in cognitive science.

## Contribution

The paper introduces a mechanist explanation of in-context learning and the concept of co-simulation between NLMs and the brain.

## Key findings

- Explanatory strategies for NLMs are similar to those used in cognitive science for human intelligence.
- The copying algorithm and induction head phenomenon offer mechanist explanations for in-context learning.
- NLMs can co-simulate with the brain to predict language-related brain activation patterns.

## Abstract

This article provides an epistemological analysis of current attempts of explaining how the relatively simple algorithmic components of neural language models (NLMs) provide them with genuine linguistic competence. After introducing the Transformer architecture, at the basis of most of current NLMs, the paper firstly emphasizes how the central question in the philosophy of AI has been shifted from “can machines think?”, as originally put by Alan Turing, to “how can machines think?”, pointing to an explanatory gap for NLMs. Subsequently, existing explanatory strategies for the functioning of NLMs are analyzed to argue that they, however debated, do not differ from the explanatory strategies used in cognitive science to explain intelligent behaviors of humans. In particular, available experimental studies turned to test the theory of mind, discourse entity tracking, and property induction in NLMs are examined under the light of the functional analysis in the philosophy of cognitive science; the so-called copying algorithm and the induction head phenomenon of a Transformer are shown to provide a mechanist explanation of in-context learning; finally, current pioneering attempts to use NLMs to predict brain activation patterns when processing language are here shown to involve what we call a co-simulation, in which a NLM and the brain are used to simulate and understand each other.

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11996879/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11996879/full.md

## References

163 references — full list in the complete paper: https://tomesphere.com/paper/PMC11996879/full.md

---
Source: https://tomesphere.com/paper/PMC11996879