# Private speech: similarities between a large language model and children

**Authors:** Zhiyu Liang, Leon On Tay, Simon Dennis

PMC · DOI: 10.3389/frai.2026.1691074 · Frontiers in Artificial Intelligence · 2026-01-29

## TL;DR

This study compares the private speech of a large language model to that of children, finding surprising similarities under specific task conditions.

## Contribution

The study introduces a novel framework for comparing private speech in large language models and children using a block-construction task.

## Key findings

- GPT-4o's private speech showed strong similarity to recent human benchmarks (r = 0.93) but not classic ones (r = 0.01).
- Task nature (goal-directed vs. self-determined) strongly influences private speech patterns in both models and children.
- Private speech was observed in GPT-3.5-Turbo-instruct during a serial recall task, suggesting broader applicability.

## Abstract

This study investigates the capability of a non-reasoning large language model (GPT-4o) to generate private speech and evaluates its similarity to human private speech. We placed the model in a simulated solitary block-construction scenario via textual prompts, eliciting and classifying its self-directed utterances using an established semantic framework for categorizing private speech in children. The distribution of these categories was compared to two human benchmarks: a classic block-construction study and a more recent experiment employing a similar task setting. Analysis using scatter plots and Pearson correlation coefficients revealed a striking pattern: GPT-4o’s semantic profile showed negligible similarity to the classic benchmark (r = 0.01) but very strong similarity to the recent benchmark (r = 0.93). This discrepancy is interpreted as stemming from differences in task nature, namely goal-directed, scaffolded task versus self-determined, unscaffolded play, which exert a stronger influence on speech content than experimental subject difference between GPT-4o and children. In an exploratory serial recall study, we tasked GPT-3.5-Turbo-instruct and observed incidental private speech, indicating that the phenomenon extends across contexts. This provides an avenue for investigating LLM replication of private speech and, potentially, computational consciousness.

## Full-text entities

- **Diseases:** LLMs (MESH:D007806)
- **Chemicals:** 4o (-)
- **Species:** Homo sapiens (human, species) [taxon 9606], Liphistius sp. LM (species) [taxon 1285381]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12894341/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12894341/full.md

## References

45 references — full list in the complete paper: https://tomesphere.com/paper/PMC12894341/full.md

---
Source: https://tomesphere.com/paper/PMC12894341