Multi-agent AI systems outperform human teams in creativity

Tiancheng Hu; Yixuan Jiang; Haotian Li; Jos\'e Hern\'andez-Orallo; Xing Xie; Nigel Collier; David Stillwell; Luning Sun

arXiv:2605.17885·cs.CL·May 19, 2026

Multi-agent AI systems outperform human teams in creativity

Tiancheng Hu, Yixuan Jiang, Haotian Li, Jos\'e Hern\'andez-Orallo, Xing Xie, Nigel Collier, David Stillwell, Luning Sun

PDF

TL;DR

Multi-agent large language model teams significantly outperform human teams in creativity across diverse tasks by leveraging conversation dynamics and exploration strategies.

Contribution

This study demonstrates that multi-agent LLM teams surpass human teams in creativity and identifies conversational patterns that enhance creative output.

Findings

01

Multi-agent LLM teams outperform humans in creativity (Cohen's d=1.50).

02

Creative ideas are linked to wide-ranging conversations and exploration strategies.

03

Model choice and discussion structure explain 26.8% of variance in conversational dynamics.

Abstract

Although artificial intelligence (AI) now matches or exceeds human performance across numerous cognitive tasks, creativity remains a highly contested frontier. As AI systems based on large language models (LLMs) are increasingly adopted in research and innovation, it is essential to understand and augment their creativity. Here we demonstrate that multi-agent LLM teams not only surpass single agents, but also substantially outperform human teams in creativity (Cohen's d=1.50) across 4,541 multi-agent LLM ideas and 341 human-team ideas on six diverse problem-solving tasks. This advantage is driven by novelty while maintaining comparable usefulness. To investigate the generative processes in both groups, we represent conversations as paths through semantic space using neural language model representations. Both LLM and human teams produce more creative ideas when conversations range…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.