Multi-agent AI systems outperform human teams in creativity
Tiancheng Hu, Yixuan Jiang, Haotian Li, Jos\'e Hern\'andez-Orallo, Xing Xie, Nigel Collier, David Stillwell, Luning Sun

TL;DR
Multi-agent large language model teams significantly outperform human teams in creativity across diverse tasks by leveraging conversation dynamics and exploration strategies.
Contribution
This study demonstrates that multi-agent LLM teams surpass human teams in creativity and identifies conversational patterns that enhance creative output.
Findings
Multi-agent LLM teams outperform humans in creativity (Cohen's d=1.50).
Creative ideas are linked to wide-ranging conversations and exploration strategies.
Model choice and discussion structure explain 26.8% of variance in conversational dynamics.
Abstract
Although artificial intelligence (AI) now matches or exceeds human performance across numerous cognitive tasks, creativity remains a highly contested frontier. As AI systems based on large language models (LLMs) are increasingly adopted in research and innovation, it is essential to understand and augment their creativity. Here we demonstrate that multi-agent LLM teams not only surpass single agents, but also substantially outperform human teams in creativity (Cohen's d=1.50) across 4,541 multi-agent LLM ideas and 341 human-team ideas on six diverse problem-solving tasks. This advantage is driven by novelty while maintaining comparable usefulness. To investigate the generative processes in both groups, we represent conversations as paths through semantic space using neural language model representations. Both LLM and human teams produce more creative ideas when conversations range…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
