Universe of Thoughts: Enabling Creative Reasoning with Large Language Models

Yuto Suzuki; Farnoush Banaei-Kashani

arXiv:2511.20471·cs.AI·November 27, 2025

Universe of Thoughts: Enabling Creative Reasoning with Large Language Models

Yuto Suzuki, Farnoush Banaei-Kashani

PDF

Open Access 3 Reviews

TL;DR

This paper introduces a new framework and methods for enabling large language models to perform creative reasoning, addressing the gap in generating innovative solutions in expansive problem spaces.

Contribution

It proposes a computational framework for creative reasoning inspired by cognitive science, introduces the Universe of Thoughts methods, and develops new tasks and benchmarks for evaluating creativity in LLMs.

Findings

01

UoT outperforms state-of-the-art reasoning techniques in creative problem-solving.

02

The framework enables systematic exploration of the universe of thoughts for innovative solutions.

03

New benchmarks effectively measure creativity in terms of feasibility, utility, and novelty.

Abstract

Reasoning based on Large Language Models (LLMs) has garnered increasing attention due to outstanding performance of these models in mathematical and complex logical tasks. Beginning with the Chain-of-Thought (CoT) prompting technique, numerous reasoning methods have emerged that decompose problems into smaller, sequential steps (or thoughts). However, existing reasoning models focus on conventional problem-solving and do not necessarily generate creative solutions by ``creative reasoning''. In domains where the solution space is expansive and conventional solutions are suboptimal, such as drug discovery or business strategization, creative reasoning to discover innovative solutions is crucial. To address this gap, first we introduce a computational framework for creative reasoning inspired by established cognitive science principles. With this framework, we propose three core creative…

Peer Reviews

Decision·Submitted to ICLR 2026

Reviewer 01Rating 6Confidence 4

Strengths

1. The paper studies an interesting problem about creative reasoning and proposed UoT with combinatory, exploratory, and transformative reasoning, which is intuitive and showed good results. 2. The UoT part is backed with detailed formalization and complexity analysis in the appendix.

Weaknesses

The quality of both the constructed tasks and evaluation methods remain unclear. 1. I am not sure how much I can trust the experimental results as the quality of the set of thoughts, search space, environment, etc., for the three tasks (One-Lane Bridge, Electricity Tariff, Social Cohesion) needs further validation. I could imagine this requires significant amount of engineering/simulation work and domain expert knowledge. However, no discussion about the the process and validation to ensure t

Reviewer 02Rating 6Confidence 3

Strengths

Timely contribution to the recent area of improving creativity in LLMs, with new creativity benchmark task. The proposed approach is sound and based on well-studied paradigm of creative reasoning. Experiments show that with proposed approaches, weaker (GPT-4o) LLM can outperform stronger comparison (GPT-5) in the measure of creativity proposed by the authors

Weaknesses

Generalization to other creative tasks remains to be seen. The proposed creativity improvement approach seems to be tailored for the benchmark tasks proposed. The approach seems to rely mostly on special prompts to help evoke creativity in LLMs and hence has limited technical novelty beyond prompt engineering.

Reviewer 03Rating 2Confidence 4

Strengths

Strengths: - This paper tackles an interesting topic, which formalizes “creative reasoning” for LLMs. - By grounding the conceptual framework in cognitive science (Boden’s theory), this paper establishes a solid theoretical background for creative reasoning. - The proposed benchmark may inspire further exploration of creativity evaluation in reasoning systems.

Weaknesses

Weaknesses: - Limited Novelty and Positioning. The notion of enabling creativity in LLMs is not new. Substantial prior work has investigated open-ended or ill-posed tasks, such as creative writing [5], story generation [7], and idea synthesis [6]. The paper does not adequately engage with this literature or clarify how its proposal differs conceptually or methodologically. - Missing Discussion of Established Reasoning Paradigms. The manuscript omits connections to analogical reasoning [1,2], i

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Materials Science · Artificial Intelligence in Healthcare and Education · Creativity in Education and Neuroscience