Reasoning Beyond the Obvious: Evaluating Divergent and Convergent Thinking in LLMs for Financial Scenarios

Zhuang Qiang Bok; Watson Wei Khong Chua

arXiv:2507.18368·cs.AI·July 25, 2025

Reasoning Beyond the Obvious: Evaluating Divergent and Convergent Thinking in LLMs for Financial Scenarios

Zhuang Qiang Bok, Watson Wei Khong Chua

PDF

Open Access

TL;DR

This paper introduces ConDiFi, a benchmark for evaluating both divergent and convergent reasoning in large language models within financial scenarios, highlighting differences in model performance on creative and decision-making tasks.

Contribution

The paper presents ConDiFi, a novel benchmark that assesses reasoning beyond factual accuracy, specifically targeting financial decision-making and creative thinking in LLMs.

Findings

01

GPT-4o underperforms on novelty and actionability

02

DeepSeek-R1 and Cohere Command R+ excel in actionable insights

03

Models show significant variation in reasoning capabilities

Abstract

Most reasoning benchmarks for LLMs emphasize factual accuracy or step-by-step logic. In finance, however, professionals must not only converge on optimal decisions but also generate creative, plausible futures under uncertainty. We introduce ConDiFi, a benchmark that jointly evaluates divergent and convergent thinking in LLMs for financial tasks. ConDiFi features 607 macro-financial prompts for divergent reasoning and 990 multi-hop adversarial MCQs for convergent reasoning. Using this benchmark, we evaluated 14 leading models and uncovered striking differences. Despite high fluency, GPT-4o underperforms on Novelty and Actionability. In contrast, models like DeepSeek-R1 and Cohere Command R+ rank among the top for generating actionable, insights suitable for investment decisions. ConDiFi provides a new perspective to assess reasoning capabilities essential to safe and strategic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFinTech, Crowdfunding, Digital Finance