Judgment-of-Thought Prompting: A Courtroom-Inspired Framework for Binary Logical Reasoning with Large Language Models

Sungjune Park; Heehwan Kim; Haehyun Cho; Daeseon Choi

arXiv:2409.16635·cs.AI·May 23, 2025

Judgment-of-Thought Prompting: A Courtroom-Inspired Framework for Binary Logical Reasoning with Large Language Models

Sungjune Park, Heehwan Kim, Haehyun Cho, Daeseon Choi

PDF

Open Access

TL;DR

The paper introduces Judgment of Thought (JoT), a multi-agent prompting framework inspired by courtroom roles, which significantly improves binary logical reasoning accuracy in large language models through debate and systematic evaluation.

Contribution

JoT is a novel multi-agent prompting approach that models courtroom roles to enhance logical reasoning in large language models, outperforming existing methods on key benchmarks.

Findings

01

Achieves 98% accuracy on Boolean expressions

02

Outperforms existing prompting approaches on benchmarks

03

Ablation studies confirm the importance of each role and feedback mechanisms

Abstract

This paper proposes a novel prompting approach, Judgment of Thought (JoT), specifically tailored for binary logical reasoning tasks. Despite advances in prompt engineering, existing approaches still face limitations in handling complex logical reasoning tasks. To address these issues, JoT introduces a multi-agent approach with three specialized roles $\unicode x 2010$ $\unicode x 2010$ $\unicode x 2010$ lawyer, prosecutor, and judge $\unicode x 2010$ $\unicode x 2010$ $\unicode x 2010$ where a high-level model acts as the judge, and lower-level models serve as lawyer and prosecutor to systematically debate and evaluate arguments. Experimental evaluations on benchmarks such as BigBenchHard and Winogrande demonstrate JoT's superior performance compared to existing prompting approaches, achieving notable improvements, including 98\% accuracy in Boolean expressions. Also, our ablation studies…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSemantic Web and Ontologies · Natural Language Processing Techniques