Judgment-of-Thought Prompting: A Courtroom-Inspired Framework for Binary Logical Reasoning with Large Language Models
Sungjune Park, Heehwan Kim, Haehyun Cho, Daeseon Choi

TL;DR
The paper introduces Judgment of Thought (JoT), a multi-agent prompting framework inspired by courtroom roles, which significantly improves binary logical reasoning accuracy in large language models through debate and systematic evaluation.
Contribution
JoT is a novel multi-agent prompting approach that models courtroom roles to enhance logical reasoning in large language models, outperforming existing methods on key benchmarks.
Findings
Achieves 98% accuracy on Boolean expressions
Outperforms existing prompting approaches on benchmarks
Ablation studies confirm the importance of each role and feedback mechanisms
Abstract
This paper proposes a novel prompting approach, Judgment of Thought (JoT), specifically tailored for binary logical reasoning tasks. Despite advances in prompt engineering, existing approaches still face limitations in handling complex logical reasoning tasks. To address these issues, JoT introduces a multi-agent approach with three specialized roleslawyer, prosecutor, and judgewhere a high-level model acts as the judge, and lower-level models serve as lawyer and prosecutor to systematically debate and evaluate arguments. Experimental evaluations on benchmarks such as BigBenchHard and Winogrande demonstrate JoT's superior performance compared to existing prompting approaches, achieving notable improvements, including 98\% accuracy in Boolean expressions. Also, our ablation studies…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Natural Language Processing Techniques
