CDW-CoT: Clustered Distance-Weighted Chain-of-Thoughts Reasoning
Yuanheng Fang, Guoqing Chao, Wenqiang Lei, Shaobo Li, Dianhui Chu

TL;DR
CDW-CoT enhances large language model reasoning by dynamically tailoring prompts to data characteristics through clustering, significantly improving accuracy across diverse reasoning tasks.
Contribution
It introduces a novel clustering-based prompt optimization framework that adapts prompts to data diversity, outperforming traditional static CoT methods.
Findings
Achieves 25.34% accuracy improvement on LLaMA2 (13B).
Achieves 15.72% accuracy improvement on LLaMA3 (8B).
Outperforms traditional CoT across six datasets.
Abstract
Large Language Models (LLMs) have recently achieved impressive results in complex reasoning tasks through Chain of Thought (CoT) prompting. However, most existing CoT methods rely on using the same prompts, whether manually designed or automatically generated, to handle the entire dataset. This one-size-fits-all approach may fail to meet the specific needs arising from the diversities within a single dataset. To solve this problem, we propose the Clustered Distance-Weighted Chain of Thought (CDW-CoT) method, which dynamically constructs prompts tailored to the characteristics of each data instance by integrating clustering and prompt optimization techniques. Our method employs clustering algorithms to categorize the dataset into distinct groups, from which a candidate pool of prompts is selected to reflect the inherent diversity within the dataset. For each cluster, CDW-CoT trains the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsSemantic Web and Ontologies · Rough Sets and Fuzzy Logic · Cognitive Computing and Networks
