Nash CoT: Multi-Path Inference with Preference Equilibrium
Ziqi Zhang, Cunxiang Wang, Xiong Xiao, Yue Zhang, Donglin Wang

TL;DR
Nash CoT introduces a game-theoretic approach to multi-path inference in large language models, balancing role-specific and general reasoning to improve accuracy and reduce inference costs.
Contribution
The paper proposes Nash CoT, a novel method that balances role-specific and general reasoning in multi-path inference, enhancing accuracy while reducing the number of inference paths needed.
Findings
Achieves comparable or better results than traditional multi-path CoT.
Reduces inference path requirements without sacrificing performance.
Effective across diverse inference tasks like reasoning and question answering.
Abstract
Chain of thought (CoT) is a reasoning framework that can enhance the performance of Large Language Models (LLMs) on complex inference tasks. In particular, among various studies related to CoT, multi-path inference stands out as a simple yet effective improvement. However, there is no optimal setting for the number of inference paths. Therefore, we have to increase the number of inference paths to obtain better results, which in turn increases the inference cost. To address this limitation, we can utilize question-related role templates to guide LLMs into relevant roles, thereby increasing the possibility of correct inferences for each path and further reducing dependence on the number of inference paths while improving reasoning accuracy. However, placing LLMs into specific roles may reduce their reasoning diversity and performance on a few tasks where role dependence is low. To…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsGame Theory and Applications
