Nash CoT: Multi-Path Inference with Preference Equilibrium

Ziqi Zhang; Cunxiang Wang; Xiong Xiao; Yue Zhang; Donglin Wang

arXiv:2407.07099·cs.CL·December 31, 2024

Nash CoT: Multi-Path Inference with Preference Equilibrium

Ziqi Zhang, Cunxiang Wang, Xiong Xiao, Yue Zhang, Donglin Wang

PDF

Open Access 1 Repo 1 Video

TL;DR

Nash CoT introduces a game-theoretic approach to multi-path inference in large language models, balancing role-specific and general reasoning to improve accuracy and reduce inference costs.

Contribution

The paper proposes Nash CoT, a novel method that balances role-specific and general reasoning in multi-path inference, enhancing accuracy while reducing the number of inference paths needed.

Findings

01

Achieves comparable or better results than traditional multi-path CoT.

02

Reduces inference path requirements without sacrificing performance.

03

Effective across diverse inference tasks like reasoning and question answering.

Abstract

Chain of thought (CoT) is a reasoning framework that can enhance the performance of Large Language Models (LLMs) on complex inference tasks. In particular, among various studies related to CoT, multi-path inference stands out as a simple yet effective improvement. However, there is no optimal setting for the number of inference paths. Therefore, we have to increase the number of inference paths to obtain better results, which in turn increases the inference cost. To address this limitation, we can utilize question-related role templates to guide LLMs into relevant roles, thereby increasing the possibility of correct inferences for each path and further reducing dependence on the number of inference paths while improving reasoning accuracy. However, placing LLMs into specific roles may reduce their reasoning diversity and performance on a few tasks where role dependence is low. To…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

stevezhangza/nash-chain-of-thought
pytorchOfficial

Videos

Nash CoT: Multi-Path Inference with Preference Equilibrium· underline

Taxonomy

TopicsGame Theory and Applications