Token Signature: Predicting Chain-of-Thought Gains with Token Decoding Feature in Large Language Models
Peijie Liu, Fengli Xu, Yong Li

TL;DR
This paper introduces a novel method called Dynamic CoT that uses token probability distribution indicators to assess and improve chain-of-thought reasoning in large language models, reducing token usage while maintaining accuracy.
Contribution
It proposes token distribution-based indicators for evaluating CoT effectiveness and develops Dynamic CoT, a dynamic selection method that enhances reasoning efficiency in LLMs.
Findings
Indicators achieve 89.2% accuracy in assessing CoT effectiveness.
Dynamic CoT reduces token consumption by over 35%.
Method extends to closed-source models via transfer learning.
Abstract
Chain-of-Thought (CoT) technique has proven effective in improving the performance of large language models (LLMs) on complex reasoning tasks. However, the performance gains are inconsistent across different tasks, and the underlying mechanism remains a long-standing research question. In this work, we make a preliminary observation that the monotonicity of token probability distributions may be correlated with the gains achieved through CoT reasoning. Leveraging this insight, we propose two indicators based on the token probability distribution to assess CoT effectiveness across different tasks. By combining instance-level indicators with logistic regression model, we introduce Dynamic CoT, a method that dynamically select between CoT and direct answer. Furthermore, we extend Dynamic CoT to closed-source models by transferring decision strategies learned from open-source models. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Mobile Crowdsensing and Crowdsourcing
MethodsLogistic Regression
