C3oT: Generating Shorter Chain-of-Thought without Compromising Effectiveness
Yu Kang, Xianghui Sun, Liangyu Chen, Wei Zou

TL;DR
This paper introduces C3oT, a framework that compresses Chain-of-Thoughts in large language models to reduce length and costs while preserving reasoning quality, enabling faster and more efficient applications.
Contribution
The paper proposes a novel CoT compression framework with a compressor, conditioned training, and inference methods to shorten reasoning chains without losing effectiveness.
Findings
Compressed CoT length by over 50%
Maintained reasoning accuracy in multiple datasets
Reduced decoding costs significantly
Abstract
Generating Chain-of-Thought (CoT) before deriving the answer can effectively improve the reasoning capabilities of large language models (LLMs) and significantly improve the accuracy of the generated answer. However, in most cases, the length of the generated CoT is much longer than the desired final answer, which results in additional decoding costs. Furthermore, existing research has discovered that shortening the reasoning steps in CoT, even while preserving the key information, diminishes LLMs' abilities. These phenomena make it difficult to use LLMs and CoT in many real-world applications that only require the final answer and are sensitive to latency, such as search and recommendation. To reduce the costs of model decoding and shorten the length of the generated CoT, this paper presents onditioned ompressed hain-of-hought (C3oT), a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsEducational Games and Gamification · Mind wandering and attention · Cognitive Abilities and Testing
