BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation
Bo Pang, Hanze Dong, Jiacheng Xu, Silvio Savarese, Yingbo Zhou,, Caiming Xiong

TL;DR
This paper presents BOLT, a method to enable large language models to perform long chain-of-thought reasoning without relying on distillation from specialized models, using a three-stage bootstrapping process with minimal data.
Contribution
BOLT introduces a novel three-stage bootstrapping approach to instill LongCoT reasoning in LLMs without distillation or extensive human annotations, applicable across various model sizes.
Findings
BOLT achieves strong performance on diverse reasoning benchmarks.
Only 10 in-context examples are needed for effective bootstrapping.
The method generalizes across multiple model scales.
Abstract
Large language models (LLMs), such as o1 from OpenAI, have demonstrated remarkable reasoning capabilities. o1 generates a long chain-of-thought (LongCoT) before answering a question. LongCoT allows LLMs to analyze problems, devise plans, reflect, and backtrack effectively. These actions empower LLM to solve complex problems. After the release of o1, many teams have attempted to replicate its LongCoT and reasoning capabilities. In terms of methods, they primarily rely on knowledge distillation with data from existing models with LongCoT capacities (e.g., OpenAI-o1, Qwen-QwQ, DeepSeek-R1-Preview), leaving significant uncertainties on systematically developing such reasoning abilities. In terms of data domains, these works focus narrowly on math while a few others include coding, limiting their generalizability. This paper introduces a novel approach to enable LLM's LongCoT capacity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
MethodsKnowledge Distillation · Focus
