Symbolic Chain-of-Thought Distillation: Small Models Can Also "Think" Step-by-Step

Liunian Harold Li; Jack Hessel; Youngjae Yu; Xiang Ren; Kai-Wei Chang; Yejin Choi

arXiv:2306.14050 · cs.CL · April 17, 2024 · 2 cites

TL;DR

This paper introduces Symbolic Chain-of-Thought Distillation (SCoTD), enabling small language models to emulate large models' reasoning capabilities by training on their rationalizations, significantly improving performance on reasoning tasks.

Contribution

The paper proposes SCoTD, a novel method for training small models on large models' rationalizations, making step-by-step reasoning accessible to smaller models.

Findings

01. SCoTD improves small-model performance on commonsense benchmarks.

02. Sampling multiple reasoning chains per instance from the teacher is crucial.

03. Human evaluators find the student model's rationalizations comparable to the teacher's.

Abstract

Chain-of-thought prompting (e.g., "Let's think step-by-step") primes large language models to verbalize rationalization for their predictions. While chain-of-thought can lead to dramatic performance gains, benefits appear to emerge only for sufficiently large models (beyond 50B parameters). We show that orders-of-magnitude smaller models (125M to 1.3B parameters) can still benefit from chain-of-thought prompting. To achieve this, we introduce Symbolic Chain-of-Thought Distillation (SCoTD), a method to train a smaller student model on rationalizations sampled from a significantly larger teacher model. Experiments across several commonsense benchmarks show that: 1) SCoTD enhances the performance of the student model in both supervised and few-shot settings, and especially for challenge sets; 2) sampling many reasoning chains per instance from the teacher is paramount; and 3) after…
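
The data-collection step described in the abstract (sampling many rationalizations per instance from a teacher, then training a student on them) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: `build_distillation_corpus`, `toy_teacher`, and the `chains_per_instance` default are hypothetical names and values introduced here, and the teacher is a random stand-in rather than a real large model.

```python
import random
from typing import Callable, Dict, List


def build_distillation_corpus(
    questions: List[str],
    sample_rationale: Callable[[str], str],
    chains_per_instance: int = 4,  # illustrative default, not a value from the paper
) -> List[Dict[str, str]]:
    """Pair every question with several independently sampled teacher rationalizations."""
    corpus: List[Dict[str, str]] = []
    for question in questions:
        for _ in range(chains_per_instance):
            # Each call is assumed to return one sampled chain of thought.
            corpus.append({"prompt": question, "target": sample_rationale(question)})
    return corpus


def toy_teacher(question: str) -> str:
    """Hypothetical stand-in for sampling a chain of thought from a large teacher model."""
    openers = ["Let's think step-by-step.", "First, consider the question.", "Reasoning:"]
    return f"{random.choice(openers)} [rationale for: {question}]"


if __name__ == "__main__":
    corpus = build_distillation_corpus(["Where would you store milk?"], toy_teacher, 3)
    for example in corpus:
        print(example["prompt"], "->", example["target"])
```

In a real setup, the resulting prompt/target pairs would presumably be used to fine-tune the smaller student with an ordinary language-modeling objective; that training step is omitted here.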

Code & Models

Repositories

liunian-harold-li/scotd
PyTorch · Official

Models

🤗
44David/qwen-0.5b-reasoning-v2
model · 8 dl · ♡ 1

Datasets

44David/SCoTD-deepseek-math-7B
dataset · 7 dl

Taxonomy

Topics: Topic Modeling · Explainable Artificial Intelligence (XAI) · Advanced Graph Neural Networks