$\textit{SKIntern}$: Internalizing Symbolic Knowledge for Distilling Better CoT Capabilities into Small Language Models

Huanxuan Liao; Shizhu He; Yupu Hao; Xiang Li; Yuanzhe Zhang; Jun Zhao; Kang Liu

arXiv:2409.13183·cs.CL·December 17, 2024

TL;DR

SKIntern is a novel method that internalizes symbolic knowledge into small language models through progressive fine-tuning, enhancing reasoning capabilities, reducing inference costs, and outperforming existing methods across various tasks.

Contribution

The paper introduces a curriculum-learning-based progressive fine-tuning approach that lets small language models internalize symbolic knowledge efficiently.

Findings

01. Outperforms state-of-the-art baselines by over 5%
02. Reduces inference FLOPs by up to 4x
03. Improves reasoning and out-of-domain generalization

Abstract

Small Language Models (SLMs) are attracting attention due to the high computational demands and privacy concerns of Large Language Models (LLMs). Some studies fine-tune SLMs using Chains of Thought (CoT) data distilled from LLMs, aiming to enhance their reasoning ability. Furthermore, some CoT distillation methods introduce external symbolic knowledge into the generation process to improve the limited knowledge memory, reasoning ability, and out-of-domain (OOD) generalization of SLMs. However, the introduction of symbolic knowledge increases computational overhead and introduces potential noise. In this paper, we introduce $SKIntern$, an innovative approach that empowers SLMs to internalize symbolic knowledge and few-shot examples gradually through a progressive fine-tuning process, guided by a predefined linear decay schedule under curriculum learning. By efficiently…
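The abstract mentions a predefined linear decay schedule but, as excerpted here, does not spell out how it is applied. The following is a minimal sketch of one way such a schedule could work, not the authors' implementation: it assumes the fraction of auxiliary context (few-shot examples and symbolic knowledge) kept in each training input falls linearly from 1 to 0 over training. The function names, the prefix-based pruning rule, and the prompt layout are all hypothetical.

```python
from typing import List


def linear_decay_fraction(step: int, total_steps: int) -> float:
    """Fraction of auxiliary context to keep: 1.0 at step 0, 0.0 at the final step."""
    if total_steps <= 0:
        raise ValueError("total_steps must be positive")
    clamped = min(max(step, 0), total_steps)
    return 1.0 - clamped / total_steps


def keep_prefix(items: List[str], fraction: float) -> List[str]:
    """Hypothetical pruning rule (an assumption): keep the leading `fraction` of the items."""
    return items[: int(len(items) * fraction)]


def build_training_input(
    question: str,
    knowledge_tokens: List[str],
    examples: List[str],
    step: int,
    total_steps: int,
) -> str:
    """Assemble a training input whose auxiliary context shrinks as training progresses."""
    frac = linear_decay_fraction(step, total_steps)
    kept_examples = keep_prefix(examples, frac)
    kept_knowledge = " ".join(keep_prefix(knowledge_tokens, frac))
    parts = kept_examples + [kept_knowledge, question]
    # Drop empty parts so the final input is the bare question once the schedule reaches 0.
    return "\n".join(p for p in parts if p)
```

Under these assumptions, early training inputs carry the full examples and knowledge, while the last ones contain only the question, which matches the stated goal of not needing that context at inference time.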

Code & Models

Repositories

xnhyacinth/skintern (PyTorch, official)

Taxonomy

Topics: Topic Modeling

Methods: Softmax · Attention Is All You Need