Sci-CoT: Leveraging Large Language Models for Enhanced Knowledge   Distillation in Small Models for Scientific QA

Yuhan Ma; Haiqi Jiang; Chenyou Fan

arXiv:2308.04679·cs.CL·August 10, 2023·1 cites

Sci-CoT: Leveraging Large Language Models for Enhanced Knowledge Distillation in Small Models for Scientific QA

Yuhan Ma, Haiqi Jiang, Chenyou Fan

PDF

Open Access

TL;DR

This paper introduces Sci-CoT, a two-stage knowledge distillation framework that transfers reasoning abilities from large language models to smaller models, significantly improving scientific question-answering performance.

Contribution

The paper presents Sci-CoT, a novel two-stage distillation method that enhances small models' reasoning skills by mimicking large models' chain-of-thought reasoning in scientific QA tasks.

Findings

01

Small model outperforms BLOOM-176B on ARC-Easy with few-shot learning

02

Sci-CoT effectively transfers reasoning capabilities from large to small models

03

80-million parameter model surpasses large models in specific scientific QA benchmarks

Abstract

Large Language Models (LLMs) have shown outstanding performance across wide range of downstream tasks. This competency is attributed to their substantial parameter size and pre-training on extensive corpus. Moreover, LLMs have exhibited enhanced reasoning capabilities in tackling complex reasoning tasks, owing to the utilization of a method named ``Chain-of-Thought (CoT) prompting''. This method is designed to generate intermediate reasoning steps that guide the inference of the final answer. However, it is essential to highlight that these advanced reasoning abilities appear to emerge in models with a minimum of 10 billion parameters, thereby limiting its efficacy in situations where computational resources are constrained. In this paper, we investigate the possibility of transferring the reasoning capabilities of LLMs to smaller models via knowledge distillation. Specifically, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Expert finding and Q&A systems