Explanations from Large Language Models Make Small Reasoners Better

Shiyang Li; Jianshu Chen; Yelong Shen; Zhiyu Chen; Xinlu Zhang; Zekun; Li; Hong Wang; Jing Qian; Baolin Peng; Yi Mao; Wenhu Chen; Xifeng Yan

arXiv:2210.06726·cs.CL·October 14, 2022·35 cites

Explanations from Large Language Models Make Small Reasoners Better

Shiyang Li, Jianshu Chen, Yelong Shen, Zhiyu Chen, Xinlu Zhang, Zekun, Li, Hong Wang, Jing Qian, Baolin Peng, Yi Mao, Wenhu Chen, Xifeng Yan

PDF

Open Access

TL;DR

This paper presents a method for training small language models to improve their reasoning and explanation capabilities by leveraging explanations generated by large models, outperforming larger models in accuracy.

Contribution

The paper introduces a multi-task learning framework that uses LLM-generated explanations to enhance small models' reasoning and explanation abilities, surpassing finetuning baselines.

Findings

01

Small models outperform finetuning baselines in reasoning tasks.

02

Method surpasses GPT-3 (175B) by up to 9.5% in accuracy.

03

Generated explanations are of high quality, supporting explainable AI.

Abstract

Integrating free-text explanations to in-context learning of large language models (LLM) is shown to elicit strong reasoning capabilities along with reasonable explanations. In this paper, we consider the problem of leveraging the explanations generated by LLM to improve the training of small reasoners, which are more favorable in real-production deployment due to their low cost. We systematically explore three explanation generation approaches from LLM and utilize a multi-task learning framework to facilitate small models to acquire strong reasoning power together with explanation generation capabilities. Experiments on multiple reasoning tasks show that our method can consistently and significantly outperform finetuning baselines across different settings, and even perform better than finetuning/prompting a 60x larger GPT-3 (175B) model by up to 9.5% in accuracy. As a side benefit,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Natural Language Processing Techniques

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Linear Layer · Cosine Annealing · {Dispute@FaQ-s}How to file a dispute with Expedia? · Multi-Head Attention · Softmax · Linear Warmup With Cosine Annealing · Attention Dropout