A Survey on Symbolic Knowledge Distillation of Large Language Models

Kamal Acharya; Alvaro Velasquez; Houbing Herbert Song

arXiv:2408.10210·cs.CL·August 21, 2024

A Survey on Symbolic Knowledge Distillation of Large Language Models

Kamal Acharya, Alvaro Velasquez, Houbing Herbert Song

PDF

TL;DR

This survey reviews the emerging field of symbolic knowledge distillation in large language models, highlighting methods, challenges, and future opportunities to improve interpretability and efficiency of AI systems.

Contribution

It categorizes existing research on symbolic knowledge distillation in LLMs, providing a comprehensive overview and identifying research gaps and future directions.

Findings

01

Categorizes methodologies and applications of symbolic knowledge distillation.

02

Highlights challenges in maintaining knowledge depth and interpretability.

03

Identifies research gaps and potential future advancements.

Abstract

This survey paper delves into the emerging and critical area of symbolic knowledge distillation in Large Language Models (LLMs). As LLMs like Generative Pre-trained Transformer-3 (GPT-3) and Bidirectional Encoder Representations from Transformers (BERT) continue to expand in scale and complexity, the challenge of effectively harnessing their extensive knowledge becomes paramount. This survey concentrates on the process of distilling the intricate, often implicit knowledge contained within these models into a more symbolic, explicit form. This transformation is crucial for enhancing the interpretability, efficiency, and applicability of LLMs. We categorize the existing research based on methodologies and applications, focusing on how symbolic knowledge distillation can be used to improve the transparency and functionality of smaller, more efficient Artificial Intelligence (AI) models.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsKnowledge Distillation