Quantum Knowledge Distillation for Large Language Models

Lingxiao Li; Yihao Wang; Jiacheng Fan; Jing Li; Sujuan Qin; Qiaoyan Wen; Fei Gao

arXiv:2505.13205·quant-ph·August 4, 2025

Quantum Knowledge Distillation for Large Language Models

Lingxiao Li, Yihao Wang, Jiacheng Fan, Jing Li, Sujuan Qin, Qiaoyan Wen, Fei Gao

PDF

Open Access

TL;DR

This paper introduces a quantum knowledge distillation approach for large language models using variational quantum circuits, achieving superior performance in text classification and demonstrating practical feasibility on quantum hardware.

Contribution

It presents the first quantum knowledge distillation model for LLMs, leveraging quantum circuits to outperform classical methods and exploring quantum-inspired classical algorithms.

Findings

01

Quantum distillation outperforms classical methods in accuracy and efficiency.

02

Simulation of quantum models can inspire new classical algorithms.

03

Quantum circuits maintain performance on real hardware despite constraints.

Abstract

As foundational tools in natural language processing, Large Language Models (LLMs) have immense parameter scales, which makes deployment and inference increasingly prohibitive, especially in resource-constrained devices. Therefore, knowledge distillation for LLMs, i.e., compressing the LLM to a smaller model, is meaningful. With strong parameter representation capacity, quantum computing is regarded as a promising solution. Here, we propose a Quantum knowledge Distillation model for LLMs (QD-LLM) that leverages variational quantum circuits to learn from LLMs. In classical simulation, QD-LLM outperforms several mainstream distillation methods on multiple text classification tasks in terms of both accuracy and efficiency using only 11 qubits. The results reveal an interesting phenomenon that the simulation of quantum student models may be regarded as a new class of quantum-inspired…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsQuantum Computing Algorithms and Architecture · Machine Learning in Materials Science · Quantum many-body systems