Quantum Knowledge Distillation for Large Language Models
Lingxiao Li, Yihao Wang, Jiacheng Fan, Jing Li, Sujuan Qin, Qiaoyan Wen, Fei Gao

TL;DR
This paper introduces a quantum knowledge distillation approach for large language models using variational quantum circuits, achieving superior performance in text classification and demonstrating practical feasibility on quantum hardware.
Contribution
It presents the first quantum knowledge distillation model for LLMs, leveraging quantum circuits to outperform classical methods and exploring quantum-inspired classical algorithms.
Findings
Quantum distillation outperforms classical methods in accuracy and efficiency.
Simulation of quantum models can inspire new classical algorithms.
Quantum circuits maintain performance on real hardware despite constraints.
Abstract
As foundational tools in natural language processing, Large Language Models (LLMs) have immense parameter scales, which makes deployment and inference increasingly prohibitive, especially in resource-constrained devices. Therefore, knowledge distillation for LLMs, i.e., compressing the LLM to a smaller model, is meaningful. With strong parameter representation capacity, quantum computing is regarded as a promising solution. Here, we propose a Quantum knowledge Distillation model for LLMs (QD-LLM) that leverages variational quantum circuits to learn from LLMs. In classical simulation, QD-LLM outperforms several mainstream distillation methods on multiple text classification tasks in terms of both accuracy and efficiency using only 11 qubits. The results reveal an interesting phenomenon that the simulation of quantum student models may be regarded as a new class of quantum-inspired…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQuantum Computing Algorithms and Architecture · Machine Learning in Materials Science · Quantum many-body systems
