Routing Distilled Knowledge via Mixture of LoRA Experts for Large Language Model based Bundle Generation

Kaidong Feng; Zhu Sun; Hui Fang; Jie Yang; Wenyuan Liu; Yew-Soon Ong

arXiv:2508.17250 · cs.CL · August 26, 2025

TL;DR

RouteDK introduces a dynamic routing framework with mixture of LoRA experts to distill and integrate diverse knowledge types from large language models, achieving high accuracy and efficiency in bundle generation.

Contribution

The paper proposes RouteDK, a novel knowledge routing framework with input-aware dynamic fusion of LoRA experts for efficient large language model distillation.

Findings

01 Achieves accuracy comparable to or better than that of the teacher LLMs.

02 Outperforms state-of-the-art bundle generation methods.

03 Maintains strong computational efficiency.

Abstract

Large Language Models (LLMs) have shown potential in automatic bundle generation but suffer from prohibitive computational costs. Although knowledge distillation offers a pathway to more efficient student models, our preliminary study reveals that naively integrating diverse types of distilled knowledge from teacher LLMs into student LLMs leads to knowledge conflict, which degrades bundle generation performance. To address this, we propose RouteDK, a framework for routing distilled knowledge through a mixture-of-LoRA-experts architecture. Specifically, we first distill knowledge from the teacher LLM for bundle generation in two complementary types: high-level knowledge (generalizable rules) and fine-grained knowledge (session-specific reasoning). We then train a knowledge-specific LoRA expert for each type of knowledge, together with a base LoRA expert. For effective…
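
To make the idea of input-aware fusion of LoRA experts concrete, here is a minimal sketch in plain Python. It is not the authors' implementation: the abstract is cut off before it describes the routing step, so the softmax gate over three experts (base, high-level, fine-grained), the class names `LoRAExpert` and `RoutedLoRALayer`, and all dimensions are assumptions made for illustration only. Each expert contributes a low-rank update `B @ A` to a frozen weight matrix, and a small linear router turns the input into per-expert mixing weights.

```python
import math
import random


def matvec(M, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(m * xi for m, xi in zip(row, x)) for row in M]


def softmax(z):
    """Numerically stable softmax over a list of scores."""
    top = max(z)
    exps = [math.exp(v - top) for v in z]
    total = sum(exps)
    return [v / total for v in exps]


class LoRAExpert:
    """Hypothetical low-rank adapter: delta(x) = scale * B @ (A @ x)."""

    def __init__(self, d_in, d_out, rank, rng, scale=1.0):
        # A is small random, B is zero, so the update starts at zero
        # (the usual LoRA initialisation).
        self.A = [[rng.gauss(0.0, 0.02) for _ in range(d_in)] for _ in range(rank)]
        self.B = [[0.0] * rank for _ in range(d_out)]
        self.scale = scale

    def delta(self, x):
        return [self.scale * v for v in matvec(self.B, matvec(self.A, x))]


class RoutedLoRALayer:
    """Frozen weight W plus a gate-weighted sum of LoRA expert updates.

    Assumption: the gate is a softmax over a linear projection of the input.
    The source does not specify the router, so this is only one plausible form.
    """

    def __init__(self, W, experts, router_W):
        self.W = W                # frozen base weight, shape d_out x d_in
        self.experts = experts    # e.g. [base, high_level, fine_grained]
        self.router_W = router_W  # shape n_experts x d_in

    def forward(self, x):
        gates = softmax(matvec(self.router_W, x))  # input-dependent weights
        y = matvec(self.W, x)
        for g, expert in zip(gates, self.experts):
            y = [yi + g * di for yi, di in zip(y, expert.delta(x))]
        return y, gates


# Usage: three experts on a toy 2-dimensional layer.
rng = random.Random(0)
experts = [LoRAExpert(d_in=2, d_out=2, rank=1, rng=rng) for _ in range(3)]
layer = RoutedLoRALayer(
    W=[[1.0, 0.0], [0.0, 1.0]],
    experts=experts,
    router_W=[[0.1, 0.2], [0.3, -0.1], [0.0, 0.0]],
)
y, gates = layer.forward([1.0, 2.0])
```

Because the gate depends on the input, different sessions can lean on different experts, which is one way to keep conflicting knowledge types from being averaged together uniformly. In this sketch the untrained experts contribute nothing, so `y` equals the frozen layer's output until the `B` matrices are trained.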
