Trustworthy and Controllable Professional Knowledge Utilization in Large Language Models with TEE-GPU Execution
Yifeng Cai, Zhida An, Yuhan Meng, Houqian Liu, Pengli Wang, Hanwen Lei, Yao Guo, Ding Li

TL;DR
This paper introduces PKUS, a system that enables trustworthy, controllable, and efficient utilization of professional knowledge in large language models by leveraging TEE-GPU execution and adapter-based knowledge separation.
Contribution
PKUS presents a novel architecture that treats professional knowledge as a separable artifact, ensuring privacy, control, and efficiency in LLM knowledge integration.
Findings
PKUS matches full fine-tuning accuracy and F1 scores.
Achieves 8.1-11.9x speedup over CPU-only inference.
Supports multi-provider knowledge aggregation.
Abstract
Future improvements in large language model (LLM) services increasingly hinge on access to high-value professional knowledge rather than more generic web data. However, the data providers of this knowledge face a skewed tradeoff between income and risk: they receive little share of downstream value yet retain copyright and privacy liability, making them reluctant to contribute their assets to LLM services. Existing techniques do not offer a trustworthy and controllable way to use professional knowledge, because they keep providers in the dark and combine knowledge parameters with the underlying LLM backbone. In this paper, we present PKUS, the Professional Knowledge Utilization System, which treats professional knowledge as a first-class, separable artifact. PKUS keeps the backbone model on GPUs and encodes each provider's contribution as a compact adapter that executes only inside an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Big Data and Digital Economy · Cloud Computing and Resource Management
