PAE MobiLLM: Privacy-Aware and Efficient LLM Fine-Tuning on the Mobile Device via Additive Side-Tuning
Xingke Yang, Liang Li, Zhiyi Wan, Sicong Li, Xiaoqi Qi, Jiang Liu, Tomoaki Ohtsuki, Xin Fu, Miao Pan

TL;DR
PAE MobiLLM enables privacy-aware, efficient on-device LLM fine-tuning by leveraging additive side-tuning, activation caching, and communication reduction techniques, making it suitable for mobile devices with limited resources.
Contribution
It introduces a novel privacy-preserving, server-assisted additive side-tuning method with activation caching and shortcut techniques for efficient mobile LLM fine-tuning.
Findings
Outperforms existing methods in efficiency and privacy preservation.
Reduces communication costs significantly.
Achieves competitive fine-tuning accuracy on mobile devices.
Abstract
There is a huge gap between numerous intriguing applications fostered by on-device large language model (LLM) fine-tuning (FT) from fresh mobile data and the limited resources of a mobile device. While existing server-assisted methods (e.g., split learning or side-tuning) may enable LLM FT on the local mobile device, they suffer from heavy communication burdens of activation transmissions, and may disclose data and labels to the server. To address those issues, we develop PAE MobiLLM, a a privacy-aware and efficient LLM FT method which can be deployed on the mobile device via server-assisted additive side-tuning. To further accelerate FT convergence and improve computing efficiency, PAE MobiLLM integrates activation caching on the server side, which allows the server to reuse historical activations and saves the mobile device from repeatedly computing forward passes for the recurring…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCaching and Content Delivery · Big Data and Digital Economy · IoT and Edge/Fog Computing
