MTA: A Merge-then-Adapt Framework for Personalized Large Language Model
Xiaopeng Li, Yuanjin Zheng, Wanyu Wang, wenlin zhang, Pengyue Jia, Yiqi Wang, Maolin Wang, Xuetao Wei, Xiangyu Zhao

TL;DR
The paper introduces MTA, a scalable framework for personalized large language models that dynamically merges shared meta-LoRA modules to improve personalization without high storage costs.
Contribution
MTA proposes a novel merge-then-adapt framework with dynamic personalization, reducing storage and enhancing few-shot personalization for PLLMs.
Findings
Outperforms SOTA methods on LaMP benchmark
Supports dynamic, scalable personalization without user-specific storage
Effective in few-shot personalization scenarios
Abstract
Personalized Large Language Models (PLLMs) aim to align model outputs with individual user preferences, a crucial capability for user-centric applications. However, the prevalent approach of fine-tuning a separate module for each user faces two major limitations: (1) storage costs scale linearly with the number of users, rendering the method unscalable; and (2) fine-tuning a static model from scratch often yields suboptimal performance for users with sparse data. To address these challenges, we propose MTA, a Merge-then-Adapt framework for PLLMs. MTA comprises three key stages. First, we construct a shared Meta-LoRA Bank by selecting anchor users and pre-training meta-personalization traits within meta-LoRA modules. Second, to ensure scalability and enable dynamic personalization combination beyond static models, we introduce an Adaptive LoRA Fusion stage. This stage retrieves and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Computational and Text Analysis Methods · Machine Learning in Healthcare
