FIRM: Federated In-client Regularized Multi-objective Alignment for Large Language Models
Fatemeh Nourzad, Amirhossein Roknilamouki, Eylem Ekici, Jia Liu, Ness Shroff

TL;DR
FIRM is a federated learning algorithm that efficiently aligns large language models with human values by reducing communication overhead and balancing multiple objectives through in-client regularization.
Contribution
FIRM introduces a novel in-client regularized multi-objective optimization method that eliminates multi-gradient communication, ensuring convergence and improved trade-offs in federated LLM alignment.
Findings
Reduces communication by transmitting only adapted parameters
Achieves convergence to Pareto-stationary points with finite-time guarantees
Improves training stability and objective trade-offs in experiments
Abstract
Aligning Large Language Models (LLMs) with human values often involves balancing multiple, conflicting objectives such as helpfulness and harmlessness. Training these models is computationally intensive, and centralizing the process raises significant data privacy concerns. Federated Learning (FL) offers a compelling alternative, but existing Federated Multi-Objective Optimization (FMOO) methods face severe communication bottlenecks as their reliance on transmitting multiple gradients to a server is unscalable for large models. We introduce FIRM (Federated In-client Regularized Multi-objective alignment), a novel algorithm that achieves both client disagreement drift mitigation and communication efficiency. In FIRM, each client locally solves a regularized multi-objective optimization problem. By directly mitigating client disagreement drift through in-client regularization, our method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Advanced Graph Neural Networks · Mobile Crowdsensing and Crowdsourcing
