Task-Centric Personalized Federated Fine-Tuning of Language Models

Gabriel U. Talasso; Meghdad Kurmanji; Allan M. de Souza; Nicholas D. Lane; Leandro A. Villas

arXiv:2604.00050·cs.LG·April 7, 2026

Task-Centric Personalized Federated Fine-Tuning of Language Models

Gabriel U. Talasso, Meghdad Kurmanji, Allan M. de Souza, Nicholas D. Lane, Leandro A. Villas

PDF

TL;DR

FedRouter is a clustering-based personalized federated learning method that creates task-specific models to improve robustness and generalization in federated language model training.

Contribution

It introduces a novel clustering approach with adapters and an evaluation router to enhance task-centric personalization and robustness in federated learning.

Findings

01

FedRouter outperforms existing methods by up to 6.1% under task interference.

02

Achieves up to 136% relative improvement in generalization.

03

Demonstrates strong resilience in challenging federated learning scenarios.

Abstract

Federated Learning (FL) has emerged as a promising technique for training language models on distributed and private datasets of diverse tasks. However, aggregating models trained on heterogeneous tasks often degrades the overall performance of individual clients. To address this issue, Personalized FL (pFL) aims to create models tailored for each client's data distribution. Although these approaches improve local performance, they usually lack robustness in two aspects: (i) generalization: when clients must make predictions on unseen tasks, or face changes in their data distributions, and (ii) intra-client tasks interference: when a single client's data contains multiple distributions that may interfere with each other during local training. To tackle these two challenges, we propose FedRouter, a clustering-based pFL that builds specialized models for each task rather than for each…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.