TL;DR
Co-LoRA advances personalized federated learning by enabling knowledge sharing across heterogeneous models and data, addressing real-world scenario complexities with a new benchmark and demonstrating superior performance.
Contribution
The paper introduces Co-LoRA, a novel dimension-invariant module for model aggregation in heterogeneous PFL, and a multi-modal benchmark with diverse tasks and distribution shifts.
Findings
Co-LoRA significantly outperforms existing PFL methods in heterogeneous settings.
The proposed task-relevance-aware aggregation reduces parameter interference.
The multi-modal benchmark effectively simulates real-world task diversity.
Abstract
As AI becomes more personal, e.g., Agentic AI, there is an increasing need for personalizing models for various use cases. Personalized federated learning (PFL) enables each client to collaboratively leverage other clients' knowledge for better adaptation to the task of interest, without privacy risks. Despite its potential, existing PFL methods remain confined to rather simplified scenarios where data and models are the same across clients. To move towards realistic scenarios, we move beyond these restrictive assumptions by addressing both data and model heterogeneity. We propose a task-relevance-aware model aggregation strategy to reduce parameter interference under heterogeneous data. Moreover, we introduce Co-LoRA, a dimension-invariant module that enables knowledge sharing across heterogeneous architectures. To mimic the real-world task diversity, we propose a multi-modal PFL…
Peer Reviews
Decision·ICLR 2026 Poster
(S1) The writing and presentation of the paper are very clear in terms of both algorithm design and experimental results. Adequate intuitions are provided throughout the paper and appendices. The problem is well motivated, related work is well cited, and the contributions are contextualized appropriately. (S2) FedMosaic is an original and interesting solution to a very complex practical problem of multiple heterogeneities in pFL of MLLMs. This is a significant contribution to the field in terms
(W1) Introducing a new benchmark in an algorithms paper is counterproductive. The benchmark would be difficult to discover for any reader. To a reviewer, the benchmark's design is impossible to evaluate when only 10 lines can be allocated to it in the main body. While DRAKE looks extremely useful, there are several nuances which can only be understood by carefully reading multiple sections in the appendices. My opinion is that DRAKE should be submitted as a separate datasets & benchmarks style p
Overall, this paper represents a meaningful problem in personalized federated learning. The authors find that existing personalized federated learning methods are still confined to simplified scenarios with highly homogeneous data and models across clients, while real-world scenarios are more complex. They proposed FedMosaic, which addresses the simultaneous heterogeneity of data and models through a task-correlation-aware model aggregation strategy and dimension-invariant modules. Additionally,
**Major Weaknesses:** Overall, this paper has some merits, but there are a few weaknesses that stop me from giving a higher rating. My major concerns are as follows. (1) The paper mentions that the FedMosaic method does not require high computational costs, and the authors' experiments indeed include sections related to computational costs. However, the process of weight alignment in PQ-LoRA seems to be relatively complex, and the paper does not provide information about the computational cost
1. Experimental results demonstrate consistent improvements over strong baselines, indicating the effectiveness of the approach. 2. The appendix further provides a thorough and extensive suite of experiments, supporting the validity and robustness of the reported findings. 3. The paper is generally well-written and clearly organized, making the technical ideas easy to follow.
1. The comparison against prior works using non-IID splits of a single dataset may not be entirely equitable. The contextual settings differ significantly (maybe the earlier studies targeted models specialized for single-domain or unimodal tasks, rather than fine-tuning billion-parameter foundation models). Moreover, the motivation for exploring multi-modal tasks in PFL requires further clarification. What are the practical or deployment-oriented use cases where clients naturally possess distinc
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
