MIRA: A Method of Federated MultI-Task Learning for LaRge LAnguage   Models

Ahmed Elbakary; Chaouki Ben Issaid; Tamer ElBatt; Karim Seddik; Mehdi; Bennis

arXiv:2410.15524·cs.LG·October 22, 2024

MIRA: A Method of Federated MultI-Task Learning for LaRge LAnguage Models

Ahmed Elbakary, Chaouki Ben Issaid, Tamer ElBatt, Karim Seddik, Mehdi, Bennis

PDF

Open Access

TL;DR

This paper presents MIRA, a federated multi-task learning method for large language models that uses parameter-efficient fine-tuning to improve local and global performance across clients.

Contribution

It introduces a novel federated multi-task learning approach for LLMs that incorporates client task structures and uses LoRA for efficient training.

Findings

01

Outperforms existing federated fine-tuning methods in local loss reduction.

02

Maintains comparable global performance across clients.

03

Reduces computational and communication costs with LoRA.

Abstract

In this paper, we introduce a method for fine-tuning Large Language Models (LLMs), inspired by Multi-Task learning in a federated manner. Our approach leverages the structure of each client's model and enables a learning scheme that considers other clients' tasks and data distribution. To mitigate the extensive computational and communication overhead often associated with LLMs, we utilize a parameter-efficient fine-tuning method, specifically Low-Rank Adaptation (LoRA), reducing the number of trainable parameters. Experimental results, with different datasets and models, demonstrate the proposed method's effectiveness compared to existing frameworks for federated fine-tuning of LLMs in terms of average and local performances. The proposed scheme outperforms existing baselines by achieving lower local loss for each client while maintaining comparable global performance.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and dialogue systems · Recommender Systems and Techniques · Speech Recognition and Synthesis