# FedReFT: Federated Representation Fine-Tuning with All-But-Me Aggregation

**Authors:** Fatema Siddika, Md Anwar Hossen, J. Pablo Mu\~noz, Tanya Roosta, Anuj Sharma, Ali Jannesari

arXiv: 2508.20295 · 2025-12-24

## TL;DR

FedReFT introduces a lightweight, representation-focused federated fine-tuning method that enhances personalization and stability across heterogeneous clients, achieving state-of-the-art results with high parameter efficiency.

## Contribution

It proposes FedReFT with All-But-Me aggregation, a novel approach for federated representation fine-tuning that improves stability and personalization in heterogeneous settings.

## Key findings

- State-of-the-art performance on multiple benchmarks.
- Achieves 1-49x higher parameter efficiency.
- Effective handling of client heterogeneity.

## Abstract

Parameter-efficient fine-tuning (PEFT) adapts large pre-trained models by updating only a small subset of parameters. Recently, Representation Fine-Tuning (ReFT) has emerged as an effective alternative. ReFT shifts the fine-tuning paradigm from updating model weights to directly manipulating hidden representations that capture rich semantic information, and outperforms state-of-the-art PEFTs in standalone settings. However, its application in Federated Learning (FL) remains challenging due to heterogeneity in clients' data distributions, model capacities, and computational resources. To address these challenges, we introduce Federated Representation Fine-Tuning (FedReFT), a novel approach to fine-tune clients' hidden representations. FedReFT applies sparse intervention layers to steer hidden representations directly, offering a lightweight and semantically rich fine-tuning alternative ideal for edge devices. However, representation-level updates are especially vulnerable to aggregation mismatch under different task heterogeneity, where naive averaging can corrupt semantic alignment. To mitigate this issue, we propose All-But-Me (ABM) aggregation, where each client receives the aggregated updates of others and partially incorporates them, enabling stable and personalized learning by balancing local focus with global knowledge. We further design an adaptive update strategy inspired by Test-Time Computing (TTC) to balance local and global contributions under heterogeneous conditions. FedReFT achieves state-of-the-art performance on commonsense reasoning, arithmetic reasoning, and GLUE benchmarks, while delivering 1-49 times higher parameter efficiency compared to leading LoRA-based methods.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/2508.20295/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/2508.20295/full.md

---
Source: https://tomesphere.com/paper/2508.20295