# Adaptive Federated Distillation for Multi-Domain Non-IID Textual Data

**Authors:** Jiahao Xiao, Jiangming Liu

arXiv: 2508.20557 · 2025-08-29

## TL;DR

This paper introduces a comprehensive benchmarking framework and an adaptive federated distillation method to effectively handle multi-domain non-IID textual data in federated learning, improving model performance across diverse language domains.

## Contribution

It proposes a unified benchmarking framework for multi-domain non-IID scenarios and an adaptive federated distillation method tailored for diverse natural language processing tasks.

## Key findings

- Models better capture local client diversity.
- Achieves improved performance over existing methods.
- Effective in both homogeneous and heterogeneous settings.

## Abstract

The widespread success of pre-trained language models has established a new training paradigm, where a global PLM is fine-tuned using task-specific data from local clients. The local data are highly different from each other and can not capture the global distribution of the whole data in real world. To address the challenges of non-IID data in real environments, privacy-preserving federated distillation has been proposed and highly investigated. However, previous experimental non-IID scenarios are primarily identified with the label (output) diversity, without considering the diversity of language domains (input) that is crucial in natural language processing. In this paper, we introduce a comprehensive set of multi-domain non-IID scenarios and propose a unified benchmarking framework that includes diverse data. The benchmark can be used to evaluate the federated learning framework in a real environment. To this end, we propose an Adaptive Federated Distillation (AdaFD) framework designed to address multi-domain non-IID challenges in both homogeneous and heterogeneous settings. Experimental results demonstrate that our models capture the diversity of local clients and achieve better performance compared to the existing works. The code for this paper is available at: https://github.com/jiahaoxiao1228/AdaFD.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/2508.20557/full.md

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/2508.20557/full.md

## References

44 references — full list in the complete paper: https://tomesphere.com/paper/2508.20557/full.md

---
Source: https://tomesphere.com/paper/2508.20557