Provable Sparse Inversion and Token Relabel Enhanced One-shot Federated Learning with ViTs

Li Shen; Xiaolei Hao; Qinglun Li; Xiaochun Cao; Zhifeng Hao; Xun Yang

arXiv:2605.10748·cs.LG·May 12, 2026

Provable Sparse Inversion and Token Relabel Enhanced One-shot Federated Learning with ViTs

Li Shen, Xiaolei Hao, Qinglun Li, Xiaochun Cao, Zhifeng Hao, Xun Yang

PDF

TL;DR

FedMITR is a novel federated learning framework that enhances one-shot training with sparse inversion and token relabeling for ViTs, improving data quality and model generalization in non-IID scenarios.

Contribution

The paper introduces FedMITR, combining sparse model inversion and token relabeling strategies, with theoretical analysis and empirical results showing significant performance improvements.

Findings

01

FedMITR outperforms existing methods in various non-IID federated learning settings.

02

Sparse model inversion reduces gradient instability caused by background noise.

03

Token relabeling decreases gradient variance, enhancing model generalization.

Abstract

One-Shot Federated Learning, where a central server learns a global model in a single communication round, has emerged as a promising paradigm. However, under extremely non-IID settings, existing data-free methods often generate low-quality data that suffers from severe semantic misalignment with ground-truth labels. To overcome these issues, we propose a novel Federated Model Inversion and Token Relabel (FedMITR) framework, which trains the global model by fully exploiting all patches of synthetic images. Specifically, FedMITR employs sparse model inversion during data generation, selectively inverting semantic foregrounds while halting the inversion of uninformative backgrounds. To address semantically meaningless tokens that hinder ViT predictions, we implement a differentiated strategy: patches with high information density utilize generated pseudo-labels, while patches with low…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.