Task2vec Readiness: Diagnostics for Federated Learning from Pre-Training Embeddings
Cristiano Mafuz, Rodrigo Silva

TL;DR
This paper introduces Task2Vec-based readiness indices to predict federated learning performance before training, using unsupervised metrics from client embeddings, validated across multiple datasets and heterogeneity levels.
Contribution
It presents a novel pre-training diagnostic method for federated learning based on Task2Vec embeddings, enabling better anticipation of federation outcomes.
Findings
High correlation (often >0.9) between readiness indices and final performance.
Readiness metrics reliably predict federated learning success across datasets and heterogeneity levels.
Task2Vec-based diagnostics can guide client selection and improve federated learning strategies.
Abstract
Federated learning (FL) performance is highly sensitive to heterogeneity across clients, yet practitioners lack reliable methods to anticipate how a federation will behave before training. We propose readiness indices, derived from Task2Vec embeddings, that quantifies the alignment of a federation prior to training and correlates with its eventual performance. Our approach computes unsupervised metrics -- such as cohesion, dispersion, and density -- directly from client embeddings. We evaluate these indices across diverse datasets (CIFAR-10, FEMNIST, PathMNIST, BloodMNIST) and client counts (10--20), under Dirichlet heterogeneity levels spanning and FedAVG aggregation strategy. Correlation analyses show consistent and significant Pearson and Spearman coefficients between some of the Task2Vec-based readiness and final performance, with values often…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
