Sample Selection Using Multi-Task Autoencoders in Federated Learning with Non-IID Data
Emre Ard{\i}\c{c}, Yakup Gen\c{c}

TL;DR
This paper introduces novel sample selection techniques using multi-task autoencoders and outlier detection methods to improve federated learning performance on non-IID image data.
Contribution
It proposes a multi-task autoencoder-based approach combined with outlier detection and SVDD loss for effective sample filtering in federated learning.
Findings
Significant accuracy improvements on CIFAR10 and MNIST datasets.
Loss-based sample selection methods outperform baseline approaches.
Feature-based sample selection with SVDD further enhances model accuracy.
Abstract
Federated learning is a machine learning paradigm in which multiple devices collaboratively train a model under the supervision of a central server while ensuring data privacy. However, its performance is often hindered by redundant, malicious, or abnormal samples, leading to model degradation and inefficiency. To overcome these issues, we propose novel sample selection methods for image classification, employing a multitask autoencoder to estimate sample contributions through loss and feature analysis. Our approach incorporates unsupervised outlier detection, using one-class support vector machine (OCSVM), isolation forest (IF), and adaptive loss threshold (AT) methods managed by a central server to filter noisy samples on clients. We also propose a multi-class deep support vector data description (SVDD) loss controlled by a central server to enhance feature-based sample selection. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
