A Comparative Study of Sampling Methods with Cross-Validation in the FedHome Framework

Arash Ahmadi, Sarah S. Sharif, and Yaser M. Banad

arXiv:2406.01950 · cs.LG · June 5, 2024

TL;DR

This study compares sampling methods for handling class imbalance in federated learning for in-home health monitoring and finds that SMOTE-ENN gives the most stable and reliable performance.

Contribution

The paper evaluates six oversampling techniques with stratified K-fold cross-validation in the FedHome framework and highlights SMOTE-ENN's superior stability on personalized health data.

Findings

01. SMOTE-ENN achieves the most consistent test accuracy.

02. SMOTE and SVM-SMOTE show higher performance variability.

03. Random OverSampler exhibits significant deviation in results.

Abstract

This paper presents a comparative study of sampling methods within the FedHome framework, designed for personalized in-home health monitoring. FedHome leverages federated learning (FL) and generative convolutional autoencoders (GCAE) to train models on decentralized edge devices while prioritizing data privacy. A notable challenge in this domain is the class imbalance in health data, where critical events such as falls are underrepresented, adversely affecting model performance. To address this, the research evaluates six oversampling techniques using Stratified K-fold cross-validation: SMOTE, Borderline-SMOTE, Random OverSampler, SMOTE-Tomek, SVM-SMOTE, and SMOTE-ENN. These methods are tested on FedHome's public implementation over 200 training rounds with and without stratified K-fold cross-validation. The findings indicate that SMOTE-ENN achieves the most consistent test accuracy,…
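The evaluation protocol described in the abstract, resampling combined with stratified K-fold cross-validation, can be illustrated with a minimal standard-library sketch. It is not the FedHome implementation: the toy labels, the function names `stratified_k_fold` and `random_oversample`, and the fold count of 5 are all assumptions made for illustration, and plain random oversampling stands in for the six methods the paper compares. The sketch resamples only the training portion of each fold, so the held-out fold keeps the original class ratio.

```python
import random
from collections import Counter, defaultdict


def stratified_k_fold(labels, k, seed=0):
    """Yield (train_idx, test_idx) pairs that keep class ratios roughly equal per fold."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)
    folds = [[] for _ in range(k)]
    for indices in by_class.values():
        rng.shuffle(indices)
        # Deal each class's samples round-robin so every fold gets its share.
        for position, idx in enumerate(indices):
            folds[position % k].append(idx)
    for i in range(k):
        test_idx = sorted(folds[i])
        train_idx = sorted(
            idx for j, fold in enumerate(folds) if j != i for idx in fold
        )
        yield train_idx, test_idx


def random_oversample(indices, labels, seed=0):
    """Duplicate minority-class indices at random until every class matches the majority count."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for idx in indices:
        by_class[labels[idx]].append(idx)
    target = max(len(members) for members in by_class.values())
    balanced = []
    for members in by_class.values():
        balanced.extend(members)
        balanced.extend(rng.choice(members) for _ in range(target - len(members)))
    return balanced


# Hypothetical toy labels: 0 = normal activity, 1 = fall (the rare class).
labels = [0] * 40 + [1] * 10

fold_summaries = []
for train_idx, test_idx in stratified_k_fold(labels, k=5):
    # Resample the training indices only; the test fold stays imbalanced.
    balanced_train = random_oversample(train_idx, labels)
    train_counts = Counter(labels[i] for i in balanced_train)
    test_counts = Counter(labels[i] for i in test_idx)
    fold_summaries.append((dict(train_counts), dict(test_counts)))
```

Each entry of `fold_summaries` pairs the class counts of the balanced training set with those of the untouched test fold. Swapping `random_oversample` for another resampler would leave the rest of the loop unchanged, which is the sense in which the methods can be compared under one protocol.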

Taxonomy

Methods: Synthetic Minority Over-sampling Technique (SMOTE)