A Comparative Study of Sampling Methods with Cross-Validation in the FedHome Framework
Arash Ahmadi, Sarah S. Sharif, and Yaser M. Banad

TL;DR
This study compares sampling methods in federated learning for health monitoring, finding SMOTE-ENN provides the most stable and reliable performance in addressing class imbalance.
Contribution
It evaluates six oversampling techniques with cross-validation in the FedHome framework, highlighting SMOTE-ENN's superior stability for personalized health data.
Findings
SMOTE-ENN achieves the most consistent test accuracy.
SMOTE and SVM-SMOTE show higher performance variability.
Random OverSampler exhibits significant deviation in results.
Abstract
This paper presents a comparative study of sampling methods within the FedHome framework, designed for personalized in-home health monitoring. FedHome leverages federated learning (FL) and generative convolutional autoencoders (GCAE) to train models on decentralized edge devices while prioritizing data privacy. A notable challenge in this domain is the class imbalance in health data, where critical events such as falls are underrepresented, adversely affecting model performance. To address this, the research evaluates six oversampling techniques using Stratified K-fold cross-validation: SMOTE, Borderline-SMOTE, Random OverSampler, SMOTE-Tomek, SVM-SMOTE, and SMOTE-ENN. These methods are tested on FedHome's public implementation over 200 training rounds with and without stratified K-fold cross-validation. The findings indicate that SMOTE-ENN achieves the most consistent test accuracy,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSynthetic Minority Over-sampling Technique.
