Evaluation and Optimization of Distributed Machine Learning Techniques for Internet of Things
Yansong Gao, Minki Kim, Chandra Thapa, Sharif Abuadbba, Zhi Zhang,, Seyit A. Camtepe, Hyoungshick Kim, Surya Nepal

TL;DR
This paper empirically compares federated, split, and splitfed learning on Raspberry Pi IoT devices, analyzing their performance, overhead, and proposing optimizations to improve efficiency and scalability.
Contribution
It provides the first real-world comparison of FL, SL, and SFL on IoT devices and introduces a generalized SFL with hybrid training and communication reduction techniques.
Findings
SL outperforms FL under imbalanced data but worse under extreme non-IID data.
Optimizations reduce communication overhead by up to four times.
Generalized SFL improves scalability for large IoT systems.
Abstract
Federated learning (FL) and split learning (SL) are state-of-the-art distributed machine learning techniques to enable machine learning training without accessing raw data on clients or end devices. However, their \emph{comparative training performance} under real-world resource-restricted Internet of Things (IoT) device settings, e.g., Raspberry Pi, remains barely studied, which, to our knowledge, have not yet been evaluated and compared, rendering inconvenient reference for practitioners. This work firstly provides empirical comparisons of FL and SL in real-world IoT settings regarding (i) learning performance with heterogeneous data distributions and (ii) on-device execution overhead. Our analyses in this work demonstrate that the learning performance of SL is better than FL under an imbalanced data distribution but worse than FL under an extreme non-IID data distribution. Recently,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Age of Information Optimization · Distributed Sensor Networks and Detection Algorithms
