STORM: Foundations of End-to-End Empirical Risk Minimization on the Edge
Benjamin Coleman, Gaurav Gupta, John Chen, Anshumali Shrivastava

TL;DR
STORM introduces an online sketching method that enables efficient, accurate empirical risk minimization directly on edge devices, reducing the need for cloud-based training and preserving data security.
Contribution
The paper presents STORM, a novel online sketching technique for empirical risk minimization that is suitable for edge computing environments and provides theoretical guarantees.
Findings
STORM accurately estimates surrogate losses for linear regression.
Experimental results show STORM enables effective training on real-world datasets.
STORM reduces computational and communication costs in distributed learning.
Abstract
Empirical risk minimization is perhaps the most influential idea in statistical learning, with applications to nearly all scientific and technical domains in the form of regression and classification models. To analyze massive streaming datasets in distributed computing environments, practitioners increasingly prefer to deploy regression models on edge rather than in the cloud. By keeping data on edge devices, we minimize the energy, communication, and data security risk associated with the model. Although it is equally advantageous to train models at the edge, a common assumption is that the model was originally trained in the cloud, since training typically requires substantial computation and memory. To this end, we propose STORM, an online sketch for empirical risk minimization. STORM compresses a data stream into a tiny array of integer counters. This sketch is sufficient to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Stochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data
MethodsLinear Regression
