Robust Distributed Learning of Functional Data From Simulators through Data Sketching
R. Jacob Andros, Rajarshi Guhaniyogi, Devin Francom, Donatella, Pasqualini

TL;DR
This paper introduces a robust distributed Bayesian learning method for functional data from environmental simulators, using random data sketches to enhance privacy and reduce sensitivity to data partitioning.
Contribution
It proposes a novel approach employing random linear projections for distributed inference, improving robustness and privacy in functional data emulation.
Findings
Method achieves robustness against data shard selection.
Approach maintains privacy of individual data units.
Demonstrated effective emulation of environmental simulator data.
Abstract
In environmental studies, realistic simulations are essential for understanding complex systems. Statistical emulation with Gaussian processes (GPs) in functional data models have become a standard tool for this purpose. Traditional centralized processing of such models requires substantial computational and storage resources, leading to emerging distributed Bayesian learning algorithms that partition data into shards for distributed computations. However, concerns about the sensitivity of distributed inference to shard selection arise. Instead of using data shards, our approach employs multiple random matrices to create random linear projections, or sketches, of the dataset. Posterior inference on functional data models is conducted using random data sketches on various machines in parallel. These individual inferences are combined across machines at a central server. The aggregation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Neural Networks and Applications · Machine Learning and Algorithms
