Distributed Hybrid Sketching for $\ell_2$-Embeddings
Neophytos Charalambides, Arya Mazumdar

TL;DR
This paper introduces a novel distributed hybrid sketching technique for $ ext{ell}_2$-embeddings that improves data compression and privacy in large-scale linear algebra tasks by combining local and global randomized sketches.
Contribution
It proposes a new hybrid sketching method that applies multiple layers of randomized sketches to enhance embedding quality and efficiency in distributed $ ext{ell}_2$-subspace embeddings.
Findings
Hybrid sketching interpolates between existing methods, offering better trade-offs.
Numerical experiments validate the effectiveness of the proposed approach.
The method reduces communication and computation costs in distributed settings.
Abstract
Linear algebraic operations are ubiquitous in engineering applications, and arise often in a variety of fields including statistical signal processing and machine learning. With contemporary large datasets, to perform linear algebraic methods and regression tasks, it is necessary to resort to both distributed computations as well as data compression. In this paper, we study \textit{distributed} -subspace embeddings, a common technique used to efficiently perform linear regression. In our setting, data is distributed across multiple computing nodes and a goal is to minimize communication between the nodes and the coordinator in the distributed centralized network, while maintaining the geometry of the dataset. Furthermore, there is also the concern of keeping the data private and secure from potential adversaries. In this work, we address these issues through randomized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topicssemigroups and automata theory · Algorithms and Data Compression · Optimization and Search Problems
