Covariance estimation for vertically partitioned data in a distributed environment
Aruna Govada, Sanjay K. Sahay

TL;DR
This paper introduces a communication-efficient distributed algorithm for estimating the covariance matrix from geographically partitioned data, achieving results identical to centralized methods with improved computational speed.
Contribution
It presents a novel distributed approach for covariance estimation that reduces communication costs and balances computational load, maintaining accuracy equivalent to centralized methods.
Findings
Exact covariance estimation matching centralized results
Significant speed-up due to parallel local computations
Scalability demonstrated on large datasets
Abstract
The major sources of abundant data are constantly expanding with the available data collection methodologies in various applications - medical, insurance, scientific, bio-informatics and business. These data sets may be distributed geographically, rich in size and as well as dimensions also. To analyze these data sets to find out the hidden patterns, it is required to down- load the data to a centralized site which is a challenging task in terms of the limited bandwidth available and computationally also expensive. The covariance matrix is one of the methods to estimate the relation between any two dimensions. In this paper, we propose a communication efficient algorithm to estimate the covariance matrix in a distributed manner. The global covariance matrix is computed by merging the local covariance matrices using a distributed approach. The results show that it is exactly same as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Human Mobility and Location-Based Analysis · Advanced Clustering Algorithms Research
