Fast networked data selection via distributed smoothed quantile estimation
Xu Zhang, Marcos M. Vasconcelos

TL;DR
This paper introduces a fast, distributed method for selecting the most informative data in large, networked datasets by smoothing nonsmooth quantile estimation problems, improving scalability and convergence.
Contribution
It proposes an accelerated smoothing-based algorithm for distributed top-$k$ selection, addressing the slow convergence of nonsmooth convex optimization in networked data collection.
Findings
The method achieves faster convergence in distributed top-$k$ selection.
Numerical results validate the effectiveness and scalability of the proposed algorithm.
The approach effectively handles the lack of strong convexity in quantile estimation.
Abstract
Collecting the most informative data from a large dataset distributed over a network is a fundamental problem in many fields, including control, signal processing and machine learning. In this paper, we establish a connection between selecting the most informative data and finding the top- elements of a multiset. The top- selection in a network can be formulated as a distributed nonsmooth convex optimization problem known as quantile estimation. Unfortunately, the lack of smoothness in the local objective functions leads to extremely slow convergence and poor scalability with respect to the network size. To overcome the deficiency, we propose an accelerated method that employs smoothing techniques. Leveraging the piecewise linearity of the local objective functions in quantile estimation, we characterize the iteration complexity required to achieve top- selection, a challenging…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Neural Networks and Applications · Gaussian Processes and Bayesian Inference
