Quantized Adaptive Subgradient Algorithms and Their Applications
Ke Xu, Jianqiao Wangni, Yifan Zhang, Deheng Ye, Jiaxiang Wu, Peilin, Zhao

TL;DR
This paper introduces quantized adaptive subgradient algorithms for distributed machine learning, balancing communication efficiency, model sparsity, and performance through novel quantization and theoretical analysis.
Contribution
It proposes new quantized adaptive subgradient algorithms that effectively reduce communication costs while maintaining model accuracy and sparsity in distributed training.
Findings
Algorithms reduce communication costs significantly.
Theoretical analysis links quantization error to convergence.
Empirical results confirm improved efficiency and performance.
Abstract
Data explosion and an increase in model size drive the remarkable advances in large-scale machine learning, but also make model training time-consuming and model storage difficult. To address the above issues in the distributed model training setting which has high computation efficiency and less device limitation, there are still two main difficulties. On one hand, the communication costs for exchanging information, e.g., stochastic gradients among different workers, is a key bottleneck for distributed training efficiency. On the other hand, less parameter model is easy for storage and communication, but the risk of damaging the model performance. To balance the communication costs, model capacity and model performance simultaneously, we propose quantized composite mirror descent adaptive subgradient (QCMD adagrad) and quantized regularized dual average adaptive subgradient (QRDA…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and ELM · Sparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques
MethodsAdaGrad
