Accelerating Optimization and Machine Learning through Decentralization
Ziqin Chen, Zuang Wang, Yongqiang Wang

TL;DR
Decentralized optimization can accelerate convergence in machine learning tasks, outperforming centralized methods in iteration efficiency while preserving privacy and scalability.
Contribution
This work demonstrates that decentralization can paradoxically speed up convergence, challenging the view of it as merely a compromise due to communication or privacy constraints.
Findings
Decentralized methods converge faster than centralized ones in logistic regression.
Distributed training of neural networks can outperform centralized training in iteration count.
Decentralization offers a strategic advantage for efficient optimization.
Abstract
Decentralized optimization enables multiple devices to learn a global machine learning model while each individual device only has access to its local dataset. By avoiding the need for training data to leave individual users' devices, it enhances privacy and scalability compared to conventional centralized learning, where all data has to be aggregated to a central server. However, decentralized optimization has traditionally been viewed as a necessary compromise, used only when centralized processing is impractical due to communication constraints or data privacy concerns. In this study, we show that decentralization can paradoxically accelerate convergence, outperforming centralized methods in the number of iterations needed to reach optimal solutions. Through examples in logistic regression and neural network training, we demonstrate that distributing data and computation across…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
