Pufferfish: Communication-efficient Models At No Extra Cost

Hongyi Wang; Saurabh Agarwal; Dimitris Papailiopoulos

arXiv:2103.03936·cs.LG·March 9, 2021·5 cites

Pufferfish: Communication-efficient Models At No Extra Cost

Hongyi Wang, Saurabh Agarwal, Dimitris Papailiopoulos

PDF

Open Access 1 Repo

TL;DR

Pufferfish is a novel distributed training framework that reduces communication costs and computational overheads by integrating gradient compression into low-rank, pre-factorized models, maintaining accuracy and improving training speed.

Contribution

It introduces a method to incorporate gradient compression directly into model training via low-rank factorization, eliminating additional computational overheads and preserving accuracy.

Findings

01

Achieves up to 1.64x speedup in distributed training.

02

Maintains state-of-the-art accuracy with compressed models.

03

Produces smaller, more accurate models than existing pruning methods.

Abstract

To mitigate communication overheads in distributed model training, several studies propose the use of compressed stochastic gradients, usually achieved by sparsification or quantization. Such techniques achieve high compression ratios, but in many cases incur either significant computational overheads or some accuracy loss. In this work, we present Pufferfish, a communication and computation efficient distributed training framework that incorporates the gradient compression into the model training process via training low-rank, pre-factorized deep networks. Pufferfish not only reduces communication, but also completely bypasses any computation overheads related to compression, and achieves the same accuracy as state-of-the-art, off-the-shelf deep models. Pufferfish can be directly integrated into current deep learning frameworks with minimum implementation modification. Our extensive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hwang595/Pufferfish
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis

MethodsPruning