Matching DNN Compression and Cooperative Training with Resources and   Data Availability

Francesco Malandrino; Giuseppe Di Giacomo; Armin Karamzade and; Marco Levorato; Carla Fabiana Chiasserini

arXiv:2212.02304·cs.LG·December 6, 2022·1 cites

Matching DNN Compression and Cooperative Training with Resources and Data Availability

Francesco Malandrino, Giuseppe Di Giacomo, Armin Karamzade and, Marco Levorato, Carla Fabiana Chiasserini

PDF

Open Access

TL;DR

This paper presents PACT, a novel framework for jointly optimizing DNN compression and training resource allocation, improving efficiency and performance in resource-constrained environments.

Contribution

It introduces a formal multi-dimensional model and an approximate dynamic programming solution for joint decision-making in DNN training and compression.

Findings

01

PACT outperforms state-of-the-art methods in energy efficiency.

02

The approach closely matches the optimal solutions in various settings.

03

The method is polynomial-time and adaptable to different resource scenarios.

Abstract

To make machine learning (ML) sustainable and apt to run on the diverse devices where relevant data is, it is essential to compress ML models as needed, while still meeting the required learning quality and time performance. However, how much and when an ML model should be compressed, and {\em where} its training should be executed, are hard decisions to make, as they depend on the model itself, the resources of the available nodes, and the data such nodes own. Existing studies focus on each of those aspects individually, however, they do not account for how such decisions can be made jointly and adapted to one another. In this work, we model the network system focusing on the training of DNNs, formalize the above multi-dimensional problem, and, given its NP-hardness, formulate an approximate dynamic programming problem that we solve through the PACT algorithmic framework. Importantly,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Algorithms and Data Compression · Stochastic Gradient Optimization Techniques