Compute-Efficient Deep Learning: Algorithmic Trends and Opportunities
Brian R. Bartoldson, Bhavya Kailkhura, Davis Blalock

TL;DR
This paper provides a comprehensive overview of algorithmically-efficient deep learning, formalizing the problem, developing a taxonomy of methods, and discussing evaluation practices, bottlenecks, and future research directions to reduce training costs.
Contribution
It offers a structured taxonomy of algorithmic speedup methods, formalizes the problem, and discusses evaluation and bottleneck mitigation strategies in deep learning training.
Findings
Identification of commonalities among diverse speedup methods
Development of evaluation best practices for fair comparison
Analysis of training pipeline bottlenecks and mitigation strategies
Abstract
Although deep learning has made great progress in recent years, the exploding economic and environmental costs of training neural networks are becoming unsustainable. To address this problem, there has been a great deal of research on *algorithmically-efficient deep learning*, which seeks to reduce training costs not at the hardware or implementation level, but through changes in the semantics of the training program. In this paper, we present a structured and comprehensive overview of the research in this field. First, we formalize the *algorithmic speedup* problem, then we use fundamental building blocks of algorithmically efficient training to develop a taxonomy. Our taxonomy highlights commonalities of seemingly disparate methods and reveals current research gaps. Next, we present evaluation best practices to enable comprehensive, fair, and reliable comparisons of speedup…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMachine Learning and Data Classification · Explainable Artificial Intelligence (XAI) · Advanced Neural Network Applications
