Efficient Deep Learning: A Survey on Making Deep Learning Models   Smaller, Faster, and Better

Gaurav Menghani

arXiv:2106.08962·cs.LG·April 17, 2024·37 cites

Efficient Deep Learning: A Survey on Making Deep Learning Models Smaller, Faster, and Better

Gaurav Menghani

PDF

Open Access 1 Repo

TL;DR

This survey comprehensively reviews techniques and hardware considerations for making deep learning models smaller, faster, and more resource-efficient, providing practical guidance and insights for researchers and practitioners.

Contribution

It is the first extensive survey covering model efficiency techniques, infrastructure, and hardware, with an experimental guide and code for optimization.

Findings

01

Identifies five core areas of model efficiency.

02

Provides a practical guide with code for optimization.

03

Highlights the importance of hardware in model efficiency.

Abstract

Deep Learning has revolutionized the fields of computer vision, natural language understanding, speech recognition, information retrieval and more. However, with the progressive improvements in deep learning models, their number of parameters, latency, resources required to train, etc. have all have increased significantly. Consequently, it has become important to pay attention to these footprint metrics of a model as well, not just its quality. We present and motivate the problem of efficiency in deep learning, followed by a thorough survey of the five core areas of model efficiency (spanning modeling techniques, infrastructure, and hardware) and the seminal work there. We also present an experiment-based guide along with code, for practitioners to optimize their model training and deployment. We believe this is the first comprehensive survey in the efficient deep learning space that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

reddragon/efficient-dl-survey-paper
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Anomaly Detection Techniques and Applications · Machine Learning and Data Classification