Lightweight Deep Learning for Resource-Constrained Environments: A Survey
Hou-I Liu, Marco Galindo, Hongxia Xie, Lai-Kuan Wong, Hong-Han Shuai,, Yung-Hui Li, and Wen-Huang Cheng

TL;DR
This survey reviews methods for designing, compressing, and accelerating deep learning models suitable for resource-limited devices like mobile phones and microcontrollers, highlighting future directions in TinyML and large language models.
Contribution
It provides comprehensive guidance on lightweight model design, compression, and hardware acceleration, and discusses future deployment challenges for TinyML and large language models.
Findings
Guidance on designing lightweight deep learning models.
Analysis of compression and hardware acceleration techniques.
Identification of future challenges in deploying TinyML and large language models.
Abstract
Over the past decade, the dominance of deep learning has prevailed across various domains of artificial intelligence, including natural language processing, computer vision, and biomedical signal processing. While there have been remarkable improvements in model accuracy, deploying these models on lightweight devices, such as mobile phones and microcontrollers, is constrained by limited resources. In this survey, we provide comprehensive design guidance tailored for these devices, detailing the meticulous design of lightweight models, compression methods, and hardware acceleration strategies. The principal goal of this work is to explore methods and concepts for getting around hardware constraints without compromising the model's accuracy. Additionally, we explore two notable paths for lightweight deep learning in the future: deployment techniques for TinyML and Large Language Models.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · IoT and Edge/Fog Computing · Topic Modeling
