Beyond Efficiency: A Systematic Survey of Resource-Efficient Large Language Models
Guangji Bai, Zheng Chai, Chen Ling, Shiyu Wang, Jiaying Lu, Nan Zhang,, Tingwei Shi, Ziyang Yu, Mengdan Zhu, Yifei Zhang, Xinyuan Song, Carl Yang,, Yue Cheng, Liang Zhao

TL;DR
This survey comprehensively reviews techniques to improve resource efficiency in large language models across their lifecycle, addressing computational, memory, energy, and financial challenges to promote sustainable AI development.
Contribution
It introduces a detailed categorization of resource efficiency methods, evaluation metrics, and datasets, providing a foundational reference for future research in sustainable LLMs.
Findings
Categorized resource efficiency techniques by resource type and lifecycle stage
Presented standardized evaluation metrics and datasets for fair comparison
Identified open research challenges and future directions
Abstract
The burgeoning field of Large Language Models (LLMs), exemplified by sophisticated models like OpenAI's ChatGPT, represents a significant advancement in artificial intelligence. These models, however, bring forth substantial challenges in the high consumption of computational, memory, energy, and financial resources, especially in environments with limited resource capabilities. This survey aims to systematically address these challenges by reviewing a broad spectrum of techniques designed to enhance the resource efficiency of LLMs. We categorize methods based on their optimization focus: computational, memory, energy, financial, and network resources and their applicability across various stages of an LLM's lifecycle, including architecture design, pretraining, finetuning, and system design. Additionally, the survey introduces a nuanced categorization of resource efficiency techniques…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Ferroelectric and Negative Capacitance Devices
MethodsSparse Evolutionary Training
