A Survey of Resource-efficient LLM and Multimodal Foundation Models

Mengwei Xu; Wangsong Yin; Dongqi Cai; Rongjie Yi; Daliang Xu; Qipeng; Wang; Bingyang Wu; Yihao Zhao; Chen Yang; Shihe Wang; Qiyang Zhang; Zhenyan; Lu; Li Zhang; Shangguang Wang; Yuanchun Li; Yunxin Liu; Xin Jin; Xuanzhe Liu

arXiv:2401.08092·cs.LG·September 24, 2024·32 cites

A Survey of Resource-efficient LLM and Multimodal Foundation Models

Mengwei Xu, Wangsong Yin, Dongqi Cai, Rongjie Yi, Daliang Xu, Qipeng, Wang, Bingyang Wu, Yihao Zhao, Chen Yang, Shihe Wang, Qiyang Zhang, Zhenyan, Lu, Li Zhang, Shangguang Wang, Yuanchun Li, Yunxin Liu, Xin Jin, Xuanzhe Liu

PDF

Open Access 1 Repo

TL;DR

This survey reviews recent advances in resource-efficient large foundation models, highlighting algorithmic and systemic strategies to reduce hardware costs while maintaining performance in LLMs and multimodal models.

Contribution

It provides a comprehensive analysis of current resource-efficient methods across architectures, training, serving, and system design for large foundation models.

Findings

01

Summarizes key resource-efficient techniques in model architectures and training.

02

Analyzes systemic approaches for scalable deployment.

03

Identifies gaps and future directions in resource-efficient modeling.

Abstract

Large foundation models, including large language models (LLMs), vision transformers (ViTs), diffusion, and LLM-based multimodal models, are revolutionizing the entire machine learning lifecycle, from training to deployment. However, the substantial advancements in versatility and performance these models offer come at a significant cost in terms of hardware resources. To support the growth of these large models in a scalable and environmentally sustainable way, there has been a considerable focus on developing resource-efficient strategies. This survey delves into the critical importance of such research, examining both algorithmic and systemic aspects. It offers a comprehensive analysis and valuable insights gleaned from existing literature, encompassing a broad array of topics from cutting-edge model architectures and training/serving algorithms to practical system designs and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ubiquitouslearning/efficient_foundation_model_survey
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling

MethodsFocus