LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation Models
Shizhe Diao, Rui Pan, Hanze Dong, Ka Shun Shum, Jipeng Zhang, Wei, Xiong, Tong Zhang

TL;DR
LMFlow is an extensible toolkit designed to simplify and accelerate the domain-specific fine-tuning and inference of large foundation models, supporting various training and optimization techniques.
Contribution
It introduces a comprehensive, lightweight toolkit that streamlines specialized training of foundation models with limited resources and supports multiple fine-tuning and inference methods.
Findings
Supports continuous pretraining and instruction tuning
Enables parameter-efficient finetuning and alignment tuning
Facilitates inference acceleration and multimodal finetuning
Abstract
Foundation models have demonstrated a great ability to achieve general human-level intelligence far beyond traditional approaches. As the technique keeps attracting attention from the AI community, an increasing number of foundation models are becoming publicly accessible. However, a significant shortcoming of most of these models lies in their performance in specialized-domain and task-specific applications, necessitating domain- and task-aware fine-tuning to develop effective scientific language models. As the number of available foundation models and specialized tasks keeps growing, the job of training scientific language models becomes highly nontrivial. In this paper, we initiate steps to tackle this issue. We introduce an extensible and lightweight toolkit, LMFlow, which aims to simplify the domain- and task-aware finetuning of general foundation models. LMFlow offers a complete…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗OptimalScale/robin-7b-v2-deltamodel· 863 dl· ♡ 11863 dl♡ 11
- 🤗OptimalScale/robin-13b-v2-deltamodel· 870 dl· ♡ 7870 dl♡ 7
- 🤗OptimalScale/robin-33b-v2-deltamodel· 848 dl· ♡ 8848 dl♡ 8
- 🤗OptimalScale/robin-65b-v2-deltamodel· 890 dl· ♡ 12890 dl♡ 12
- 🤗weqweasdas/hh_rlhf_rm_open_llama_3bmodel· 18 dl· ♡ 1718 dl♡ 17
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Ferroelectric and Negative Capacitance Devices · Machine Learning and Data Classification
