Energy Efficient Protein Language Models: Leveraging Small Language Models with LoRA for Controllable Protein Generation
Aayush Shah, Shankar Jayaratnam

TL;DR
This paper introduces small, energy-efficient protein language models using LoRA, capable of controllable protein generation with high structural accuracy, reducing training costs and computational resources significantly.
Contribution
The work presents novel small protein language models based on Llama-3-8B and Phi-3-mini, utilizing LoRA to enable controllable protein generation with reduced training time and energy consumption.
Findings
Achieved an average pLDDT score of 69.75 for protein structure generation.
Attained a TM-Score of 0.84 for controllable protein generation.
Reduced training time by 70% and model size by up to 60%.
Abstract
Large language models (LLMs) have demonstrated significant success in natural language processing (NLP) tasks and have shown promising results in other domains such as protein sequence generation. However, there remain salient differences between LLMs used for NLP, which effectively handle multiple tasks and are available in small sizes, and protein language models that are often specialized for specific tasks and only exist in larger sizes. In this work, we introduce two small protein language models, based on Llama-3-8B and Phi-3-mini, that are capable of both uncontrollable and controllable protein generation. For the uncontrollable generation task, our best model achieves an average pLDDT score of 69.75, demonstrating robust performance in generating viable protein structures. For the controllable generation task, in which the model generates proteins according to properties…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗Esperanto/Protein-Llama-3-8Bmodel· 24 dl· ♡ 524 dl♡ 5
- 🤗Esperanto/Protein-Phi-3-minimodel· 7 dl· ♡ 27 dl♡ 2
- 🤗Esperanto/Protein-Llama-3-8B-kvc-fp16-onnxmodel· 3 dl3 dl
- 🤗RichardErkhov/Esperanto_-_Protein-Llama-3-8B-4bitsmodel· 1 dl1 dl
- 🤗RichardErkhov/Esperanto_-_Protein-Llama-3-8B-8bitsmodel
- 🤗RichardErkhov/Esperanto_-_Protein-Llama-3-8B-awqmodel
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Machine Learning in Bioinformatics · Software Engineering Research
MethodsLLaMA
