LoRA-LiteE: A Computationally Efficient Framework for Chatbot Preference-Tuning
Yahe Yang, Chunliang Tao, Xiaojing Fan

TL;DR
LoRA-LiteE is a resource-efficient framework combining supervised fine-tuning, low-rank adaptation, and ensemble learning to improve chatbot preference tuning, achieving comparable performance to GPT-4 with less computational cost.
Contribution
This paper introduces LoRA-LiteE, a novel, scalable framework that effectively balances performance and computational efficiency for chatbot preference tuning.
Findings
LoRA-LiteE outperforms single large models under limited resources.
Achieves comparable results to GPT-4 trained with RLHF.
Enhances scalability and accessibility of preference-tuned chatbots.
Abstract
Effective preference tuning is pivotal in aligning chatbot responses with human expectations, enhancing user satisfaction and engagement. Traditional approaches, notably Reinforcement Learning from Human Feedback (RLHF) as employed in advanced models like GPT-4, have demonstrated considerable success in this domain. However, RLHF methods are often computationally intensive and resource-demanding, limiting their scalability and accessibility for broader applications. To address these challenges, this study introduces LoRA-Lite Ensemble (LoRA-LiteE), an innovative framework that combines Supervised Fine-tuning (SFT) with Low-Rank Adaptation (LoRA) and Ensemble Learning techniques to effectively aggregate predictions of lightweight models, which aim to achieve a balance between the performance and computational cost. Utilizing the Chatbot Arena benchmark dataset, we conduct a comprehensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in Service Interactions · Topic Modeling
MethodsAttention Is All You Need · Adam · Residual Connection · Byte Pair Encoding · Linear Layer · Absolute Position Encodings · Multi-Head Attention · Dense Connections · Label Smoothing · Softmax
