LoRA-LiteE: A Computationally Efficient Framework for Chatbot   Preference-Tuning

Yahe Yang; Chunliang Tao; Xiaojing Fan

arXiv:2411.09947·cs.CL·January 9, 2025·5 cites

LoRA-LiteE: A Computationally Efficient Framework for Chatbot Preference-Tuning

Yahe Yang, Chunliang Tao, Xiaojing Fan

PDF

Open Access

TL;DR

LoRA-LiteE is a resource-efficient framework combining supervised fine-tuning, low-rank adaptation, and ensemble learning to improve chatbot preference tuning, achieving comparable performance to GPT-4 with less computational cost.

Contribution

This paper introduces LoRA-LiteE, a novel, scalable framework that effectively balances performance and computational efficiency for chatbot preference tuning.

Findings

01

LoRA-LiteE outperforms single large models under limited resources.

02

Achieves comparable results to GPT-4 trained with RLHF.

03

Enhances scalability and accessibility of preference-tuned chatbots.

Abstract

Effective preference tuning is pivotal in aligning chatbot responses with human expectations, enhancing user satisfaction and engagement. Traditional approaches, notably Reinforcement Learning from Human Feedback (RLHF) as employed in advanced models like GPT-4, have demonstrated considerable success in this domain. However, RLHF methods are often computationally intensive and resource-demanding, limiting their scalability and accessibility for broader applications. To address these challenges, this study introduces LoRA-Lite Ensemble (LoRA-LiteE), an innovative framework that combines Supervised Fine-tuning (SFT) with Low-Rank Adaptation (LoRA) and Ensemble Learning techniques to effectively aggregate predictions of lightweight models, which aim to achieve a balance between the performance and computational cost. Utilizing the Chatbot Arena benchmark dataset, we conduct a comprehensive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI in Service Interactions · Topic Modeling

MethodsAttention Is All You Need · Adam · Residual Connection · Byte Pair Encoding · Linear Layer · Absolute Position Encodings · Multi-Head Attention · Dense Connections · Label Smoothing · Softmax