BoRA: Bayesian Hierarchical Low-Rank Adaption for Multi-Task Large   Language Models

Simen Eide; Arnoldo Frigessi

arXiv:2407.15857·cs.LG·January 8, 2025

BoRA: Bayesian Hierarchical Low-Rank Adaption for Multi-Task Large Language Models

Simen Eide, Arnoldo Frigessi

PDF

1 Repo

TL;DR

BoRA introduces a Bayesian hierarchical low-rank adaptation method for multi-task LLM finetuning, enabling better task sharing and specialization, leading to improved performance over existing approaches.

Contribution

It presents a novel Bayesian hierarchical approach for low-rank adaptation in multi-task LLM finetuning, addressing trade-offs between task-specific and unified models.

Findings

01

Outperforms individual and unified models in perplexity and generalization

02

Enables effective knowledge sharing among related tasks

03

Improves finetuning efficiency for multi-task large language models

Abstract

This paper introduces Bayesian Hierarchical Low-Rank Adaption (BoRA), a novel method for finetuning multi-task Large Language Models (LLMs). Current finetuning approaches, such as Low-Rank Adaption (LoRA), perform exeptionally well in reducing training parameters and memory usage but face limitations when applied to multiple similar tasks. Practitioners usually have to choose between training separate models for each task or a single model for all tasks, both of which come with trade-offs in specialization and data utilization. BoRA addresses these trade-offs by leveraging a Bayesian hierarchical model that allows tasks to share information through global hierarchical priors. This enables tasks with limited data to benefit from the overall structure derived from related tasks while allowing tasks with more data to specialize. Our experimental results show that BoRA outperforms both…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

simeneide/bora
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.