Text-to-LoRA: Instant Transformer Adaption

Rujikorn Charakorn; Edoardo Cetin; Yujin Tang; Robert Tjarko Lange

arXiv:2506.06105·cs.LG·June 10, 2025

Text-to-LoRA: Instant Transformer Adaption

Rujikorn Charakorn, Edoardo Cetin, Yujin Tang, Robert Tjarko Lange

PDF

Open Access 1 Datasets

TL;DR

Text-to-LoRA (T2L) is a hypernetwork that rapidly adapts large language models to new tasks using natural language descriptions, eliminating the need for extensive fine-tuning and enabling zero-shot generalization.

Contribution

We introduce T2L, a hypernetwork that constructs LoRA adapters from natural language descriptions, allowing fast, task-specific adaptation of LLMs without costly fine-tuning.

Findings

01

T2L matches the performance of task-specific adapters.

02

It can compress and generalize across multiple tasks.

03

Zero-shot generalization to unseen tasks is achieved.

Abstract

While Foundation Models provide a general tool for rapid content creation, they regularly require task-specific adaptation. Traditionally, this exercise involves careful curation of datasets and repeated fine-tuning of the underlying model. Fine-tuning techniques enable practitioners to adapt foundation models for many new applications but require expensive and lengthy training while being notably sensitive to hyperparameter choices. To overcome these limitations, we introduce Text-to-LoRA (T2L), a model capable of adapting large language models (LLMs) on the fly solely based on a natural language description of the target task. T2L is a hypernetwork trained to construct LoRAs in a single inexpensive forward pass. After training T2L on a suite of 9 pre-trained LoRA adapters (GSM8K, Arc, etc.), we show that the ad-hoc reconstructed LoRA instances match the performance of task-specific…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

olzhasAl/adilet-legal-qa-kz
dataset· 34 dl
34 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Artificial Intelligence in Healthcare and Education · Multimodal Machine Learning Applications