Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights

Zhiyuan Liang; Dongwen Tang; Yuhao Zhou; Xuanlei Zhao; Mingjia Shi; Wangbo Zhao; Zekai Li; Peihao Wang; Konstantin Sch\"urholt; Damian Borth; Michael M. Bronstein; Yang You; Zhangyang Wang; Kai Wang

arXiv:2506.16406·cs.LG·June 23, 2025

Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights

Zhiyuan Liang, Dongwen Tang, Yuhao Zhou, Xuanlei Zhao, Mingjia Shi, Wangbo Zhao, Zekai Li, Peihao Wang, Konstantin Sch\"urholt, Damian Borth, Michael M. Bronstein, Yang You, Zhangyang Wang, Kai Wang

PDF

Open Access 5 Datasets

TL;DR

The paper introduces Drag-and-Drop LLMs (DnD), a prompt-conditioned parameter generator that rapidly produces task-specific LoRA weights without training, significantly reducing overhead and improving performance on various benchmarks.

Contribution

DnD is a novel method that maps prompts directly to LoRA weights, eliminating the need for per-task training and enabling fast, zero-shot adaptation of large language models.

Findings

01

Up to 12,000× lower overhead than full fine-tuning

02

Up to 30% performance gains on unseen tasks

03

Robust cross-domain generalization without target data

Abstract

Modern Parameter-Efficient Fine-Tuning (PEFT) methods such as low-rank adaptation (LoRA) reduce the cost of customizing large language models (LLMs), yet still require a separate optimization run for every downstream dataset. We introduce \textbf{Drag-and-Drop LLMs (\textit{DnD})}, a prompt-conditioned parameter generator that eliminates per-task training by mapping a handful of unlabeled task prompts directly to LoRA weight updates. A lightweight text encoder distills each prompt batch into condition embeddings, which are then transformed by a cascaded hyper-convolutional decoder into the full set of LoRA matrices. Once trained in a diverse collection of prompt-checkpoint pairs, DnD produces task-specific parameters in seconds, yielding i) up to \textbf{12,000 $\times$ } lower overhead than full fine-tuning, ii) average gains up to \textbf{30\%} in performance over the strongest training…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications