Output Composability of QLoRA PEFT Modules for Plug-and-Play Attribute-Controlled Text Generation

Michela Lorandi; Anya Belz

arXiv:2605.12345·cs.CL·May 13, 2026

Output Composability of QLoRA PEFT Modules for Plug-and-Play Attribute-Controlled Text Generation

Michela Lorandi, Anya Belz

PDF

TL;DR

This paper investigates methods for combining PEFT modules, specifically QLoRA, for flexible, attribute-controlled text generation, demonstrating effective output composition techniques across multiple models and tasks.

Contribution

It introduces and evaluates three approaches for generalizing PEFT modules beyond single-task training, highlighting the effectiveness of output summation for composition.

Findings

01

Summing PEFT module outputs often outperforms other composition methods.

02

Three-module output composition improves sentiment control performance by 2% points.

03

Output composition maintains performance even when compared to specialized single-task modules.

Abstract

Parameter-efficient fine-tuning (PEFT) techniques offer task-specific fine-tuning at a fraction of the cost of full fine-tuning, but require separate fine-tuning for every new task (combination). In this paper, we explore three ways of generalising beyond single-task training/inference: (i) training on combinations of multiple, related datasets; (ii) at inference, composing the weight matrices of separately trained PEFT modules; and (iii) at inference, composing the outputs of separately trained PEFT modules. We test these approaches on three different LLMs, QLoRA as the PEFT technique, and three sets of controlled text generation datasets for sentiment control, topic control, and multi-attribute control. We find that summing PEFT module outputs is a particularly strong composition method, which consistently either outperforms or matches the performance of alternative approaches. This…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.