CURP: Codebook-based Continuous User Representation for Personalized Generation with LLMs

Liang Wang; Xinyi Mou; Xiaoyou Liu; Xuanjing Huang; Zhongyu Wei

arXiv:2602.00742·cs.CL·February 3, 2026

CURP: Codebook-based Continuous User Representation for Personalized Generation with LLMs

Liang Wang, Xinyi Mou, Xiaoyou Liu, Xuanjing Huang, Zhongyu Wei

PDF

Open Access

TL;DR

CURP introduces a codebook-based user modeling framework that enhances personalized generation with LLMs, achieving high performance, interpretability, and efficiency with minimal additional parameters.

Contribution

The paper presents a novel plug-and-play user representation method using a discrete codebook, improving personalization quality and scalability in LLM-based generation.

Findings

01

Outperforms strong baselines in variant generation tasks

02

Uses only about 20M trainable parameters for personalization

03

Offers better interpretability and scalability

Abstract

User modeling characterizes individuals through their preferences and behavioral patterns to enable personalized simulation and generation with Large Language Models (LLMs) in contemporary approaches. However, existing methods, whether prompt-based or training-based methods, face challenges in balancing personalization quality against computational and data efficiency. We propose a novel framework CURP, which employs a bidirectional user encoder and a discrete prototype codebook to extract multi-dimensional user traits. This design enables plug-and-play personalization with a small number of trainable parameters (about 20M parameters, about 0.2\% of the total model size). Through extensive experiments on variant generation tasks, we show that CURP achieves superior performance and generalization compared to strong baselines, while offering better interpretability and scalability. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Machine Learning in Healthcare · Natural Language Processing Techniques