Unsupervised Human Preference Learning

Sumuk Shashidhar; Abhinav Chinta; Vaibhav Sahai; Dilek Hakkani-T\"ur

arXiv:2410.03731·cs.CL·December 10, 2024

Unsupervised Human Preference Learning

Sumuk Shashidhar, Abhinav Chinta, Vaibhav Sahai, Dilek Hakkani-T\"ur

PDF

Open Access 1 Video

TL;DR

This paper introduces a novel personalization method for large language models using small preference agent models to generate guiding rules, enabling efficient, fine-tuning-free customization based on individual user preferences.

Contribution

It proposes a new approach where small models act as preference agents to steer larger models, improving personalization without fine-tuning the large model.

Findings

01

Significantly outperforms baseline personalization methods

02

Enables data-efficient customization of large language models

03

Demonstrates effectiveness on email and article datasets

Abstract

Large language models demonstrate impressive reasoning abilities but struggle to provide personalized content due to their lack of individual user preference information. Existing methods, such as in-context learning and parameter-efficient fine-tuning, fall short in capturing the complexity of human preferences, especially given the small, personal datasets individuals possess. In this paper, we propose a novel approach utilizing small parameter models as preference agents to generate natural language rules that guide a larger, pre-trained model, enabling efficient personalization. Our method involves a small, local "steering wheel" model that directs the outputs of a much larger foundation model, producing content tailored to an individual's preferences while leveraging the extensive knowledge and capabilities of the large model. Importantly, this personalization is achieved without…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Unsupervised Human Preference Learning· underline

Taxonomy

TopicsData Management and Algorithms