CollabLLM: From Passive Responders to Active Collaborators

Shirley Wu; Michel Galley; Baolin Peng; Hao Cheng; Gavin Li; Yao Dou; Weixin Cai; James Zou; Jure Leskovec; Jianfeng Gao

arXiv:2502.00640·cs.AI·July 31, 2025

CollabLLM: From Passive Responders to Active Collaborators

Shirley Wu, Michel Galley, Baolin Peng, Hao Cheng, Gavin Li, Yao Dou, Weixin Cai, James Zou, Jure Leskovec, Jianfeng Gao

PDF

Open Access 1 Video

TL;DR

CollabLLM is a new training framework for large language models that enhances their ability to actively collaborate over multiple turns, improving task performance, interactivity, and user satisfaction in long-term interactions.

Contribution

It introduces a collaborative simulation and multiturn-aware rewards for reinforcement fine-tuning, enabling LLMs to better understand user intent and provide more insightful, proactive responses.

Findings

01

18.5% higher task performance

02

46.3% improved interactivity

03

17.6% increase in user satisfaction

Abstract

Large Language Models are typically trained with next-turn rewards, limiting their ability to optimize for long-term interaction. As a result, they often respond passively to ambiguous or open-ended user requests, failing to help users reach their ultimate intents and leading to inefficient conversations. To address these limitations, we introduce CollabLLM, a novel and general training framework that enhances multiturn human-LLM collaboration. Its key innovation is a collaborative simulation that estimates the long-term contribution of responses using Multiturn-aware Rewards. By reinforcement fine-tuning these rewards, CollabLLM goes beyond responding to user requests, and actively uncovers user intent and offers insightful suggestions-a key step towards more human-centered AI. We also devise a multiturn interaction benchmark with three challenging tasks such as document creation.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

CollabLLM: From Passive Responders to Active Collaborators· slideslive

Taxonomy

TopicsSemantic Web and Ontologies