Doing Personal LAPS: LLM-Augmented Dialogue Construction for   Personalized Multi-Session Conversational Search

Hideaki Joko; Shubham Chatterjee; Andrew Ramsay; Arjen P. de Vries,; Jeff Dalton; and Faegheh Hasibi

arXiv:2405.03480·cs.IR·May 7, 2024

Doing Personal LAPS: LLM-Augmented Dialogue Construction for Personalized Multi-Session Conversational Search

Hideaki Joko, Shubham Chatterjee, Andrew Ramsay, Arjen P. de Vries,, Jeff Dalton, and Faegheh Hasibi

PDF

1 Repo

TL;DR

This paper introduces LAPS, a novel method using large language models to efficiently generate large-scale, realistic, multi-session personalized dialogues, overcoming data scarcity for personalized conversational agents.

Contribution

LAPS leverages LLMs to guide human workers in creating diverse, multi-domain personalized dialogues, improving scalability and quality over expert-only data collection methods.

Findings

01

LAPS-produced dialogues are as natural and diverse as expert-created ones.

02

Responses using extracted preferences better match user preferences.

03

LAPS accelerates data collection for personalized conversational AI.

Abstract

The future of conversational agents will provide users with personalized information responses. However, a significant challenge in developing models is the lack of large-scale dialogue datasets that span multiple sessions and reflect real-world user preferences. Previous approaches rely on experts in a wizard-of-oz setup that is difficult to scale, particularly for personalized tasks. Our method, LAPS, addresses this by using large language models (LLMs) to guide a single human worker in generating personalized dialogues. This method has proven to speed up the creation process and improve quality. LAPS can collect large-scale, human-written, multi-session, and multi-domain conversations, including extracting user preferences. When compared to existing datasets, LAPS-produced conversations are as natural and diverse as expert-created ones, which stays in contrast with fully synthetic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

informagi/laps
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings