Do LLMs Recognize Your Latent Preferences? A Benchmark for Latent Information Discovery in Personalized Interaction

Ioannis Tsaknakis; Bingqing Song; Shuyu Gan; Dongyeop Kang; Alfredo Garcia; Gaowen Liu; Charles Fleming; Mingyi Hong

arXiv:2510.17132·cs.LG·October 21, 2025

Do LLMs Recognize Your Latent Preferences? A Benchmark for Latent Information Discovery in Personalized Interaction

Ioannis Tsaknakis, Bingqing Song, Shuyu Gan, Dongyeop Kang, Alfredo Garcia, Gaowen Liu, Charles Fleming, Mingyi Hong

PDF

Open Access

TL;DR

This paper introduces a benchmark to evaluate how well Large Language Models can uncover and reason about users' hidden preferences through multi-turn conversations across various realistic scenarios.

Contribution

It presents the first systematic benchmark for assessing latent information discovery in LLMs during personalized interactions, covering three realistic settings.

Findings

01

LLMs can surface latent information through dialogue.

02

Success rates vary from 32% to 98% depending on context.

03

Benchmark reveals significant variability in LLMs' ability to infer preferences.

Abstract

Large Language Models (LLMs) excel at producing broadly relevant text, but this generality becomes a limitation when user-specific preferences are required, such as recommending restaurants or planning travel. In these scenarios, users rarely articulate every preference explicitly; instead, much of what they care about remains latent, waiting to be inferred. This raises a fundamental question: Can LLMs uncover and reason about such latent information through conversation? We address this problem by introducing a unified benchmark for evaluating latent information discovery - the ability of LLMs to reveal and utilize hidden user attributes through multi-turn interaction. The benchmark spans three progressively realistic settings: the classic 20 Questions game, Personalized Question Answering, and Personalized Text Summarization. All tasks share a tri-agent framework (User, Assistant,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Speech and dialogue systems