Evaluating Scene-based In-Situ Item Labeling for Immersive Conversational Recommendation
Jiazhou Liang, Yifan Simon Liu, David Guo, Minqi Sun, Yilun Jiang, Scott Sanner

TL;DR
This paper introduces a new evaluation framework for immersive conversational recommendation systems that highlight items in scene-based environments, benchmarking multiple methods and identifying key limitations.
Contribution
It formalizes information needs for immersive CRS, proposes novel evaluation metrics, and benchmarks IR, LLM, and VLM methods across diverse scenarios.
Findings
Existing methods fail to leverage scenario-specific information modalities.
They present redundant, visually inferable information.
They poorly anticipate proactive user information needs.
Abstract
The growing ubiquity of Extended Reality (XR) is driving Conversational Recommendation Systems (CRS) toward visually immersive experiences. We formalize this paradigm as Immersive CRS (ICRS), where recommended items are highlighted directly in the user's scene-based visual environment and augmented with in-situ labels. While item recommendation has been widely studied, the problem of how to select and evaluate which information to present as immersive labels remains an open problem. To this end, we introduce a principled categorization of information needs into explicit intent satisfaction and proactive information needs and use these to define novel evaluation metrics for item label selection. We benchmark IR-, LLM-, and VLM-based methods across three datasets and ICRS scenarios: fashion, movie recommendation, and retail shopping. Our evaluation reveals three important limitations of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
