Beyond the Click: A Framework for Inferring Cognitive Traces in Search
Saber Zerhoudi, Michael Granitzer

TL;DR
This paper introduces a framework that infers users' cognitive states from behavioral logs using a multi-agent LLM grounded in IFT, enhancing user simulation by capturing underlying thoughts like confusion or satisfaction.
Contribution
The authors develop a novel method for annotating cognitive states in user sessions, validated across multiple datasets, and release tools and data for future research in cognitively aware user simulation.
Findings
Cognitive labels improve F1 by up to 6.6% on MovieLens.
The model performs best where behavioral signals are weak.
The approach is validated across three public datasets.
Abstract
User simulators are essential for evaluating search systems, but they primarily reproduce user actions without modeling the underlying thought process. Large-scale interaction logs record what users do, but not what they might be thinking or feeling, such as confusion or satisfaction. We present a framework for inferring cognitive traces from behavioral logs. Our method uses a multi-agent LLM system grounded in Information Foraging Theory (IFT) and validated by human experts. We annotate three public datasets (AOL, Stack Overflow, and MovieLens), producing over 530,000 cognitive labels across 50,000 sessions. A cross-dataset evaluation with a shuffled-label control reveals that cognitive labels provide the strongest signal where behavioral features are weakest: on MovieLens, the cognitive model improves F1 by up to 6.6% over the behavioral baseline and 1.8% above the shuffled control,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
