ThoughtTrace: Understanding User Thoughts in Real-World LLM Interactions

Chuanyang Jin; Binze Li; Haopeng Xie; Cathy Mengying Fang; Tianjian Li; Shayne Longpre; Hongxiang Gu; Maximillian Chen; Tianmin Shu

arXiv:2605.20087·cs.CL·May 20, 2026

ThoughtTrace: Understanding User Thoughts in Real-World LLM Interactions

Chuanyang Jin, Binze Li, Haopeng Xie, Cathy Mengying Fang, Tianjian Li, Shayne Longpre, Hongxiang Gu, Maximillian Chen, Tianmin Shu

PDF

1 Repo 1 Datasets

TL;DR

ThoughtTrace introduces a large-scale dataset pairing human-AI conversations with users' self-reported thoughts, enabling better understanding of user cognition and improving AI personalization.

Contribution

It is the first dataset to include user thoughts in real-world multi-turn conversations, facilitating research on cognitive dynamics in human-AI interactions.

Findings

01

Thoughts are semantically distinct from messages and hard for LLMs to infer.

02

Thoughts improve user-behavior prediction and aid in personalized assistant training.

03

ThoughtTrace captures diverse, long-horizon interactions across multiple language models.

Abstract

Conversational AI has now reached billions of users, yet existing datasets capture only what people say, not what they think. We introduce ThoughtTrace, the first large-scale dataset that pairs real-world multi-turn human--AI conversations with users' self-reported thoughts: their reasons for sending prompts and reactions to assistant responses. ThoughtTrace comprises 1,058 users, 2,155 conversations, 17,058 turns, and 10,174 thought annotations collected across 20 language models. Our analysis shows that ThoughtTrace captures long-horizon, topically diverse interactions, and that thoughts are semantically distinct from messages, difficult for frontier LLMs to infer from context, diverse in content, and tied to conversation stages. We further demonstrate the utility of thoughts for downstream modeling. First, thoughts improve user-behavior prediction as inference-time context. Second,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

thoughttrace-project/thoughttrace
github

Datasets

SCAI-JHU/ThoughtTrace
dataset· 3.7k dl
3.7k dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.