Causal Stories from Sensor Traces: Auditing Epistemic Overreach in LLM-Generated Personal Sensing Explanations
Shanshan Zhu, Han Zhang, J. Doris Chi, Subigya Nepal, and Koustuv Saha

TL;DR
This paper introduces the concept of epistemic overreach in LLM-generated personal sensing explanations, demonstrating that models often overstate causal claims beyond available evidence, with minimal mitigation from prompting strategies.
Contribution
It defines epistemic overreach as a new measure, audits its occurrence across datasets and models, and highlights the need for evidential grounding in explanations.
Findings
LLMs frequently make unsupported causal claims in explanations.
Providing more context does not significantly reduce epistemic overreach.
Bounded prompting helps but does not fully eliminate overreach.
Abstract
LLMs are increasingly used to explain personal sensing data, translating traces of activity and mood into natural-language accounts of why an anomalous day may have occurred. However, such explanations can sound coherent and personally meaningful even when the underlying evidence is sparse or missing. We introduce epistemic overreach (EO) as a measure for cases where a generated explanation implies more than the available sensing evidence can justify. To audit how often and in what forms EO occurs, we obtained anomalous-day scenarios from three longitudinal sensing datasets of college students: StudentLife, GLOBEM, and CollegeExperience. Across activity, sleep, and affect anomalies, we generated 14,922 explanations using three LLM families -- Llama, Qwen, and GPT -- under two prompting conditions: one minimally constrained prompt and another prompt explicitly instructing models to bound…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
