LENS: LLM-Enabled Narrative Synthesis for Mental Health by Aligning Multimodal Sensing with Language Models

Wenxuan Xu; Arvind Pillai; Subigya Nepal; Amanda C Collins; Daniel M Mackin; Michael V Heinz; Tess Z Griffin; Nicholas C Jacobson; Andrew Campbell

arXiv:2512.23025·cs.CL·May 12, 2026

LENS: LLM-Enabled Narrative Synthesis for Mental Health by Aligning Multimodal Sensing with Language Models

Wenxuan Xu, Arvind Pillai, Subigya Nepal, Amanda C Collins, Daniel M Mackin, Michael V Heinz, Tess Z Griffin, Nicholas C Jacobson, Andrew Campbell

PDF

TL;DR

LENS is a novel framework that aligns multimodal health sensor data with language models to generate meaningful mental health narratives, improving clinical interpretability and decision support.

Contribution

The paper introduces a large-scale dataset, a patch-level encoder for raw sensor integration, and demonstrates improved narrative generation for mental health assessment.

Findings

01

LENS outperforms baselines on NLP and symptom severity metrics.

02

User study shows narratives are comprehensive and clinically meaningful.

03

Constructed over 100,000 sensor-text QA pairs from 258 participants.

Abstract

Multimodal health sensing offers rich behavioral signals for assessing mental health, yet translating these numerical time-series measurements into natural language remains challenging. Current LLMs cannot natively ingest long-duration sensor streams, and paired sensor-text datasets are scarce. To address these challenges, we introduce LENS, a framework that aligns multimodal sensing data with language models to generate clinically grounded mental-health narratives. LENS first constructs a large-scale dataset by transforming Ecological Momentary Assessment (EMA) responses related to depression and anxiety symptoms into natural-language descriptions, yielding over 100,000 sensor-text QA pairs from 258 participants. To enable native time-series integration, we train a patch-level encoder that projects raw sensor signals directly into an LLM's representation space. Our results show that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.