MindMirror: A Local-First Multimodal State-Aware Support System for Digital Workers

Wenqi Luo; Changbo Wang; Yan Wang

arXiv:2605.11700·cs.HC·May 13, 2026

MindMirror: A Local-First Multimodal State-Aware Support System for Digital Workers

Wenqi Luo, Changbo Wang, Yan Wang

PDF

TL;DR

MindMirror is a local-first, multimodal support system for digital workers that uses facial expressions, text, and speech to monitor and reflect on user states, enhancing productivity and well-being.

Contribution

The paper introduces MindMirror, a novel local-first multimodal system integrating facial expression analysis, LLM-based responses, and structured reflection for digital workers.

Findings

01

Fine-tuned emotion recognition model achieves 94.49% accuracy.

02

Users value local-first design and manual correction features.

03

Prototype validation shows improved user engagement and reliability.

Abstract

Digital workers often experience fatigue, anxiety, reduced attention, and task blockage during prolonged computer-based work. Existing productivity tools mainly focus on task completion, while general-purpose AI chatbots require users to formulate clear prompts before receiving useful help. This paper presents MindMirror, a local-first multimodal state-aware support system for digital workers. MindMirror integrates camera-based facial expression cues, text input, optional speech interaction, structured blockage reflection, local large language model (LLM)-based response generation, and daily/weekly review reports. The system forms a closed workflow of state checking, manual correction, structured articulation, suggestion generation, and state review. The current prototype follows a local-first design, while optional speech services may rely on third-party APIs when enabled. It is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.