MindMirror: A Local-First Multimodal State-Aware Support System for Digital Workers
Wenqi Luo, Changbo Wang, Yan Wang

TL;DR
MindMirror is a local-first, multimodal support system for digital workers that uses facial expressions, text, and speech to monitor and reflect on user states, enhancing productivity and well-being.
Contribution
The paper introduces MindMirror, a novel local-first multimodal system integrating facial expression analysis, LLM-based responses, and structured reflection for digital workers.
Findings
Fine-tuned emotion recognition model achieves 94.49% accuracy.
Users value local-first design and manual correction features.
Prototype validation shows improved user engagement and reliability.
Abstract
Digital workers often experience fatigue, anxiety, reduced attention, and task blockage during prolonged computer-based work. Existing productivity tools mainly focus on task completion, while general-purpose AI chatbots require users to formulate clear prompts before receiving useful help. This paper presents MindMirror, a local-first multimodal state-aware support system for digital workers. MindMirror integrates camera-based facial expression cues, text input, optional speech interaction, structured blockage reflection, local large language model (LLM)-based response generation, and daily/weekly review reports. The system forms a closed workflow of state checking, manual correction, structured articulation, suggestion generation, and state review. The current prototype follows a local-first design, while optional speech services may rely on third-party APIs when enabled. It is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
