VisionClaw: Always-On AI Agents through Smart Glasses
Xiaoan Liu, DaeHo Lee, Eric J Gonzalez, Mar Gonzalez-Franco, Ryo Suzuki

TL;DR
VisionClaw introduces an always-on AI system on smart glasses that perceives the environment and executes tasks through speech, enabling seamless, hands-free interaction and delegation in real-world scenarios.
Contribution
It presents a novel wearable AI agent that continuously perceives and acts in the environment, demonstrating improved efficiency and a shift towards opportunistic, delegated interactions.
Findings
Faster task completion compared to non-always-on baselines.
Reduced interaction overhead in real-world tasks.
Shift towards opportunistic task initiation and delegation.
Abstract
We present VisionClaw, an always-on wearable AI agent that integrates live egocentric perception with agentic task execution. Running on Meta Ray-Ban smart glasses, VisionClaw continuously perceives real-world context and enables in-situ, speech-driven action initiation and delegation via OpenClaw AI agents. Therefore, users can directly execute tasks through the smart glasses, such as adding real-world objects to an Amazon cart, generating notes from physical documents, receiving meeting briefings on the go, creating events from posters, or controlling IoT devices. We evaluate VisionClaw through a controlled laboratory study (N=12) and a longitudinal deployment study (N=5). Results show that integrating perception and execution enables faster task completion and reduces interaction overhead compared to non-always-on and non-agent baselines. Beyond performance gains, deployment findings…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
