VisionClaw: Always-On AI Agents through Smart Glasses

Xiaoan Liu; DaeHo Lee; Eric J Gonzalez; Mar Gonzalez-Franco; Ryo Suzuki

arXiv:2604.03486 · cs.HC · April 9, 2026

TL;DR

VisionClaw is an always-on AI agent running on smart glasses that continuously perceives the user's surroundings and executes tasks initiated by speech, enabling hands-free interaction and task delegation in real-world settings.

Contribution

The paper presents a wearable AI agent that continuously perceives the environment and acts on it, and reports faster task completion together with a shift toward opportunistic, delegated interaction.

Findings

01

Faster task completion than non-always-on and non-agent baselines.

02

Reduced interaction overhead in real-world tasks.

03

Shift towards opportunistic task initiation and delegation.

Abstract

We present VisionClaw, an always-on wearable AI agent that integrates live egocentric perception with agentic task execution. Running on Meta Ray-Ban smart glasses, VisionClaw continuously perceives real-world context and enables in-situ, speech-driven action initiation and delegation via OpenClaw AI agents. Therefore, users can directly execute tasks through the smart glasses, such as adding real-world objects to an Amazon cart, generating notes from physical documents, receiving meeting briefings on the go, creating events from posters, or controlling IoT devices. We evaluate VisionClaw through a controlled laboratory study (N=12) and a longitudinal deployment study (N=5). Results show that integrating perception and execution enables faster task completion and reduces interaction overhead compared to non-always-on and non-agent baselines. Beyond performance gains, deployment findings…
