SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents

Shaofei Cai; Yulei Qin; Haojia Lin; Zihan Xu; Gang Li; Yuchen Shi; Zongyi Li; Yong Mao; Siqi Cai; Xiaoyu Tan; Yitao Liang; Ke Li; Xing Sun

arXiv:2512.22322·cs.CL·January 7, 2026

SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents

Shaofei Cai, Yulei Qin, Haojia Lin, Zihan Xu, Gang Li, Yuchen Shi, Zongyi Li, Yong Mao, Siqi Cai, Xiaoyu Tan, Yitao Liang, Ke Li, Xing Sun

PDF

Open Access 4 Models 2 Datasets

TL;DR

SmartSnap introduces a proactive self-verification paradigm for autonomous agents, enabling them to prove task completion with minimal evidence, improving scalability and reliability in complex GUI tasks.

Contribution

It proposes a novel self-verifying agent framework guided by 3C principles, shifting from passive post-hoc verification to in-situ evidence-based self-assessment.

Findings

01

Achieves up to 26.08% performance improvement on mobile tasks.

02

Demonstrates scalability across different model sizes.

03

Outperforms existing verification methods in reliability and efficiency.

Abstract

Agentic reinforcement learning (RL) holds great promise for the development of autonomous agents under complex GUI tasks, but its scalability remains severely hampered by the verification of task completion. Existing task verification is treated as a passive, post-hoc process: a verifier (i.e., rule-based scoring script, reward or critic model, and LLM-as-a-Judge) analyzes the agent's entire interaction trajectory to determine if the agent succeeds. Such processing of verbose context that contains irrelevant, noisy history poses challenges to the verification protocols and therefore leads to prohibitive cost and low reliability. To overcome this bottleneck, we propose SmartSnap, a paradigm shift from this passive, post-hoc verification to proactive, in-situ self-verification by the agent itself. We introduce the Self-Verifying Agent, a new type of agent designed with dual missions: to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Reinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI)