Loading paper
GUIDE: Interpretable GUI Agent Evaluation via Hierarchical Diagnosis | Tomesphere