MLA-Trust: Benchmarking Trustworthiness of Multimodal LLM Agents in GUI Environments

Xiao Yang; Jiawei Chen; Jun Luo; Zhengwei Fang; Yinpeng Dong; Hang Su; Jun Zhu

arXiv:2506.01616·cs.AI·June 3, 2025

MLA-Trust: Benchmarking Trustworthiness of Multimodal LLM Agents in GUI Environments

Xiao Yang, Jiawei Chen, Jun Luo, Zhengwei Fang, Yinpeng Dong, Hang Su, Jun Zhu

PDF

Open Access

TL;DR

MLA-Trust introduces a comprehensive framework for evaluating trustworthiness of multimodal LLM agents in GUI environments, addressing unique challenges like safety, privacy, and control through extensive experiments and a new evaluation toolbox.

Contribution

This paper presents the first unified benchmark for assessing trustworthiness of multimodal LLM agents in interactive GUI settings, highlighting vulnerabilities and providing evaluation tools.

Findings

01

MLAs pose greater trustworthiness risks than static MLLMs in high-stakes domains.

02

Transition to interactive MLAs increases risks of harmful content generation.

03

Multi-step interactions can accumulate risks, bypassing safeguards.

Abstract

The emergence of multimodal LLM-based agents (MLAs) has transformed interaction paradigms by seamlessly integrating vision, language, action and dynamic environments, enabling unprecedented autonomous capabilities across GUI applications ranging from web automation to mobile systems. However, MLAs introduce critical trustworthiness challenges that extend far beyond traditional language models' limitations, as they can directly modify digital states and trigger irreversible real-world consequences. Existing benchmarks inadequately tackle these unique challenges posed by MLAs' actionable outputs, long-horizon uncertainty and multimodal attack vectors. In this paper, we introduce MLA-Trust, the first comprehensive and unified framework that evaluates the MLA trustworthiness across four principled dimensions: truthfulness, controllability, safety and privacy. We utilize websites and mobile…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAccess Control and Trust · Multi-Agent Systems and Negotiation · Cloud Data Security Solutions