Towards Scalable Web Accessibility Audit with MLLMs as Copilots
Ming Gu, Ziwei Wang, Sicen Lai, Zirui Gao, Sheng Zhou, Jiajun Bu

TL;DR
This paper introduces AAA, a scalable web accessibility auditing framework that combines multimodal sampling and large language model copilots to enhance efficiency and coverage in compliance evaluation.
Contribution
The work presents a novel human-AI partnership model for web accessibility auditing, integrating GRASP sampling and MaC LLM copilots, along with four new benchmarking datasets.
Findings
Effective in scaling web accessibility audits
Small language models can serve as capable experts when fine-tuned
Demonstrates improved coverage and efficiency in auditing processes
Abstract
Ensuring web accessibility is crucial for advancing social welfare, justice, and equality in digital spaces, yet the vast majority of website user interfaces remain non-compliant, due in part to the resource-intensive and unscalable nature of current auditing practices. While WCAG-EM offers a structured methodology for site-wise conformance evaluation, it involves great human efforts and lacks practical support for execution at scale. In this work, we present an auditing framework, AAA, which operationalizes WCAG-EM through a human-AI partnership model. AAA is anchored by two key innovations: GRASP, a graph-based multimodal sampling method that ensures representative page coverage via learned embeddings of visual, textual, and relational cues; and MaC, a multimodal large language model-based copilot that supports auditors through cross-modal reasoning and intelligent assistance in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsDigital Accessibility for Disabilities · Text Readability and Simplification · Subtitles and Audiovisual Media
