AXNav: Replaying Accessibility Tests from Natural Language
Maryam Taeb, Amanda Swearngin, Eldon Schoop, Ruijia Cheng, Yue Jiang,, Jeffrey Nichols

TL;DR
This paper introduces AXNav, a system that uses large language models and UI understanding to automate accessibility testing, generating navigable videos with flagged issues, thus aiding QA professionals and improving testing efficiency.
Contribution
The paper presents a novel approach combining LLMs and pixel-based UI understanding to automate accessibility testing and generate informative videos for QA professionals.
Findings
Participants found the tool very useful for their work.
The system performed tests similarly to manual testing.
Insights for future LLM applications in accessibility testing.
Abstract
Developers and quality assurance testers often rely on manual testing to test accessibility features throughout the product lifecycle. Unfortunately, manual testing can be tedious, often has an overwhelming scope, and can be difficult to schedule amongst other development milestones. Recently, Large Language Models (LLMs) have been used for a variety of tasks including automation of UIs, however to our knowledge no one has yet explored their use in controlling assistive technologies for the purposes of supporting accessibility testing. In this paper, we explore the requirements of a natural language based accessibility testing workflow, starting with a formative study. From this we build a system that takes as input a manual accessibility test (e.g., ``Search for a show in VoiceOver'') and uses an LLM combined with pixel-based UI Understanding models to execute the test and produce a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Accessibility for Disabilities · Tactile and Sensory Interactions · Multimodal Machine Learning Applications
