DexAssist: A Voice-Enabled Dual-LLM Framework for Accessible Web Navigation
Shridhar Mehendale, Ankit Walishetti

TL;DR
DexAssist introduces a dual-LLM framework that enhances web navigation accessibility for users with motor impairments by iteratively interpreting commands and assessing success, significantly improving accuracy over previous methods.
Contribution
This paper presents a novel dual-LLM system for accessible web navigation, demonstrating improved reliability and real-time error correction in interface control.
Findings
~36 percentage points increase in accuracy after first iteration
Effective real-time error resolution in web navigation tasks
Initial evaluation on 3 e-commerce websites
Abstract
Individuals with fine motor impairments, such as those caused by conditions like Parkinson's disease, cerebral palsy, or dyspraxia, face significant challenges in interacting with traditional computer interfaces. Historically, scripted automation has offered some assistance, but these solutions are often too rigid and task-specific, failing to adapt to the diverse needs of users. The advent of Large Language Models (LLMs) promised a more flexible approach, capable of interpreting natural language commands to navigate complex user interfaces. However, current LLMs often misinterpret user intent and have no fallback measures when user instructions do not directly align with the specific wording used in the Document Object Model (DOM). This research presents Dexterity Assist (DexAssist), a dual-LLM system designed to improve the reliability of automated user interface control. Both LLMs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Accessibility for Disabilities · Text Readability and Simplification · Web Data Mining and Analysis
