NaviQAte: Functionality-Guided Web Application Navigation
Mobina Shahbandeh, Parsa Alian, Noor Nashid, Ali Mesbah

TL;DR
NaviQAte introduces a functionality-guided approach to web application navigation using large language models, significantly improving success rates in automated testing without relying on detailed task descriptions.
Contribution
It presents a novel three-phase method leveraging GPT-4 models to explore web functionalities effectively, surpassing existing methods like WebCanvas.
Findings
Achieves 44.23% success in user task navigation
Attains 38.46% success in functionality navigation
Improves over WebCanvas by 15% and 33% respectively
Abstract
End-to-end web testing is challenging due to the need to explore diverse web application functionalities. Current state-of-the-art methods, such as WebCanvas, are not designed for broad functionality exploration; they rely on specific, detailed task descriptions, limiting their adaptability in dynamic web environments. We introduce NaviQAte, which frames web application exploration as a question-and-answer task, generating action sequences for functionalities without requiring detailed parameters. Our three-phase approach utilizes advanced large language models like GPT-4o for complex decision-making and cost-effective models, such as GPT-4o mini, for simpler tasks. NaviQAte focuses on functionality-guided web application navigation, integrating multi-modal inputs such as text and images to enhance contextual understanding. Evaluations on the Mind2Web-Live and Mind2Web-Live-Abstracted…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWeb Applications and Data Management · Service-Oriented Architecture and Web Services · Web Data Mining and Analysis
