Avgust: Automating Usage-Based Test Generation from Videos of App Executions
Yixue Zhao, Saghar Talebipour, Kesina Baral, Hyojae Park, Leon Yee,, Safwat Ali Khan, Yuriy Brun, Nenad Medvidovic, Kevin Moran

TL;DR
Avgust automates the creation of usage-based UI tests from app videos using neural models, enabling efficient test generation that aligns with developer preferences and improves test success rates.
Contribution
It introduces a novel neural-based approach to generate usage-based UI tests from videos, automating semantic understanding and test synthesis for mobile apps.
Findings
69% of generated tests successfully execute desired usage
Avgust's classifiers outperform existing methods
Effective on 374 videos of 18 popular apps
Abstract
Writing and maintaining UI tests for mobile apps is a time-consuming and tedious task. While decades of research have produced automated approaches for UI test generation, these approaches typically focus on testing for crashes or maximizing code coverage. By contrast, recent research has shown that developers prefer usage-based tests, which center around specific uses of app features, to help support activities such as regression testing. Very few existing techniques support the generation of such tests, as doing so requires automating the difficult task of understanding the semantics of UI screens and user inputs. In this paper, we introduce Avgust, which automates key steps of generating usage-based tests. Avgust uses neural models for image understanding to process video recordings of app uses to synthesize an app-agnostic state-machine encoding of those uses. Then, Avgust uses this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
