Translating Video Recordings of Mobile App Usages into Replayable Scenarios
Carlos Bernal-C\'ardenas, Nathan Cooper, Kevin Moran, Oscar Chaparro,, Andrian Marcus, Denys Poshyvanyk

TL;DR
This paper presents V2S, an automated computer vision-based system that translates mobile app screen recordings into replayable scenarios, aiding developers in analyzing user interactions and reproducing bugs efficiently.
Contribution
V2S is a novel lightweight approach that converts Android app screen recordings into executable test scenarios using computer vision techniques, addressing challenges of analyzing graphical software artifacts.
Findings
V2S accurately replays approximately 89% of test scenarios from videos.
V2S can process 175 videos with 3,534 GUI actions effectively.
Case studies show potential usefulness for mobile developers.
Abstract
Screen recordings of mobile applications are easy to obtain and capture a wealth of information pertinent to software developers (e.g., bugs or feature requests), making them a popular mechanism for crowdsourced app feedback. Thus, these videos are becoming a common artifact that developers must manage. In light of unique mobile development constraints, including swift release cycles and rapidly evolving platforms, automated techniques for analyzing all types of rich software artifacts provide benefit to mobile developers. Unfortunately, automatically analyzing screen recordings presents serious challenges, due to their graphical nature, compared to other types of (textual) artifacts. To address these challenges, this paper introduces V2S, a lightweight, automated approach for translating video recordings of Android app usages into replayable scenarios. V2S is based primarily on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
