TL;DR
StateLens is a three-part reverse engineering system that makes dynamic touchscreens accessible to blind users by reconstructing interface state diagrams, guiding task execution, and preventing accidental touches.
Contribution
It introduces a novel three-part approach combining computer vision, conversational guidance, and physical accessories to enable blind interaction with inaccessible touchscreens.
Findings
Accurately reconstructs interface state diagrams from videos
Enables blind users to access dynamic touchscreens effectively
Reduces accidental touches with 3D-printed accessories
Abstract
Blind people frequently encounter inaccessible dynamic touchscreens in their everyday lives that are difficult, frustrating, and often impossible to use independently. Touchscreens are often the only way to control everything from coffee machines and payment terminals to subway ticket machines and in-flight entertainment systems. Interacting with dynamic touchscreens non-visually is difficult because the visual user interfaces change, interactions often occur over multiple different screens, and it is easy to accidentally trigger interface actions while exploring the screen. To solve these problems, we introduce StateLens, a three-part reverse engineering solution that makes existing dynamic touchscreens accessible. First, StateLens reverse engineers the underlying state diagrams of existing interfaces using point-of-view videos found online or taken by users using a hybrid…
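To make the idea of a state diagram concrete, here is a minimal sketch of how a reconstructed diagram could be used to plan a task. It is an illustration only, not the paper's implementation: the screen names, button labels, the `STATE_DIAGRAM` dictionary, and the `plan_presses` helper are all hypothetical, and breadth-first search is an assumed planning strategy.

```python
from collections import deque

# Hypothetical state diagram: screen name -> {button label: resulting screen}.
# All screen and button names are invented for illustration.
STATE_DIAGRAM = {
    "home": {"Coffee": "size", "Tea": "size"},
    "size": {"Small": "confirm", "Large": "confirm", "Back": "home"},
    "confirm": {"Start": "brewing", "Back": "size"},
    "brewing": {},
}


def plan_presses(diagram, start, goal):
    """Return the shortest list of button labels from start to goal, or None."""
    queue = deque([(start, [])])
    visited = {start}
    while queue:
        screen, presses = queue.popleft()
        if screen == goal:
            return presses
        # Explore every button on the current screen (breadth-first).
        for button, next_screen in diagram.get(screen, {}).items():
            if next_screen not in visited:
                visited.add(next_screen)
                queue.append((next_screen, presses + [button]))
    return None  # goal is unreachable from start


if __name__ == "__main__":
    print(plan_presses(STATE_DIAGRAM, "home", "brewing"))
```

Under these assumptions, each button in the returned sequence could be announced to the user one step at a time as the interface moves from screen to screen.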
