Static or Temporal? Semantic Scene Simplification to Aid Wayfinding in Immersive Simulations of Bionic Vision
Justin M. Kasowski, Apurv Varshney, Michael Beyeler

TL;DR
This study compares two semantic scene simplification methods in immersive virtual reality to improve wayfinding for simulated bionic vision, demonstrating that both enhance performance and user experience in cluttered environments.
Contribution
It introduces and evaluates two novel semantic preprocessing approaches, SemanticEdges and SemanticRaster, for aiding navigation in prosthetic vision simulations.
Findings
Both methods improved navigation success over baseline.
SemanticEdges increased success odds; SemanticRaster reduced collisions.
Adaptive semantic preprocessing benefits low-bandwidth visual interfaces.
Abstract
Visual neuroprostheses (bionic eye) aim to restore a rudimentary form of vision by translating camera input into patterns of electrical stimulation. To improve scene understanding under extreme resolution and bandwidth constraints, prior work has explored computer vision techniques such as semantic segmentation and depth estimation. However, presenting all task-relevant information simultaneously can overwhelm users in cluttered environments. We compare two complementary approaches to semantic preprocessing in immersive virtual reality: SemanticEdges, which highlights all relevant objects at once, and SemanticRaster, which staggers object categories over time to reduce visual clutter. Using a biologically grounded simulation of prosthetic vision, 18 sighted participants performed a wayfinding task in a dynamic urban environment across three conditions: edge-based baseline (Control),…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · 3D Surveying and Cultural Heritage · Human Pose and Action Recognition
