Can ChatGPT assist visually impaired people with micro-navigation?
Junxian He, Shrinivas Pundlik, Gang Luo

TL;DR
This study evaluates ChatGPT's ability to assist visually impaired individuals with micro-navigation using scene images and descriptions, highlighting current limitations and potential improvements for scene understanding in navigation tasks.
Contribution
It introduces a novel evaluation of ChatGPT's micro-navigation assistance capabilities with multimodal inputs and analyzes how prompt instructions affect performance.
Findings
ChatGPT's accuracy is higher when it is given text scene descriptions than when it is given scene images.
Prompt instructions on how to handle unanswerable questions increase specificity.
Current ChatGPT has limitations in scene understanding for navigation.
Abstract
Objective: Micro-navigation poses challenges for blind and visually impaired individuals. They often need to ask for sighted assistance. We explored the feasibility of utilizing ChatGPT as a virtual assistant to provide navigation directions. Methods: We created a test set of outdoor and indoor micro-navigation scenarios consisting of 113 scene images and their human-generated text descriptions. A total of 412 way-finding queries and their expected responses were compiled based on the scenarios. Not all queries are answerable based on the information available in the scene image. An "I do not know" response was expected for unanswerable queries, which served as negative cases. High-level orientation responses were expected, and step-by-step guidance was not required. ChatGPT 4o was evaluated based on sensitivity (SEN) and specificity (SPE) under different conditions. Results: The default…
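The two metrics named in the abstract can be illustrated with a minimal sketch. It assumes that sensitivity is the fraction of answerable queries answered correctly and that specificity is the fraction of unanswerable queries that correctly received an "I do not know" response; the summary does not spell out the paper's exact scoring rules, so this is one plausible reading rather than the authors' procedure. The `Query` class, the function name, and the toy data are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass
from typing import Iterable, Tuple


@dataclass
class Query:
    """One way-finding query with its graded outcome (hypothetical structure)."""
    answerable: bool  # False: the expected response is "I do not know"
    correct: bool     # True: the model's response matched the expected response


def sensitivity_specificity(queries: Iterable[Query]) -> Tuple[float, float]:
    """Return (SEN, SPE) under the assumptions stated in the lead-in.

    SEN = correctly answered answerable queries / all answerable queries
    SPE = correctly declined unanswerable queries / all unanswerable queries
    """
    queries = list(queries)
    positives = [q for q in queries if q.answerable]
    negatives = [q for q in queries if not q.answerable]
    # Guard against an empty class so the function never divides by zero.
    sen = sum(q.correct for q in positives) / len(positives) if positives else float("nan")
    spe = sum(q.correct for q in negatives) / len(negatives) if negatives else float("nan")
    return sen, spe


# Toy data, made up for illustration only (not results from the study).
toy = [
    Query(answerable=True, correct=True),
    Query(answerable=True, correct=True),
    Query(answerable=True, correct=True),
    Query(answerable=True, correct=False),
    Query(answerable=False, correct=True),
    Query(answerable=False, correct=False),
]
sen, spe = sensitivity_specificity(toy)
print(f"SEN={sen:.2f}, SPE={spe:.2f}")
```

Keeping the two rates separate matters here because a model that always guesses a direction could score well on sensitivity while failing every negative case, which is the failure mode the "I do not know" queries are meant to expose.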
Taxonomy
Topics: Tactile and Sensory Interactions
