DRAGON: A Dialogue-Based Robot for Assistive Navigation with Visual Language Grounding
Shuijing Liu, Aamir Hasan, Kaiwen Hong, Runxuan Wang, Peixin Chang,, Zachary Mizrachi, Justin Lin, D. Livingston McPherson, Wendy A. Rogers, and, Katherine Driggs-Campbell

TL;DR
DRAGON is a dialogue-based assistive robot that helps visually impaired users navigate and understand their environment through natural language communication and visual grounding, improving accessibility and user experience.
Contribution
This paper introduces DRAGON, a novel robot system combining dialogue and visual-language grounding for assistive navigation, which is a new approach in this domain.
Findings
Effective communication with users in indoor environments
Successful grounding of free-form descriptions to environmental landmarks
Positive user study results demonstrating improved navigation experience
Abstract
Persons with visual impairments (PwVI) have difficulties understanding and navigating spaces around them. Current wayfinding technologies either focus solely on navigation or provide limited communication about the environment. Motivated by recent advances in visual-language grounding and semantic navigation, we propose DRAGON, a guiding robot powered by a dialogue system and the ability to associate the environment with natural language. By understanding the commands from the user, DRAGON is able to guide the user to the desired landmarks on the map, describe the environment, and answer questions from visual observations. Through effective utilization of dialogue, the robot can ground the user's free-form descriptions to landmarks in the environment, and give the user semantic information through spoken language. We conduct a user study with blindfolded participants in an everyday…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Tactile and Sensory Interactions · Robotics and Sensor-Based Localization
MethodsFocus
