VisBuddy -- A Smart Wearable Assistant for the Visually Challenged
Ishwarya Sivakumar, Nishaali Meenakshisundaram, Ishwarya Ramesh, Shiloah Elizabeth D, Sunil Retmin Raj C

TL;DR
VisBuddy is a cost-effective, voice-controlled wearable assistant that combines deep learning and IoT to aid visually challenged individuals in navigation, object recognition, reading, and accessing news through image captioning, OCR, and object detection.
Contribution
The paper introduces VisBuddy, an integrated wearable assistant that uses deep learning and IoT to support the visually impaired, addressing the cost and complexity limitations of existing systems.
Findings
Successfully integrates image captioning, OCR, and object detection in a wearable device.
Provides a cost-efficient, all-in-one solution for daily assistance.
Enhances independence for the visually challenged.
Abstract
Vision plays a crucial role in comprehending the world around us. More than 85% of external information is obtained through the visual system. It influences our mobility, cognition, information access, and interaction with the environment and other people. Blindness prevents a person from gaining knowledge of the surrounding environment and makes unassisted navigation, object recognition, obstacle avoidance, and reading significant challenges. Many existing systems are limited by cost and complexity. To help overcome these everyday difficulties, we propose VisBuddy, a smart assistant that supports the visually challenged in their day-to-day activities. VisBuddy is a voice-based assistant to which the user gives voice commands to perform specific tasks. It uses the techniques of image captioning for describing the user's surroundings,…
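The voice-command design described in the abstract can be illustrated with a minimal sketch. It assumes the spoken command has already been transcribed to text and routes it to one of the three techniques named in this summary (image captioning, OCR, object detection). The keyword table, the function names, and the returned labels are hypothetical placeholders for illustration and are not taken from the paper.

```python
from typing import Callable, Tuple

# Hypothetical placeholder handlers. A real system would run the
# captioning, OCR, or detection model here and speak the result.
def describe_surroundings() -> str:
    return "image_captioning"

def read_text() -> str:
    return "ocr"

def find_objects() -> str:
    return "object_detection"

# Hypothetical keyword table (an assumption, not the paper's command set).
ROUTES: Tuple[Tuple[Tuple[str, ...], Callable[[], str]], ...] = (
    (("describe", "surroundings", "around"), describe_surroundings),
    (("read", "text"), read_text),
    (("find", "detect", "object"), find_objects),
)

def route_command(transcript: str) -> str:
    """Pick a handler for a transcribed voice command by keyword match."""
    words = set(transcript.lower().split())
    for keywords, handler in ROUTES:
        # The first route sharing any keyword with the command wins.
        if words.intersection(keywords):
            return handler()
    return "unknown"
```

For example, `route_command("Please read the sign")` selects the OCR placeholder, while a command with no matching keyword falls through to `"unknown"`, where a real assistant might ask the user to repeat the request.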
Taxonomy
Topics: Multimodal Machine Learning Applications · Tactile and Sensory Interactions · Advanced Neural Network Applications
