VisBuddy -- A Smart Wearable Assistant for the Visually Challenged

Ishwarya Sivakumar; Nishaali Meenakshisundaram; Ishwarya Ramesh,; Shiloah Elizabeth D; Sunil Retmin Raj C

arXiv:2108.07761·cs.CV·January 5, 2022

VisBuddy -- A Smart Wearable Assistant for the Visually Challenged

Ishwarya Sivakumar, Nishaali Meenakshisundaram, Ishwarya Ramesh,, Shiloah Elizabeth D, Sunil Retmin Raj C

PDF

Open Access

TL;DR

VisBuddy is a cost-effective, voice-controlled wearable assistant that combines deep learning and IoT to aid visually challenged individuals in navigation, object recognition, reading, and accessing news through image captioning, OCR, and object detection.

Contribution

The paper introduces VisBuddy, a novel integrated wearable assistant that leverages deep learning and IoT for comprehensive support to the visually impaired, addressing limitations of existing systems.

Findings

01

Successfully integrates image captioning, OCR, and object detection in a wearable device.

02

Provides a cost-efficient, all-in-one solution for daily assistance.

03

Enhances independence for the visually challenged.

Abstract

Vision plays a crucial role in comprehending the world around us. More than 85% of the external information is obtained through the vision system. It influences our mobility, cognition, information access, and interaction with the environment and other people. Blindness prevents a person from gaining knowledge of the surrounding environment and makes unassisted navigation, object recognition, obstacle avoidance, and reading tasks significant challenges. Many existing systems are often limited by cost and complexity. To help the visually challenged overcome these difficulties faced in everyday life, we propose VisBuddy, a smart assistant to help the visually challenged with their day-to-day activities. VisBuddy is a voice-based assistant where the user can give voice commands to perform specific tasks. It uses the techniques of image captioning for describing the user's surroundings,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Tactile and Sensory Interactions · Advanced Neural Network Applications