A Novel mapping for visual to auditory sensory substitution

Ezsan Mehrbani; Sezedeh Fatemeh Mirhoseini; Noushin Riahi

arXiv:2106.07448·cs.SD·June 17, 2021

TL;DR

This paper introduces a new method for converting visual features of the environment (object coordinates, type, and size) into musical-tone audio cues for sensory substitution. It reports better training-time efficiency than the authors' previous method and 88.05% average accuracy in blind object recognition.

Contribution

The study presents a novel mapping from visual features to musical-tone features that is more training-time efficient than VBTones, the authors' previous method based on sinusoidal tones.

Findings

01

Higher training-time efficiency than the authors' previous method, VBTones

02

Achieved 88.05% average accuracy in blind recognition of real objects

03

Effective conversion of visual features to audio cues

Abstract

Visual information can be converted into an audio stream by sensory substitution devices, giving visually impaired people a way to perceive their surroundings easily while performing everyday tasks. In this study, visual features of the environment, namely the coordinates, type, and size of objects, are assigned to audio features related to musical tones, such as frequency, time duration, and note permutations. Results demonstrate that this new method is more training-time efficient than our previous method, VBTones, in which sinusoidal tones were used. Moreover, blind object recognition for real objects reached an average accuracy of 88.05%.
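The abstract names the visual features (coordinate, type, size) and the audio features (frequency, time duration, note permutations) but does not say which is assigned to which. The sketch below is one hypothetical pairing, given only to make the idea concrete: vertical position sets the octave, size sets the note duration, and object type selects a permutation of a three-note motif. The scale, the object classes, the grid layout, and the names `VisualObject` and `to_audio_cue` are all assumptions for illustration, not the authors' mapping.

```python
from dataclasses import dataclass
from itertools import permutations

# Hypothetical motif: equal-tempered frequencies (Hz) for C4, E4, G4.
BASE_NOTES_HZ = (261.63, 329.63, 392.00)

# Hypothetical object classes; each gets its own ordering of the three notes.
OBJECT_TYPES = ("cup", "phone", "key")
NOTE_ORDERS = dict(zip(OBJECT_TYPES, permutations(range(3))))


@dataclass
class VisualObject:
    kind: str     # object type, one of OBJECT_TYPES
    row: int      # vertical grid cell, 0 = lowest (assumed layout)
    size: float   # relative size in [0, 1]


def to_audio_cue(obj: VisualObject) -> list[tuple[float, float]]:
    """Return (frequency_hz, duration_s) pairs for one object."""
    # Assumption: each row upward shifts the motif up by one octave.
    octave_shift = 2 ** obj.row
    # Assumption: larger objects give longer notes, from 0.1 s to 0.5 s.
    duration = 0.1 + 0.4 * max(0.0, min(1.0, obj.size))
    # Assumption: the object type selects the order of the notes.
    order = NOTE_ORDERS[obj.kind]
    return [(BASE_NOTES_HZ[i] * octave_shift, duration) for i in order]


if __name__ == "__main__":
    for freq, dur in to_audio_cue(VisualObject("phone", row=1, size=0.5)):
        print(f"{freq:.2f} Hz for {dur:.2f} s")
```

Encoding the type as a note order rather than as a separate pitch keeps the three features independent in this sketch: changing an object's position or size never changes which permutation a listener hears.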

Taxonomy

Topics: Tactile and Sensory Interactions · Hearing Loss and Rehabilitation · Speech and Audio Processing