Semantic scene synthesis: Application to assistive systems

Chayma Zatout; Slimane Larabi

arXiv:2008.03685·cs.CV·April 22, 2021

Semantic scene synthesis: Application to assistive systems

Chayma Zatout, Slimane Larabi

PDF

TL;DR

This paper presents a method for semantic scene synthesis from a single depth image to assist visually impaired individuals, combining deep learning classification with tactile-based scene representation.

Contribution

It introduces a novel approach that classifies scene segments and encodes them with semantic labels inspired by Braille and Kanji, enabling tactile scene synthesis.

Findings

01

Successfully predicts over 17 object classes

02

Effective on noisy and incomplete depth data

03

Provides tactile scene representations for assistive use

Abstract

The aim of this work is to provide a semantic scene synthesis from a single depth image. This is used in assistive aid systems for visually impaired and blind people that allow them to understand their surroundings by the touch sense. The fact that blind people use touch to recognize objects and rely on listening to replace sight, motivated us to propose this work. First, the acquired depth image is segmented and each segment is classified in the context of assistive systems using a deep learning network. Second, inspired by the Braille system and the Japanese writing system Kanji, the obtained classes are coded with semantic labels. The scene is then synthesized using these labels and the extracted geometric features. Our system is able to predict more than 17 classes only by understanding the provided illustrative labels. For the remaining objects, their geometric features are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.