Invisible-to-Visible: Privacy-Aware Human Instance Segmentation using Airborne Ultrasound via Collaborative Learning Variational Autoencoder
Risako Tanigawa, Yasunori Ishii, Kazuki Kozuka, Takayoshi Yamashita

TL;DR
This paper introduces a privacy-preserving human instance segmentation method using airborne ultrasound and collaborative learning variational autoencoders, enabling action recognition without camera images.
Contribution
It proposes a novel task and a CL-VAE model that learns from sound and RGB images during training to perform segmentation solely from sound images at inference.
Findings
CL-VAE outperforms conventional VAEs in segmentation accuracy
The method enables privacy-preserving human action recognition
Sound images can be effectively used for human segmentation
Abstract
In action understanding in indoor, we have to recognize human pose and action considering privacy. Although camera images can be used for highly accurate human action recognition, camera images do not preserve privacy. Therefore, we propose a new task for human instance segmentation from invisible information, especially airborne ultrasound, for action recognition. To perform instance segmentation from invisible information, we first convert sound waves to reflected sound directional images (sound images). Although the sound images can roughly identify the location of a person, the detailed shape is ambiguous. To address this problem, we propose a collaborative learning variational autoencoder (CL-VAE) that simultaneously uses sound and RGB images during training. In inference, it is possible to obtain instance segmentation results only from sound images. As a result of performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGait Recognition and Analysis · Human Pose and Action Recognition · Advanced Neural Network Applications
