Generative Partial Visual-Tactile Fused Object Clustering
Tao Zhang, Yang Cong, Gan Sun, Jiahua Dong, Yuyang Liu and, Zhengming Ding

TL;DR
This paper introduces GPVTF, a generative framework that addresses partial data issues in visual-tactile object clustering by synthesizing missing modality data through adversarial learning, improving clustering accuracy.
Contribution
The paper proposes a novel generative adversarial network that synthesizes missing visual or tactile data conditioned on the other modality for improved partial multi-modal clustering.
Findings
Outperforms existing methods on three public datasets
Effectively compensates for missing modality data
Enhances clustering performance with adversarially generated samples
Abstract
Visual-tactile fused sensing for object clustering has achieved significant progresses recently, since the involvement of tactile modality can effectively improve clustering performance. However, the missing data (i.e., partial data) issues always happen due to occlusion and noises during the data collecting process. This issue is not well solved by most existing partial multi-view clustering methods for the heterogeneous modality challenge. Naively employing these methods would inevitably induce a negative effect and further hurt the performance. To solve the mentioned challenges, we propose a Generative Partial Visual-Tactile Fused (i.e., GPVTF) framework for object clustering. More specifically, we first do partial visual and tactile features extraction from the partial visual and tactile data, respectively, and encode the extracted features in modality-specific feature subspaces. A…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTactile and Sensory Interactions · Visual Attention and Saliency Detection · Gaze Tracking and Assistive Technology
