Visual Tactile Fusion Object Clustering

Tao Zhang; Yang Cong; Gan Sun; Qianqian Wang; Zhenming Ding

arXiv:1911.09430·cs.LG·November 22, 2019

Visual Tactile Fusion Object Clustering

Tao Zhang, Yang Cong, Gan Sun, Qianqian Wang, Zhenming Ding

PDF

Open Access

TL;DR

This paper introduces a deep auto-encoder-based non-negative matrix factorization framework that fuses visual and tactile data for improved object clustering, leveraging hierarchical feature learning and data alignment techniques.

Contribution

It proposes a novel deep fusion clustering method combining visual and tactile modalities with a graph regularizer and a modality alignment strategy.

Findings

01

Enhanced clustering performance on public datasets.

02

Effective integration of tactile information improves object grouping.

03

Robustness demonstrated through extensive experiments.

Abstract

Object clustering, aiming at grouping similar objects into one cluster with an unsupervised strategy, has been extensivelystudied among various data-driven applications. However, most existing state-of-the-art object clustering methods (e.g., single-view or multi-view clustering methods) only explore visual information, while ignoring one of most important sensing modalities, i.e., tactile information which can help capture different object properties and further boost the performance of object clustering task. To effectively benefit both visual and tactile modalities for object clustering, in this paper, we propose a deep Auto-Encoder-like Non-negative Matrix Factorization framework for visual-tactile fusion clustering. Specifically, deep matrix factorization constrained by an under-complete Auto-Encoder-like architecture is employed to jointly learn hierarchical expression of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection · Tactile and Sensory Interactions · Video Surveillance and Tracking Methods