VisTaNet: Attention Guided Deep Fusion for Surface Roughness   Classification

Prasanna Kumar Routray; Aditya Sanjiv Kanade; Jay Bhanushali,; Manivannan Muniyandi

arXiv:2209.08516·cs.CV·September 20, 2022

VisTaNet: Attention Guided Deep Fusion for Surface Roughness Classification

Prasanna Kumar Routray, Aditya Sanjiv Kanade, Jay Bhanushali,, Manivannan Muniyandi

PDF

Open Access

TL;DR

This paper introduces VisTaNet, a deep fusion model combining visual and tactile data for surface roughness classification, achieving significant accuracy improvements by mimicking human multisensory perception.

Contribution

The paper proposes a novel attention-guided deep fusion architecture for visuotactile data, along with a new visual dataset, enhancing surface roughness classification performance.

Findings

01

Achieved 97.22% accuracy with the proposed model.

02

Attention-guided fusion outperforms other strategies.

03

Model mimics human multisensory weighting in texture perception.

Abstract

Human texture perception is a weighted average of multi-sensory inputs: visual and tactile. While the visual sensing mechanism extracts global features, the tactile mechanism complements it by extracting local features. The lack of coupled visuotactile datasets in the literature is a challenge for studying multimodal fusion strategies analogous to human texture perception. This paper presents a visual dataset that augments an existing tactile dataset. We propose a novel deep fusion architecture that fuses visual and tactile data using four types of fusion strategies: summation, concatenation, max-pooling, and attention. Our model shows significant performance improvements (97.22%) in surface roughness classification accuracy over tactile only (SVM - 92.60%) and visual only (FENet-50 - 85.01%) architectures. Among the several fusion techniques, attention-guided architecture results in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTactile and Sensory Interactions · Advanced Sensor and Energy Harvesting Materials · Visual Attention and Saliency Detection