Clarifying the Half Full or Half Empty Question: Multimodal Container   Classification

Josua Spisak; Matthias Kerzel; and Stefan Wermter

arXiv:2307.08471·cs.RO·July 18, 2023

Clarifying the Half Full or Half Empty Question: Multimodal Container Classification

Josua Spisak, Matthias Kerzel, and Stefan Wermter

PDF

Open Access

TL;DR

This paper evaluates multimodal data fusion techniques for robotic container classification, demonstrating that combining visual, tactile, and proprioceptive data significantly improves accuracy over single-modality approaches.

Contribution

It compares and analyzes three different multimodal fusion strategies in a robotic context, highlighting the advantages of multimodal integration for classification tasks.

Findings

01

Multimodal fusion improves classification accuracy by 15%.

02

Different fusion strategies have varying effectiveness depending on data timing.

03

Multimodal integration outperforms single-sense approaches in robotic perception.

Abstract

Multimodal integration is a key component of allowing robots to perceive the world. Multimodality comes with multiple challenges that have to be considered, such as how to integrate and fuse the data. In this paper, we compare different possibilities of fusing visual, tactile and proprioceptive data. The data is directly recorded on the NICOL robot in an experimental setup in which the robot has to classify containers and their content. Due to the different nature of the containers, the use of the modalities can wildly differ between the classes. We demonstrate the superiority of multimodal solutions in this use case and evaluate three fusion strategies that integrate the data at different time steps. We find that the accuracy of the best fusion strategy is 15% higher than the best strategy using only one singular sense.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Chemical Sensor Technologies · Robot Manipulation and Learning