Learning Multimodal Confidence for Intention Recognition in Human-Robot   Interaction

Xiyuan Zhao; Huijun Li; Tianyuan Miao; Xianyi Zhu; Zhikai Wei and; Aiguo Song

arXiv:2405.14116·cs.RO·August 15, 2024

Learning Multimodal Confidence for Intention Recognition in Human-Robot Interaction

Xiyuan Zhao, Huijun Li, Tianyuan Miao, Xianyi Zhu, Zhikai Wei and, Aiguo Song

PDF

Open Access

TL;DR

This paper introduces a novel multimodal fusion framework, BMCLOP, that enhances intention recognition accuracy and reliability in human-robot interaction by combining Bayesian fusion with confidence learning, validated on a robot with multiple modalities.

Contribution

The paper proposes a new learning-based multimodal fusion framework, BMCLOP, that improves intention recognition accuracy and uncertainty reduction in human-robot collaboration.

Findings

01

BMCLOP outperforms baseline methods in intention recognition accuracy.

02

The framework effectively reduces uncertainty in multimodal intention recognition.

03

Validated on a six-DoF robot with gestures, speech, and gaze modalities.

Abstract

The rapid development of collaborative robotics has provided a new possibility of helping the elderly who has difficulties in daily life, allowing robots to operate according to specific intentions. However, efficient human-robot cooperation requires natural, accurate and reliable intention recognition in shared environments. The current paramount challenge for this is reducing the uncertainty of multimodal fused intention to be recognized and reasoning adaptively a more reliable result despite current interactive condition. In this work we propose a novel learning-based multimodal fusion framework Batch Multimodal Confidence Learning for Opinion Pool (BMCLOP). Our approach combines Bayesian multimodal fusion method and batch confidence learning algorithm to improve accuracy, uncertainty reduction and success rate given the interactive condition. In particular, the generic and practical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition