Multimodal Visual-haptic pose estimation in the presence of transient occlusion
Michael Zechmair, Yannick Morel

TL;DR
This paper presents a multimodal approach combining vision and capacitive sensing with a modified observer model to improve human pose estimation during transient occlusion in human-robot collaboration.
Contribution
It introduces a novel combination of deep predictive coding vision and capacitive sensing with a modified Luenberger observer for robust pose estimation.
Findings
Outperforms single-sensor methods in occlusion scenarios
Effective in estimating human forearm pose during transient occlusion
Demonstrates improved safety in human-robot collaboration environments
Abstract
Human-robot collaboration requires the establishment of methods to guarantee the safety of participating operators. A necessary part of this process is ensuring reliable human pose estimation. Established vision-based modalities encounter problems when under conditions of occlusion. This article describes the combination of two perception modalities for pose estimation in environments containing such transient occlusion. We first introduce a vision-based pose estimation method, based on a deep Predictive Coding (PC) model featuring robustness to partial occlusion. Next, capacitive sensing hardware capable of detecting various objects is introduced. The sensor is compact enough to be mounted on the exterior of any given robotic system. The technology is particularly well-suited to detection of capacitive material, such as living tissue. Pose estimation from the two individual sensing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Surveying and Cultural Heritage · Teleoperation and Haptic Systems · Tactile and Sensory Interactions
