Testing Human-Hand Segmentation on In-Distribution and Out-of-Distribution Data in Human-Robot Interactions Using a Deep Ensemble Model
Reza Jalayer, Yuxin Chen, Masoud Jalayer, Carlotta Orsenigo, Masayoshi Tomizuka

TL;DR
This paper evaluates the robustness of deep learning models for human hand segmentation in human-robot interactions across in-distribution and out-of-distribution scenarios, emphasizing the importance of context-specific training and uncertainty quantification.
Contribution
It introduces a comprehensive evaluation of pre-trained deep ensemble models on diverse ID and OOD datasets, highlighting the impact of training data context on model generalization in real-world scenarios.
Findings
Models trained on industrial datasets perform better in both ID and OOD scenarios.
All models show decreased performance on OOD data, but industrial-trained models generalize more effectively.
Uncertainty quantification helps identify challenging OOD cases.
Abstract
Reliable detection and segmentation of human hands are critical for enhancing safety and facilitating advanced interactions in human-robot collaboration. Current research predominantly evaluates hand segmentation under in-distribution (ID) data, which reflects the training data of deep learning (DL) models. However, this approach fails to address out-of-distribution (OOD) scenarios that often arise in real-world human-robot interactions. In this study, we present a novel approach by evaluating the performance of pre-trained DL models under both ID data and more challenging OOD scenarios. To mimic realistic industrial scenarios, we designed a diverse dataset featuring simple and cluttered backgrounds with industrial tools, varying numbers of hands (0 to 4), and hands with and without gloves. For OOD scenarios, we incorporated unique and rare conditions such as finger-crossing gestures…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHand Gesture Recognition Systems · Video Surveillance and Tracking Methods
MethodsBalanced Selection
