Joint Engagement Classification using Video Augmentation Techniques for   Multi-person Human-robot Interaction

Yubin Kim; Huili Chen; Sharifa Alghowinem; Cynthia Breazeal; and Hae; Won Park

arXiv:2212.14128·cs.CV·January 2, 2023·1 cites

Joint Engagement Classification using Video Augmentation Techniques for Multi-person Human-robot Interaction

Yubin Kim, Huili Chen, Sharifa Alghowinem, Cynthia Breazeal, and Hae, Won Park

PDF

Open Access

TL;DR

This paper introduces a hybrid deep learning framework utilizing video augmentation techniques to improve joint engagement recognition in multi-person human-robot interactions, specifically focusing on parent-child dyads in a home setting.

Contribution

It presents a novel combination of video augmentation methods with deep learning models for joint engagement classification in multi-person interactions, enhancing interpretability and performance.

Findings

01

Augmentation techniques improved classification accuracy.

02

Models successfully recognized joint engagement in real interaction scenarios.

03

Introduced a behavior-based metric for model interpretability.

Abstract

Affect understanding capability is essential for social robots to autonomously interact with a group of users in an intuitive and reciprocal way. However, the challenge of multi-person affect understanding comes from not only the accurate perception of each user's affective state (e.g., engagement) but also the recognition of the affect interplay between the members (e.g., joint engagement) that presents as complex, but subtle, nonverbal exchanges between them. Here we present a novel hybrid framework for identifying a parent-child dyad's joint engagement by combining a deep learning framework with various video augmentation techniques. Using a dataset of parent-child dyads reading storybooks together with a social robot at home, we first train RGB frame- and skeleton-based joint engagement recognition models with four video augmentation techniques (General Aug, DeepFake, CutOut, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEmotion and Mood Recognition · Social Robot Interaction and HRI · Human Pose and Action Recognition