Diffusion-Based Imitation Learning for Social Pose Generation
Antonio Lech Martin-Ozimek, Isuru Jayarathne, Su Larb Mon, Jouh Yeong, Chew

TL;DR
This paper introduces a diffusion-based imitation learning approach to generate social pose behaviors for facilitators in human-robot interactions, comparing different scene observation representations for efficiency and realism.
Contribution
It adapts diffusion behavior cloning for social pose generation and evaluates the impact of pre-processing on model performance and computational load.
Findings
Pre-processed pose data improves social behavior realism.
Trade-off observed between inference time and accuracy.
Diffusion models can effectively generate social cues.
Abstract
Intelligent agents, such as robots and virtual agents, must understand the dynamics of complex social interactions to interact with humans. Effectively representing social dynamics is challenging because we require multi-modal, synchronized observations to understand a scene. We explore how using a single modality, the pose behavior, of multiple individuals in a social interaction can be used to generate nonverbal social cues for the facilitator of that interaction. The facilitator acts to make a social interaction proceed smoothly and is an essential role for intelligent agents to replicate in human-robot interactions. In this paper, we adapt an existing diffusion behavior cloning model to learn and replicate facilitator behaviors. Furthermore, we evaluate two representations of pose observations from a scene, one representation has pre-processing applied and one does not. The purpose…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Human Pose and Action Recognition · Robot Manipulation and Learning
MethodsDiffusion
