Multimodal Generation of Novel Action Appearances for Synthetic-to-Real Recognition of Activities of Daily Living
Zdravko Marinov, David Schneider, Alina Roitberg, Rainer Stiefelhagen

TL;DR
This paper introduces a novel domain generation framework that creates synthetic activity appearances from existing video data to improve activity recognition across domain shifts, especially from simulation to real-world scenarios.
Contribution
The authors propose a new method to generate diverse activity domains from single videos, enhancing model robustness to domain shifts without relying on multi-sensor data.
Findings
Achieved state-of-the-art results on the Sims4Action benchmark.
Generated activity domains significantly improved cross-domain recognition accuracy.
Framework effectively preserves activity semantics during domain generation.
Abstract
Domain shifts, such as appearance changes, are a key challenge in real-world applications of activity recognition models, which range from assistive robotics and smart homes to driver observation in intelligent vehicles. For example, while simulations are an excellent way of economical data collection, a Synthetic-to-Real domain shift leads to a > 60% drop in accuracy when recognizing activities of Daily Living (ADLs). We tackle this challenge and introduce an activity domain generation framework which creates novel ADL appearances (novel domains) from different existing activity modalities (source domains) inferred from video training data. Our framework computes human poses, heatmaps of body joints, and optical flow maps and uses them alongside the original RGB videos to learn the essence of source domains in order to generate completely new ADL domains. The model is optimized by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsContext-Aware Activity Recognition Systems · Human Pose and Action Recognition · IoT-based Smart Home Systems
