Scaling Human Activity Recognition: A Comparative Evaluation of Synthetic Data Generation and Augmentation Techniques
Zikang Leng, Archith Iyer, Thomas Pl\"otz

TL;DR
This paper compares synthetic IMU data generation methods and traditional augmentation for human activity recognition, showing virtual data enhances model performance especially with limited data, and provides practical guidance on strategy selection.
Contribution
It provides a comprehensive comparison of cross-modality transfer and classical augmentation techniques for HAR, highlighting their relative effectiveness and practical considerations.
Findings
Virtual IMU data improves HAR accuracy over real or augmented data.
Synthetic data is especially beneficial in limited-data scenarios.
The study offers practical guidance for selecting data generation strategies.
Abstract
Human activity recognition (HAR) is often limited by the scarcity of labeled datasets due to the high cost and complexity of real-world data collection. To mitigate this, recent work has explored generating virtual inertial measurement unit (IMU) data via cross-modality transfer. While video-based and language-based pipelines have each shown promise, they differ in assumptions and computational cost. Moreover, their effectiveness relative to traditional sensor-level data augmentation remains unclear. In this paper, we present a direct comparison between these two virtual IMU generation approaches against classical data augmentation techniques. We construct a large-scale virtual IMU dataset spanning 100 diverse activities from Kinetics-400 and simulate sensor signals at 22 body locations. The three data generation strategies are evaluated on benchmark HAR datasets (UTD-MHAD, PAMAP2,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsContext-Aware Activity Recognition Systems · Human Pose and Action Recognition · Inertial Sensor and Navigation
