Yet it moves: Learning from Generic Motions to Generate IMU data from YouTube videos
Vitor Fortes Rey, Kamalveer Kaur Garewal, Paul Lukowicz

TL;DR
This paper introduces a method to generate synthetic IMU sensor data from YouTube videos using a regression model trained on generic motions, enabling improved human activity recognition without extensive real sensor data collection.
Contribution
It presents a novel approach to simulate IMU data from videos, bridging the gap between video data and sensor-based activity recognition, and demonstrating near real-data performance.
Findings
Synthetic data achieves within 10% of real sensor data F1 score.
Including small real sensor datasets enhances model calibration.
Generated data can compensate for limited real sensor data availability.
Abstract
Human activity recognition (HAR) using wearable sensors has benefited much less from recent advances in Machine Learning than fields such as computer vision and natural language processing. This is to a large extent due to the lack of large scale repositories of labeled training data. In our research we aim to facilitate the use of online videos, which exists in ample quantity for most activities and are much easier to label than sensor data, to simulate labeled wearable motion sensor data. In previous work we already demonstrate some preliminary results in this direction focusing on very simple, activity specific simulation models and a single sensor modality (acceleration norm)\cite{10.1145/3341162.3345590}. In this paper we show how we can train a regression model on generic motions for both accelerometer and gyro signals and then apply it to videos of the target activities to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsContext-Aware Activity Recognition Systems · Human Pose and Action Recognition
