Video-based Pose-Estimation Data as Source for Transfer Learning in Human Activity Recognition
Shrutarv Awasthi, Fernando Moya Rueda, Gernot A. Fink

TL;DR
This paper explores using video-based human pose estimation datasets as a source for transfer learning to improve human activity recognition from on-body device data, addressing data scarcity issues.
Contribution
It introduces a novel transfer learning approach leveraging pose estimation datasets to enhance HAR performance with limited annotated data.
Findings
Transfer learning from pose estimation datasets improves HAR accuracy.
Pre-training on video datasets benefits on-body device HAR tasks.
The approach outperforms baseline models without transfer learning.
Abstract
Human Activity Recognition (HAR) using on-body devices identifies specific human actions in unconstrained environments. HAR is challenging due to the inter and intra-variance of human movements; moreover, annotated datasets from on-body devices are scarce. This problem is mainly due to the difficulty of data creation, i.e., recording, expensive annotation, and lack of standard definitions of human activities. Previous works demonstrated that transfer learning is a good strategy for addressing scenarios with scarce data. However, the scarcity of annotated on-body device datasets remains. This paper proposes using datasets intended for human-pose estimation as a source for transfer learning; specifically, it deploys sequences of annotated pixel coordinates of human joints from video datasets for HAR and human pose estimation. We pre-train a deep architecture on four benchmark video-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
