TL;DR
BenchHAR systematically evaluates self-supervised learning methods for sensor-based human activity recognition, revealing insights into their generalization capabilities across diverse datasets and architectures.
Contribution
It introduces a large-scale benchmark framework for assessing SSL methods in HAR and provides practical insights into model and data factors affecting generalization.
Findings
Hybrid SSL paradigms outperform single-method approaches.
CNN encoders learn more generalizable features than other architectures.
Increasing pretraining data improves generalization more than adding labeled data.
Abstract
Human Activity Recognition (HAR) from wearable sensors supports broad healthcare and behavior science applications. However, data heterogeneity and the scarcity of labeled data limit its real-world generalization. Recent advances in self-supervised learning (SSL) in vision and language domains have shown strong capability for learning generalizable representations from unlabeled data. Yet, few studies have systematically compared the generalization performance of SSL methods or explored how to adapt them for generalizable HAR. To address these gaps, we present BenchHAR, a unified framework for evaluating the generalization capability of SSL methods for sensor-based HAR on unseen target distributions. BenchHAR curates a large-scale dataset (~258K samples) and evaluates eight representative SSL methods across 12 encoder-classifier architectures. Our results reveal that existing SSL…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
