TL;DR
This survey explores how foundation models, pretrained at scale, are transforming sensor-based human activity recognition by addressing challenges like data scarcity and heterogeneity, and outlines future research directions.
Contribution
It provides a comprehensive taxonomy and analysis of emerging foundation models for sensor-based HAR, identifying key design patterns and development trajectories.
Findings
Identification of three main development trajectories for foundation models in HAR.
Analysis of design patterns and trade-offs across nine technical axes.
Highlighting open challenges and future directions in data, personalization, and deployment.
Abstract
Sensor-based Human Activity Recognition (HAR) underpins many ubiquitous and wearable computing applications, yet current models remain limited by scarce labels, sensor heterogeneity, and weak generalization across users, devices, and contexts. Foundation models, which are generally pretrained at scale using self-supervised and multimodal learning, offer a unifying paradigm to address these challenges by learning reusable, adaptable representations for activity understanding. This survey synthesizes emerging foundation models for sensor-based HAR. We first clarify foundational concepts, definitions, and evaluation criteria, then organize existing work using a lifecycle-oriented taxonomy spanning input design, pretraining, adaptation, and utilization. Rather than enumerating individual models, we analyze recurring design patterns and trade-offs across nine technical axes, including…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
