Transformer-based Models to Deal with Heterogeneous Environments in Human Activity Recognition
Sannara EK, Fran\c{c}ois Portet, Philippe Lalanda

TL;DR
This paper introduces Transformer-based models, HART and MobileHART, for human activity recognition that are more robust to data heterogeneity and device variability in real-world scenarios, outperforming previous models with fewer resources.
Contribution
The paper proposes two novel sensor-wise Transformer architectures, HART and MobileHART, specifically designed to handle data heterogeneity in human activity recognition tasks.
Findings
HART and MobileHART outperform previous architectures in accuracy.
They require fewer floating point operations and parameters.
They are more robust to device and position changes.
Abstract
Human Activity Recognition (HAR) on mobile devices has been demonstrated to be possible using neural models trained on data collected from the device's inertial measurement units. These models have used Convolutional Neural Networks (CNNs), Long Short-Term Memory (LSTMs), Transformers or a combination of these to achieve state-of-the-art results with real-time performance. However, these approaches have not been extensively evaluated in real-world situations where the input data may be different from the training data. This paper highlights the issue of data heterogeneity in machine learning applications and how it can hinder their deployment in pervasive settings. To address this problem, we propose and publicly release the code of two sensor-wise Transformer architectures called HART and MobileHART for Human Activity Recognition Transformer. Our experiments on several publicly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsContext-Aware Activity Recognition Systems · Human Pose and Action Recognition · Non-Invasive Vital Sign Monitoring
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Softmax · Dropout · Adam · Residual Connection · Label Smoothing · Dense Connections
