Transformer-based Models to Deal with Heterogeneous Environments in Human Activity Recognition

Sannara EK; Fran\c{c}ois Portet; Philippe Lalanda

arXiv:2209.11750·cs.CV·August 26, 2025·20 cites

Transformer-based Models to Deal with Heterogeneous Environments in Human Activity Recognition

Sannara EK, Fran\c{c}ois Portet, Philippe Lalanda

PDF

Open Access 1 Repo

TL;DR

This paper introduces Transformer-based models, HART and MobileHART, for human activity recognition that are more robust to data heterogeneity and device variability in real-world scenarios, outperforming previous models with fewer resources.

Contribution

The paper proposes two novel sensor-wise Transformer architectures, HART and MobileHART, specifically designed to handle data heterogeneity in human activity recognition tasks.

Findings

01

HART and MobileHART outperform previous architectures in accuracy.

02

They require fewer floating point operations and parameters.

03

They are more robust to device and position changes.

Abstract

Human Activity Recognition (HAR) on mobile devices has been demonstrated to be possible using neural models trained on data collected from the device's inertial measurement units. These models have used Convolutional Neural Networks (CNNs), Long Short-Term Memory (LSTMs), Transformers or a combination of these to achieve state-of-the-art results with real-time performance. However, these approaches have not been extensively evaluated in real-world situations where the input data may be different from the training data. This paper highlights the issue of data heterogeneity in machine learning applications and how it can hinder their deployment in pervasive settings. To address this problem, we propose and publicly release the code of two sensor-wise Transformer architectures called HART and MobileHART for Human Activity Recognition Transformer. Our experiments on several publicly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

getalp/lightweight-transformer-models-for-har-on-mobile-devices
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsContext-Aware Activity Recognition Systems · Human Pose and Action Recognition · Non-Invasive Vital Sign Monitoring

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Softmax · Dropout · Adam · Residual Connection · Label Smoothing · Dense Connections