HARMES: A Multi-Modal Dataset for Wearable Human Activity Recognition with Motion, Environmental Sensing and Sound

Robin Burchard; Pascal-Andr\'e Br\"uckner; Marius Bock; Juergen Gall; Kristof Van Laerhoven

arXiv:2605.02596·cs.LG·May 5, 2026

HARMES: A Multi-Modal Dataset for Wearable Human Activity Recognition with Motion, Environmental Sensing and Sound

Robin Burchard, Pascal-Andr\'e Br\"uckner, Marius Bock, Juergen Gall, Kristof Van Laerhoven

PDF

1 Repo

TL;DR

HARMES is a novel multi-modal wearable dataset combining motion, environmental, and audio sensors, enabling improved activity recognition for daily living activities through extensive benchmarking and analysis.

Contribution

This work introduces HARMES, the first dataset combining motion, environmental, and audio data for wearable human activity recognition, with extensive benchmarks and analysis.

Findings

01

Modality contributions vary by activity and provide complementary information.

02

HARMES is nearly six times larger than previous wrist-inertial-acoustic datasets.

03

Cross-subject generalization improves with multi-modal data.

Abstract

With each sensing modality exhibiting inherent strengths and limitations, multi-modal approaches for wearable Human Activity Recognition (HAR) are becoming increasingly relevant -- particularly for recognizing Activities of Daily Living (ADLs), where individual modalities often produce ambiguous signals for similar or complex activities. This work introduces HARMES, a multi-modal wearable dataset combining three wrist-recorded modalities: motion sensing via an Inertial Measurement Unit (IMU), atmospheric environmental sensors (humidity, temperature, and pressure), and audio. Collected from 20 participants performing household activities in their own homes, HARMES totals over 80 hours of recorded data, with approximately three hours of labeled activity data per participant across 15 ADL classes. To the best of our knowledge, HARMES is the first dataset to combine this particular sensor…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.