Large Language Model-Guided Semantic Alignment for Human Activity Recognition

Hua Yan; Heng Tan; Yi Ding; Pengfei Zhou; Vinod Namboodiri; Yu Yang

arXiv:2410.00003 · cs.CV · October 21, 2025

TL;DR

LanHAR utilizes Large Language Models to generate semantic interpretations of sensor data and labels, effectively addressing dataset heterogeneity and improving human activity recognition across datasets and new activities.

Contribution

This paper introduces LanHAR, a novel LLM-guided semantic alignment approach for HAR that enhances cross-dataset transferability and new activity recognition.

Findings

1. Outperforms state-of-the-art methods on five datasets

2. Effectively bridges cross-dataset heterogeneity

3. Enables recognition of new activities with high accuracy

Abstract

Human Activity Recognition (HAR) using Inertial Measurement Unit (IMU) sensors is critical for applications in healthcare, safety, and industrial production. However, variations in activity patterns, device types, and sensor placements create distribution gaps across datasets, reducing the performance of HAR models. To address this, we propose LanHAR, a novel system that leverages Large Language Models (LLMs) to generate semantic interpretations of sensor readings and activity labels for cross-dataset HAR. This approach not only mitigates cross-dataset heterogeneity but also enhances the recognition of new activities. LanHAR employs an iterative re-generation method to produce high-quality semantic interpretations with LLMs and a two-stage training framework that bridges the semantic interpretations of sensor readings and activity labels. This ultimately leads to a lightweight sensor…
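One common way to picture how aligning sensor readings with activity labels in a semantic space could support new activities is nearest-label matching. The following is a minimal sketch, assuming sensor windows and label descriptions have already been encoded into vectors of the same dimension. The function names (`cosine`, `classify`) and the toy vectors are hypothetical and are not taken from the paper; LanHAR's actual encoders, prompts, and training stages are not shown.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def classify(sensor_embedding, label_embeddings):
    """Return the label whose embedding is most similar to the sensor embedding."""
    return max(
        label_embeddings,
        key=lambda name: cosine(sensor_embedding, label_embeddings[name]),
    )


# Toy, hand-picked vectors (assumption: illustrative only, not real embeddings).
label_embeddings = {
    "walking": [1.0, 0.0, 0.0],
    "sitting": [0.0, 1.0, 0.0],
}
window = [0.1, 0.2, 0.9]  # hypothetical embedding of one IMU window

# With only the known labels, the window is forced onto the closest one.
before = classify(window, label_embeddings)

# Adding an embedding for a new activity label requires no change to classify().
label_embeddings["cycling"] = [0.0, 0.0, 1.0]
after = classify(window, label_embeddings)

print(before, after)
```

In this sketch a new activity is handled by adding one more label vector, which is the general appeal of matching in a shared space; whether LanHAR performs inference exactly this way is not stated in the truncated abstract above.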

Taxonomy

Topics: Human Pose and Action Recognition · Context-Aware Activity Recognition Systems