RAG-HAR: Retrieval Augmented Generation-based Human Activity Recognition

Nirhoshan Sivaroopan; Hansi Karunarathna; Chamara Madarasingha; Anura Jayasumana; Kanchana Thilakarathna

arXiv:2512.08984·cs.CV·May 12, 2026

RAG-HAR: Retrieval Augmented Generation-based Human Activity Recognition

Nirhoshan Sivaroopan, Hansi Karunarathna, Chamara Madarasingha, Anura Jayasumana, Kanchana Thilakarathna

PDF

TL;DR

RAG-HAR is a training-free, retrieval-augmented framework using large language models for human activity recognition, achieving state-of-the-art results without dataset-specific training.

Contribution

It introduces a novel retrieval-based approach leveraging LLMs and prompt optimization for accurate HAR without training or fine-tuning.

Findings

01

Achieves state-of-the-art performance on six HAR benchmarks.

02

Operates without model training or fine-tuning.

03

Capable of recognizing unseen human activities.

Abstract

Human Activity Recognition (HAR) underpins applications in healthcare, rehabilitation, fitness tracking, and smart environments, yet existing deep learning approaches demand dataset-specific training, large labeled corpora, and significant computational resources.We introduce RAG-HAR, a training-free retrieval-augmented framework that leverages large language models (LLMs) for HAR. RAG-HAR computes lightweight statistical descriptors, retrieves semantically similar samples from a vector database, and uses this contextual evidence to make LLM-based activity identification. We further enhance RAG-HAR by first applying prompt optimization and introducing an LLM-based activity descriptor that generates context-enriched vector databases for delivering accurate and highly relevant contextual information. Along with these mechanisms, RAG-HAR achieves state-of-the-art performance across six…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.